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PRIMARY NUCLEOTIDE SEQUENCE OF THE SHRIMP WHITE SPOT 
BACILLIFORM VIRUS (WSBV), DISCOVERY SYSTEMS CONTAINING THIS 
SEQUENCE AND DETECTION KITS AND ANTIVIRAL TARGETS FOR 
DETECTION AND CONTROLLING SHRIMP VIRUS OUTBREAK AND SPREAD 

RELATED APPLICATION 

The present application claims priority to Chinese patent application No. 99124717.5, 
filed November 24, 1999. 

FIELD OF THE INVENTION 

The present invention is in the field of genomic discovery systems. The present 
invention specifically provides the complete shrimp white spot bacilliform virus (WSBV) 
genome and isolated fragments thereof in a form that is commercially useful, including 
detection kits, antiviral agents, reagents such as nucleic acid arrays, and computer-based 
systems. 

BACKGROUND OF THE INVENTION 

The shrimp and prawn (hereafter collectively referred to as shrimp) industry is a rapid 
growth worldwide industry worth billions of dollars. Worldwide, the shrimp industry relies 
on both the harvesting of wild shrimp and aquaculture, which is the controlled farming of 
fish, shellfish, and plants. Aquaculture, particularly aquaculture of shrimp, is growing rapidly 
due to increasing consumer demand for shrimp and other seafood. Aquaculture has been 
expanding at an annual rate far surpassing the growth of livestock meat, capture fisheries, and 
agricultural production. The aquaculture industry delivers high-quality protein for human and 
animal consumption and provides a substantial source of income and employment, 
particularly for developing countries. Aquaculture accounts for nearly 20 percent of the 
world's harvest of fish, shellfish, and seaweeds. The total worldwide value of giant tiger 
prawn production is the greatest of any aquaculture species. Aquaculture of giant tiger prawn 
significantly contributes to many Asian and Latin American economies, where the majority 
of giant tiger prawn production occurs. Shrimp accounted for approximately a quarter of the 
overall value of Asian fish exports in 1996. In the United States, harvesting and processing 
shrimp, including both aquaculture and harvesting wild shrimp, is a $3 billion dollar a year 
industry that employs over 1 1,000 people. Furthermore, shrimp aquaculture in the U.S. has 
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the potential to become a high-growth business. The risk of viral diseases to cultured shrimp 
is the primary obstacle to the growth of the shrimp aquaculture industry. 

Shrimp viral disease is a major worldwide concern of the shrimp industry. Both 
aquaculture and wild shrimp are vulnerable to viral infection, which can lead to devastating 
economic consequences. Furthermore, shrimp viruses may affect other crustaceans such as 
crabs and crayfish. Drastic declines in the populations of wild shrimp or other crustaceans 
due to viral disease can also dramatically affect other species in the food chain that depend 
upon shrimp for food and can lead to severe ecological consequences. 

Major pathogenic shrimp viruses include White Spot Bacilliform Virus (WSBV), 
Infectious Hypodermal and Hematopoietic Virus (IHHNV), Tara Syndrome Virus (TSV), and 
Yellow Head Virus (YHV). IHHNV and TSV are endemic throughout South and Central 
America, while WSBV and YHV are endemic throughout Asia. All U.S. shrimp species are 
susceptible to infection and disease from one or more of these four viruses. Susceptibility of 
U.S. species of shrimp to these viruses may lead to restrictions on the importation of foreign 
shrimp into the U.S. 

Past incidents of viral outbreaks illustrate the devastating affects that a viral outbreak 
can have on the shrimp industry. An outbreak of IHHNV in 1 987 in the Gulf of California 
shrimp fishery reduced shrimp to levels that could not support commercial harvests until 
1994. Outbreaks in 1995 and 1996 on U.S. shrimp farms caused a 50 to 95 percent loss of 
production at affected farms. Shrimp exports from China to the U.S. dropped 75% between 
1990 and 1995 due to infection by WSBV. 

WSBV is regarded as one of the most highly pathogenic viruses of penaeid shrimp. 
No uniform name exists for WSBV. It is also known as White Spot Syndrome Virus 
(WSSV), Prawn White Spot Bacilliform Virus (PWSBV), White Spot Baculiform Virus 
(WSBV), Baculoviral Hypodermal and Hematopoietic Necrosis Virus (HHNBV), Rod- 
shaped Nuclear Virus of Penaeus japonicus (RV-PJ), Systemic Ectodermal and Mesodermal 
Bacilliform Virus (SEMBV). Other acronyms include WSV, WSDV, and LNBV. The virus is 
a non-occluded, circular, double-stranded DNA bacilliform virus with a genome of 
approximately 300kb. WSBV virions are enveloped nucleocapsids with bacilliform 
morphology and a tail-like extension at one end. 

White Spot Syndrome, caused by WSBV, is also known by such names as Red 
Disease, China Virus Disease, and Shrimp Explosive Epidemic Disease. Infected shrimp 
display rapid reduction in food consumption and lethargy. Gross observations include a loose 
cuticle and a red color to the entire body and appendages along with small subcutaneous 
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white spots. Histological examination reveals prominent intranuclear inclusion bodies in the 
cuticular epithelium, subcutis, and connective tissues. Cumulative mortality rates reach 100% 
within 3 to 10 days of the onset of clinical signs. No significant resistance to WSBV has been 
reported. All native U.S. species of shrimp are susceptible to WSBV infection under 
experimental conditions. WSBV is widely spread throughout most of the shrimp growing 
regions of Asia and the Indo-Pacific, including China, Japan, Korea, Thailand, Indonesia, 
Taiwan, Vietnam, Malaysia, and India. Lethal outbreaks of WSBV virus have recently been 
recorded in Texas and South Carolina. Furthermore, the virus has been shown to infect other 
crustaceans including amphipods, ostracods, swimming crabs, crayfish, copepods, and shore 
flies. The possibility exists that these organisms could act as a reservoir through which 
further shrimp infection, or infection of other species, can occur. 

In view of the serious economic and ecological risks posed to the worldwide shrimp 
industry and shrimp populations by viruses, particularly WSBV, a strong need exists for 
antiviral agents and detection systems. Detection systems should be highly specific, rapid, 
and sensitive. To facilitate development of antiviral agents and detection systems, knowledge 
of the complete genomic sequence and protein encoding sequences of WSBV is needed. Prior 
to the present invention, very few reports on WSBV genomic sequences existed, and only a 
small fraction of the entire WSBV genome had been sequenced. To date, only six WSBV 
sequenced have been patented, published or stored in public genome databases, such as 
Genbank. AH sequences to date are short sequences ranging in length from 420 bp to 2424 
bp. J.S. Kim and others from Korea have sequenced 2424bp (wsu 92007,1997) and 420bp 
(wsu 89843, 1997); K.Mitsuo and others from Japan have sequenced two fragments, 1447bp 
(PN JP 1997201 196-A/2) and 1461bp (FN JP 1997201 196-A/l) in length; Chufang Luo et.al. 
from Taiwan sequenced 1461 bp (PMU50923, 1996); L.M. Nunan et. al. from the United 
States has reported 868bp sequence in J. Virological Methods (1997(63): pi 93-201). These 
known sequences are no more than lOkb in length all together. In addition, these sequences 
are randomly sequenced with no systematic analysis; therefore determining sequence 
function is difficult. Since the complete genome of WSBV is more than 300kb in length, the 
analysis of the complete genomic DNA sequence and it's complete structure, the 
determination of the expressed sequences, and prediction of the functions of encoded proteins 
are all new scientific achievements. These achievements are the basis for the present 
invention. The present invention is directed to providing the complete primary nucleotide 
sequence of WSBV and isolated fragments thereof, protein encoding sequences of WSBV, 
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and antiviral agents and detection systems based on the nucleotide and protein encoding 
sequences provided by the present invention. 

DNA Viruses 

Generally, transcription of a DNA virus genome occurs in the nucleus of the host cell, 
utilizing host cell polymerases and other host enzymes for viral mRNA synthesis and viral 
replication. Viral gene transcription is modulated by the interaction of specific DNA-binding 
proteins with promoter and enhancer elements in the viral genome. Commonly, the viral 
promoter and enhancer elements are similar in sequence to those of the host cell in order to 
allow the host cell's transcriptional activation factors and DNA-dependent RNA polymerase 
to bind the viral control elements. Cells from different tissues or species express different 
DNA-binding proteins, and this is a major factor in determining which species, and which 
cells and tissues of that species, that the virus can infect. 

Viruses, in general, depend on the host cell ribosomes, transfer RNA (tRNA), and 
mechanisms of posttranslational modification to produce their proteins. Generally, viral 
mRNA encoding non-structural viral proteins, such as DNA-binding proteins and enzymes, 
are transcribed first. These are followed by late viral gene products encoding structural 
proteins. 

Viruses utilize various methods to promote preferential translation of their viral 
mRNA over host cell mRNA. In some instances, concentration of viral mRNA in the host 
cell is so large that it occupies most of the cell's ribosomes, thereby preventing translation of 
host cell mRNA. Viruses may inhibit synthesis and/or induce degradation of the cell's nucleic 
acids. Many viruses increase the permeability of the host cell membrane, thereby reducing 
the ribosomal affinity for most cellular mRNA. 

Viral DNA replication begins at a unique sequence in the genome called the origin of 
replication, or ori. The ori is recognized by viral or host nuclear factors and DNA-dependent 
DNA polymerase. Viral DNA synthesis is semi-conservative and a primer is required by the 
DNA polymerase to initiate synthesis of the new DNA molecule. 

Viral Screening Tests and Antiviral Agents 

Viral screening tests and detection kits, such as nucleic acid arrays, can be developed 
based on either nucleic acids or polypeptides provided by the present invention. A nucleic 
acid probe to a virus specific nucleotide sequence, or an antibody to a virus specific protein, 
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is introduced into contact with a sample, such as a sample of shrimp cells, whereby the 
presence of the virus is detected using an assay system. 

Antiviral agents, either nucleic acid or protein-based, directly interfere with viral 
function or preferably, interfere with viral replication to stop or prevent spread of the virus in 
a population, such as in a population of shrimp. Knowledge of the nucleic acid and protein 
sequences of the virus allows antiviral agents to be designed to attack a number of viral 
targets necessary for viral replication or function, such as viral encoded enzymes or structural 
proteins. Attachment of the virus to the host cell is the first step in viral replication and is 
mediated by the interaction of a viral attachment protein and a host cell surface receptor. This 
interaction can be blocked by neutralizing antibodies, which bind to and coat the virion, or 
receptor antagonists which are peptide or carbohydrate analogues of the viral attachment 
protein and competitively block the interaction of the virus with the cell. Agents can be 
designed that bind to the viral attachment protein and prevent penetration of the virus into the 
cytoplasm or nucleus of the host cell and/or uncoating of the virus. These agents thereby 
prevent the virus from delivering its genome into the host cell. Viral mRNA expression and 
utilization can be targeted with anti-viral agents. Antisense oligonucleotides can be designed 
to bind to newly transcribed viral RNA and thereby prevent the viral RNA from being 
processed to mRNA in the nucleus, delivered to the cytoplasm, and bound to the ribosome. 
Many antiviral drugs are nucleoside analogues, which inhibit viral polymerases. Viral 
polymerases are often less specific for substrate than are host polymerases, therefore the viral 
polymerase will often bind a nucleotide analogue with a modified base and/or sugar several 
hundredfold better than the host enzyme. Antiviral drugs can therefore be preferentially 
incorporated into the viral genome. DNA viruses, such as WSBV, are particularly susceptible 
to these types of drugs due to the extent and rapid rate of nucleotide incorporation during 
viral replication. Inhibition of posttranslational modification of viral proteins, such as 
phosphorylation, may also inhibit viral replication. 



SUMMARY OF THE INVENTION 

The present invention is based on the sequencing and assembly of the WSBV 
genome. The present invention provides the primary nucleotide sequence of the WSBV 
genome (SEQ ID NO: 1) and predicted transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 
282, 284, 286-293: See the Sequence Listing and the Figure Sheets for both the genomic and 
transcript sequences) and polymorphic sites on these transcripts summarized in Table 1 
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hereinafter, and protein encoded sequence produced from each of the genes found in the 
WSBV genome. This information is provided in the form of sequences and annotation 
information and can be used to generate computer based discovery systems, nucleic acid 
detection reagents and kits such as nucleic acid arrays, protein based detection kits, and 
antiviral targets. 

The present invention provides these nucleotide sequences of the WSBV genome, and 
representative fragments thereof, in a form that can be used, analyzed, and commercialized. 
For example, the present invention provides the nucleic acid sequences as contiguous strings 
of primary sequences in a form readable by computers, such as recorded on computer 
readable media, e.g., magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as 
RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. 
The present invention specifically provides a Sequence Listing in computer readable form 
stored on such media. Such compositions are useful in the discovery of drug and antiviral 
targets. 

The present invention further provides systems, particularly computer-based systems 
that contain the primary sequence information of the present invention stored in data storage 
means. Such systems are designed to identify commercially important fragments of the 
WSBV genome. 

Another embodiment of the present invention is directed to isolated fragments, and 
collections of fragments, of the WSBV genome. The fragments of the WSBV genome 
include, but are not limited to, fragments that encode peptides, hereinafter open reading 
frames (ORFs) and fragments that modulate the expression of an operably linked ORF, 
hereinafter expression modulating fragments (EMFs). The ORFs are provided in the 
Sequence Listing and in Figure 3. 

The present invention further includes kits, such as nucleic acid arrays, detection 
reagents and microfluidic devices, that comprise one or more fragments of the WSBV 
genome of the present invention, particularly ORFs. The kits, such as arrays, can be used to 
track the expression of many genes, even all genes, or rationally selected subsets thereof, 
contained in the WSBV genome. 

The identification of the entire coding set of sequences from the genome of WSBV 
will be of great value to all laboratories working with this organism and for a variety of 
commercial and ecological purposes. Many fragments of the WSBV genome will be 
immediately identified by similarity searches against protein and nucleic acid databases and 
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by identifying structural motifs present in protein domains and will be of immediate value to 
WSBV researchers and for commercial value for controlling WSBV infection in shrimp 
populations. A specific example concerns viral envelope proteins, many of which interact 
with host cells. Proteins of this family can readily be configured into screens and assays for 
detecting chemical modulators of the protein activity. The biological significance of this and 
other families of proteins for controlling viral replication is well known. Many of the known 
antiviral agents modulate the activity of these types of proteins. The WSBV genome will 
allow one to identify all potential antiviral targets. 

The present invention is further directed to isolated WSBV proteins encoded by the 
ORFs of the present invention. A variety of methodologies known in the art can be utilized to 
obtain any one of the proteins of the present invention. The amino acid sequence can be 
synthesized using commercially available peptide synthesizers. In an alternative method, the 
viral protein can be purified from cells infected with the virus. 

The invention further provides antibodies that selectively bind one of the WSBV 
proteins encoded by the present invention. Antibodies have use in viral detection and control 
and can be generated using the protein encoding sequences provided by the present invention. 
Such antibodies include both monoclonal and polyclonal antibodies, and fragments thereof. 
The invention further provides hybridomas capable of producing the above-described 
monoclonal antibodies. 

The present invention provides methods of identifying WSBV in a test sample, such 
as a sample of shrimp. Such methods comprise incubating cells extracted from the test 
sample with one or more of the antibodies or probes based on the nucleic acid sequences 
provided by the present invention under conditions that allow a skilled artisan to determine if 
the test sample contains the ORF or product produced therefrom. 

Using the isolated proteins of the present invention, the present invention further 
provides methods of obtaining and identifying agents capable of binding to a protein encoded 
by one of the ORFs of the present invention. Specifically, such agents include antibodies, 
peptides, carbohydrates, pharmaceutical agents and the like. Such methods comprise the steps 
of contacting an agent with an isolated protein encoded by one of the ORFs of the present 
invention and determining whether the agent binds to said protein. 

DESCRIPTION OF THE FIGURE SHEETS 

Figure 1 provides a block diagram of a computer system 1 02 that can be used to 
implement the computer-based systems of the present invention. 
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Figure 2 (Sheets 1-40) provides the primary genomic sequence of WSBV. 
Figure 3 (Sheets 1-160) provides: 

1 ) the predicted transcript sequence of the WSBV gene and starting ATG 
site (SEQ ID NOS: 2, 4, 6, 8 . . . 280, 282, 284, 286-293); 

2) the predicted protein sequence of the WSBV gene (SEQ ID NOS: 3, 5, 7, 
9 . . .281,283, 285); 

3) results of a BLAST query run using default parameters that shows 
proteins producing significant alignments with the predicted WSBV 
protein sequence of the present invention. 

4) comments 

5) TaqMan primer/probe sets. Oligonucleotide sequences useful as primers 
and/or probes for amplifying and/or screening for the WSBV genes provided 
by the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

General Description 

The present invention is based on the sequencing and assembly of the WSBV 
genome. In this process, the primary nucleotide sequence of 5795 nucleic acid fragments was 
determined. These fragments were assembled into a single contiguous sequence of 305,107 
bp. After assembly, the sequences were analyzed with various computer packages and 
compared with all external data sources. The result of this analysis was the identification of 
1 50 predicted genes/transcripts contained in the WSBV genome. The present invention 
provides the genomic nucleic acid sequence of WSBV (SEQ ID NO: 1), see Figure 2, Sheets 
1-40, as well as the predicted gene structure of all 150 identified genes (SEQ ID NOS: 2, 4, 
6... 280, 282, 284, 286-293 ) and polymorphic sites on these transcripts summarized in Table 
1, and predicted amino acid sequences of all of the encoded proteins (SEQ ID NOS: 3, 5, 
7... 281, 283, 285), see Figure 3, sheets 1-160. 
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Table 1 . - Summary of SBV Polymorphic Sites 



Sites 


Po 1 ymorph i sm 


13091 


T/C 


26054 


T/G 


26074 


C/T 


48030 


T/C 


48134 


T/C 


50395 


C/T 


76148 


A/G 


80650 


T/A 


84076 


*/TGC 


89028 


A/* 


93522 


T/G 


93595 


54xl0=540bp repetitive sequences/* 




Within repeats: 




89 G/T 




143 G/T 




197 G/T 




251 G/T 




305 G/T 


97419 


CC/** 


105400 


C/T 


108024 


A/C 


108027 


G/* 


126375 


* / GGAAGAAGAAGAGGAAGA 


133906 


*/G/GG 


139662 


G/T 


142288 


C/T 


166492 


TGT/*** 


180850 


C/T 


181761 


TCC/*** 


200713 


G/A 


214165 


C/T 


230870 


G/A 


232887 


G/A 


238111 


G/A 


247636 


G/T 


258606 


***/CTA/CTACTA 


271845 


C/T 


272344 


A/C 


272751 


C/T 


273191 


G/A 


282103 


G/* 


282189 


G/A 



The nucleotide sequences of the present invention, or representative fragments 
thereof, are provided in a form that can be readily used, analyzed, and interpreted by a skilled 
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artisan. In one embodiment, the sequences are provided as contiguous strings of primary 
sequence information corresponding to the nucleotide sequences provided in the figures. 

As used herein, a "representative fragment of the nucleotide sequence provided 
herein" refers to any portion of these sequences that are not presently represented within a 
publicly available database. Preferred representative fragments of the present invention are 
WSBV open reading frames and expression modulating fragments (ORFs and EMFs 
respectively, see figure 3 and below). 

The nucleotide sequence information provided herein was obtained by sequencing the 
WSBV genome using a shotgun sequencing method known in the art. WSBV genomic DNA 
was initially obtained for sequencing by extraction and purification of viral DNA from 
infected shrimp tissues using the method of Yang et al. {J, Virological Methods, 67:1-4 
(1997)), which is hereby incorporated by reference. The nucleotide sequences provided 
herein are highly accurate, although not necessarily a 100% perfect, representation of the 
nucleotide sequence of the WSBV genome. 

Using the information provided herein together with routine cloning and sequencing 
methods, one of ordinary skill in the art is able to identify, clone and sequence all 
"representative fragments" of interest including open reading frames (ORFs) encoding a large 
variety of WSBV proteins. In very rare instances, this may reveal a nucleotide sequence 
error present in the nucleotide sequence disclosed herein. Thus, once the present invention is 
made available (i.e., the information in the Sequence Listing and figures in a useable form), 
resolving a rare sequencing error would be well within the skill of the art. Nucleotide 
sequence editing software is publicly available. 

Even if all of the very rare sequencing errors in the sequences herein disclosed were 
corrected, the resulting nucleotide sequence would still be at least 90% identical, and more 
likely 99% identical, and most likely 99.99% identical to the nucleotide sequence provided 
herein. 

Thus, the present invention further provides nucleotide sequences that are at least 
90% identical, or greater, to the nucleotide sequences of the present invention in a form 
which can be readily used, analyzed and interpreted by the skilled artisan. Methods for 
determining whether a nucleotide sequence is at least 90% identical to the nucleotide 
sequence of the present invention are routine and readily available to the skilled artisan. For 
example, the well known BLAST algorithm can be used to generate the percent identity of 
nucleotide sequences. 
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The present invention further provides a prediction of all of the genes within the 
WSBV genome. This information is provided in Figure 3. The information in the figures can 
be used to generate WSBV detection kits, antiviral agents, expression arrays, microfluidic 
devices, individual gene fragments, proteins, antibodies, promoters, protein and nucleotide 
based viral screens and the like, and to identify commercially important genes and gene 
products. 

Specific Embodiments 
Computer Related Embodiments 

The nucleotide sequences provided in the present invention, a representative fragment 
thereof, or nucleotide sequences at least 90% identical to these sequences, may be "provided" 
in a variety of mediums to facilitate use thereof. As used herein, "provided" refers to a 
manufacture, other than an isolated nucleic acid molecule, that contains a nucleotide 
sequence of the present invention, i.e., the nucleotide sequences provided in the present 
invention, a representative fragment thereof, or nucleotide sequences at least 90% identical to 
these sequences. Such a manufacture provides the WSBV genome or a subset thereof (e.g., a 
WSBV open reading frame (ORF)) in a form that allows a skilled artisan to examine the 
manufacture using means not directly applicable to examining the WSBV genome or a subset 
thereof as it exists in nature or in purified form. 

In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media" 
refers to any medium that can be read and accessed directly by a computer. Such media 
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture comprising 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. One such medium is provided with the present application, namely, the present 
application contains computer readable medium (CD-R) that has the sequence contigs 
provided/recorded thereon in ASCII text format in a Sequence Listing. 

As used herein, "recorded" refers to a process for storing information on computer 
readable medium. A skilled artisan can readily adopt any of the presently known methods for 
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recording information on computer readable medium to generate manufactures comprising 
the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide or amino acid sequence of 
the present invention. The choice of the data storage structure will generally be based on the 
means chosen to access the stored information. In addition, a variety of data processor 
programs and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented in a 
word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as OB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any 
number of data processor structuring formats (e.g. text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence information of 
the present invention. 

By providing the nucleotide sequences of the present invention, a representative 
fragment thereof, or nucleotide sequences at least 90% identical to these sequences, in 
computer readable form, a skilled artisan can routinely access the sequence information for a 
variety of purposes. Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Software which 
implements the BLAST (Altschul et at, J. Mol Biol. 215:403-410 (1990)) and BLAZE 
(Brutlag et at, Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system an 
be used to identify open reading frames (ORFs) within the WSBV genome that contain 
homology to ORFs or proteins from other organisms. Such ORFs are protein-encoding 
fragments within the WSBV genome and are useful in producing commercially important 
proteins such as proteins used as drug or antiviral targets. 

The present invention further provides systems, particularly computer-based systems, 
which contain the sequence information described herein. Such systems are designed to 
identify commercially important fragments of the WSBV genome. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, and 
data storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based system are suitable for use in the present invention. Such system 
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can be changed into a system of the present invention by utilizing the sequence information 
provided on the CD-R, or a subset thereof without any experimentation. 

As stated above, the computer-based systems of the present invention comprise a data 
storage means having stored therein a nucleotide sequence of the present invention and the 
necessary hardware means and software means for supporting and implementing a search 
means. As used herein, "data storage means" refers to memory which can store nucleotide 
sequence information of the present invention, or a memory access means which can access 
manufactures having recorded thereon the nucleotide sequence information of the present 
invention. 

As used herein, "search means" refers to one or more programs that are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the WSBV genome which match a particular target sequence 
or target motif. A variety of known algorithms are disclosed publicly and a variety of 
commercially available software for conducting search means are available and can be used 
in the computer-based systems of the present invention. Examples of such software include, 
but is not limited to, MacPattern (EMBL), BLASTN and BLASTX (NCBIA). A skilled 
artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 
computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid sequence of six or 
more nucleotides or two or more amino acids. A skilled artisan can readily recognize that the 
longer a target sequence is, the less likely a target sequence will be present as a random 
occurrence in the database. The most preferred sequence length of a target sequence is from 
about 1 0 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well 
recognized that searches for commercially important fragments of the WSBV genome, such 
as sequence fragments involved in gene expression and protein processing, may be of shorter 
length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) is chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif There 
are a variety of target motifs known in the art. Protein target motifs include, but are not 
limited to. enzymatic active sites and signal sequences. Nucleic acid target motifs include, 
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but are not limited to, promoter sequences, hairpin structures and inducible expression 
elements (protein binding sequences). 

A variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. A preferred 
format for an output means ranks fragments of the WSBV genome possessing varying 
degrees of homology to the target sequence or target motif. Such presentation provides a 
skilled artisan with a ranking of sequences which contain various amounts of the target 
sequence or target motif and identifies the degree of homology contained in the identified 
fragment. 

A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the WSBV genome. 
Software which implements the BLAST and BLAZE algorithms (Altschul et al., J Mol. Biol. 
215:403-410 (1990)) can be used to identify open reading frames within the WSBV genome. 
A skilled artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer-based systems of the present 
invention. 

One application of this embodiment is provided in Figure 1. Figure 1 provides a 
block diagram of a computer system 102 that can be used to implement the present invention. 
The computer system 102 includes a processor 106 connected to a bus 104. Also connected 
to the bus 104 are a main memory 108 (preferably implemented as random access memory, 
RAM) and a variety of secondary storage devices 1 10, such as a hard drive 112 and a 
removable medium storage device 114. The removable medium storage device 1 14 may 
represent, for example, a floppy disk drive, a CD- ROM drive, a magnetic tape drive, etc. A 
removable storage medium 1 16 (such as a floppy disk, a compact disk, a magnetic tape, etc.) 
containing control logic and/or data recorded therein may be inserted into the removable 
medium storage device 114. The computer system 102 includes appropriate software for 
reading the control logic and/or the data from the removable storage medium 1 16 once 
inserted in the removable medium storage device 1 14. 

The nucleotide sequences of the present invention may be stored in a well known 
manner in the main memory 108, any of the secondary storage devices 1 10, and/or a 
removable storage medium 1 16. Software for accessing and processing the genomic 
sequence (such as search tools, comparing tools, etc.) reside in main memory 108 during 
execution. 
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Biochemical Embodiments 
Nucleic Acid Fragments 

Another embodiment of the present invention is directed to isolated fragments of the 
WSBV genome. The fragments of the WSBV genome of the present invention include, but 
are not limited to, fragments that encode peptides, hereinafter open reading frames (ORFs) 
and fragments which modulate the expression of an operably linked ORF. Some of these 
fragments are identified and described in Figure 3. The isolated nucleic acid molecules of the 
present invention include, but are not limited to, single stranded and double stranded DNA, 
and single stranded RNA. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, 
DNA segments encoding the polypeptides and proteins provided by this invention are 
assembled from fragments of the WSBV genome or single nucleotides, short oligonucleotide 
linkers, or from a series of oligonucleotides, to provide a synthetic nucleic acid molecule. 

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the 
WSBV genome" refers to a nucleic acid molecule possessing a specific nucleotide sequence 
which has been subjected to purification means to reduce, from the composition, the number 
of compounds which are normally associated with the composition. A variety of purification 
means can be used to generate the isolated fragments of the present invention. These include, 
but are not limited to, methods that separate constituents of a solution based on charge, 
solubility, or size. 

In one embodiment, WSBV DNA can be mechanically sheared to produce fragments 
of about 2kb, lOkb, or 15-20 kb in length. These fragments can then be used to generate a 
WSBV library by inserting them into vectors, such as plasmid or lambda vectors, using 
methods well known in the art. Primers flanking each fragment, for example an ORF, can 
then be generated using nucleotide sequence information provided in the present invention. 
PCR cloning can then be used to isolate the ORF from the WSBV DNA library. PCR cloning 
is well known in the art. Thus, given the availability of the present identified gene coding 
sequences of the WSBV genome, it is routine experimentation to isolate any ORF, or other 
fragment of the assembly of the present invention, particularly using the information 
provided in Figure 3. Such fragments can be applied to an array, microfluidic device, or other 
detection kit format and used to detect expression of a viral gene (see below). 

As used herein, an "open reading frame" (ORF) means a series of triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. A 
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skilled artisan can readily identify ORFs in the WSBV genome using the gene coding 
sequences provided herein and/or the computer-based systems of the present invention. 

As used herein, an "expression modulating fragment" (EMF) means a series of 
nucleotide molecules which modulates the expression of an operably linked ORF or another 
EMF. 

As used herein, a viral sequence is said to "modulate the expression of an operably 
linked sequence" when the expression of the sequence is altered by the presence of the EMF. 
EMFs include, but are not limited to, promoters, and promoter modulating sequences 
(inducible elements). One class of viral EMFs are fragments which induce the expression of 
an operably linked viral ORF in response to a specific host regulatory factor or physiological 
event, such as a host anti-viral response, 

EMF sequences can be identified within the WSBV genome by their proximity to the 
ORFs identified using the computer-based systems of the present invention. EMFs may be 
found immediately 5' to the ORF. Alternatively, EMFs can be identified using known EMFs 
as a target sequence or target motif in the computer-based systems of the present invention. 

The presence and activity of an EMF can be confirmed using an EMF trap vector. An 
EMF trap vector contains a cloning site 5' to a marker sequence. A marker sequence encodes 
an identifiable phenotype, such as antibiotic resistance or a complementing nutrition 
auxotrophic factor, which can be identified or assayed when the EMF trap vector is placed 
within an appropriate host under appropriate conditions. An EMF will modulate the 
expression of an operably linked marker sequence. A sequence that is suspected of being an 
EMF is cloned in all three reading frames in one or more restriction sites upstream from the 
marker sequence in the EMF trap vector. The vector is then transformed into an appropriate 
host using known procedures and the phenotype of the transformed host is examined under 
appropriate conditions. 

The sequences falling within the scope of the present invention are not limited to the 
specific sequences herein described, but also include variations thereof. Variations can be 
routinely determined by comparing the sequence provided in the present invention, or a 
representative fragment thereof, with a sequence from another WSBV isolate. Furthermore, 
to accommodate the degeneracy of the genetic code, the invention includes nucleic acid 
molecules coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF, substitution of one codon for another 
that encodes the same amino acid is expressly contemplated. 
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Any specific sequence disclosed herein can be readily screened for errors by 
resequencing a particular fragment, such as an ORF, in both directions (i.e., sequence both 
strands). Alternatively, error screening, or variant detection, can be performed by sequencing 
corresponding polynucleotides of WSB V origin isolated by using part or all of the fragments 
in question as a probe or primer. 

Nucleic Acid Fragment Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, 
chemical intermediates, and in biological assays. The nucleic acid molecules are useful as 
hybridization probes for viral messenger RNA, viral transcript/cDNA, and viral genomic DNA 
to isolate full-length viral cDNA and viral genomic clones encoding the peptides described in 
Figure 3, and for use in viral screens and antiviral agents. 

Oligonucleotide probes have long been used to detect complementary nucleic acid 
sequences in a nucleic acid of interest (the "target" nucleic acid) in the form of detection 
kits/reagents. In some assay formats, the oligonucleotide probe is tethered, i.e., by covalent 
attachment, to a solid support, and arrays of oligonucleotide probes immobilized on solid 
supports have been used to detect specific nucleic acid sequences in a target nucleic acid. 
See, e.g., PCT patent publication Nos. WO 89/10977 and 89/1 1 548. In other formats, the 
detection reagents are supplied in solution. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the figures. However, as discussed, fragments are not to be construed as 
encompassing fragments disclosed prior to the present invention. 

Each of the ORFs of the WSBV genome that can be routinely identified using the 
computer system of the present invention can be used in numerous ways as polynucleotide 
reagents. The sequences can be used as diagnostic probes or diagnostic amplification primers 
to detect the expression of a particular gene or groups of genes. This is particularly useful in 
the form of nucleic acid arrays employing 1 or more, 10 or more, 100 or more, or most to all 
of the WSBV ORFs in a single array. 

The nucleic acid molecules are also useful as primers for PGR to amplify any given 
region of a nucleic acid molecule and are useful for synthesizing antisense molecules of desired 
length and sequence. 

The nucleic acid molecules are useful as primers for the 5' nuclease PCR assay 
(hereafter referred to as the TaqMan assay). The TaqMan assay provides a sensitive and rapid 
means of detecting viral nucleic acid and therefore is well suited for use in viral screening 
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applications such as detection kits. The TaqMan assay detects the accumulation of a specific 
amplified product during PCR. The TaqMan assay utilizes an oligonucleotide probe labeled 
with a fluorescent reporter dye at the 5 ? end of the probe and a quencher dye at the 3* end of 
the probe. During the PCR reaction, the 5 ? nuclease activity of DNA polymerase cleaves the 
probe, thereby separating the reporter dye and the quencher dye and resulting in increased 
fluorescence of the reporter. Accumulation of PCR product is detected directly by monitoring 
the increase in fluorescence of the reporter dye. The 5' nuclease activity of DNA polymerase 
cleaves the probe between the reporter and the quencher only if the probe hybridizes to the 
target and is amplified during PCR. Therefore, only the target sequence of interest is 
detected. 

Preferred TaqMan primer and probe sequences are disclosed in Figure 3. It will be 
apparent to one of skill in the art that the disclosed primers and probes of the present 
invention are useful as diagnostic probes or amplification primers for screening for the 
presence of WSBV in a biological sample or for isolating or screening particular WSBV 
genes. 

The nucleic acid molecules are also useful for expressing antigenic portions of the 
WSBV proteins that can then be used, for example, to develop antibodies to the viral antigens. 

The nucleic acid molecules are also useful as hybridization probes for determining the 
presence, level, form and distribution of WSBV nucleic acid expression. Accordingly, the 
probes can be used to detect the presence of, or to determine levels of, a specific viral nucleic 
acid molecule, either DNA or RNA, in cells and tissues of shrimp or other organisms under 
moderate or stringent conditions. One example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45 °C, followed by one or 
more washes in 0.2 X SSC, 0.1% SDS at 50-65 °C. Examples of moderate to low stringency 
hybridization conditions are well known in the art. Furthermore, probes corresponding to the 
viral peptides described herein can be used to assess expression and/or gene copy number in a 
given infected cell, tissue, or organism. For example, Northern blots can be used for RNA 
detection, Southern blots can be used for DNA detection, and Western blots can be used for 
peptide/protein detection. These uses are relevant for detecting the presence of virus in shrimp 
as well as for monitoring the distribution of virus throughout various cells and tissues of shrimp 
during the course of viral infection. 

In addition, each of the expression modulating fragments (EMFs) can be used in DNA- 
protein binding assays to screen for modulating peptides which may be present in cells and 
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tissues of shrimp or other organisms. This use is relevant for obtaining the specific host 
regulatory factors that interact with promoters in the WSBV genome. 

Nucleic acid expression assays are also useful for drug screening to identify compounds 
that modulate viral nucleic acid expression. The invention thus provides a method for 
identifying a compound that can be used to treat viral infection. The method typically includes 
assaying the ability of the compound to modulate the expression of viral nucleic acid and thus 
identifying a compound that can be used to treat a viral infection. The assays can be performed 
in cell-based and cell-free systems. Cell-based assays include cells infected with virus particles 
or recombinant cells genetically engineered to express specific viral nucleic acid sequences. 
Cell-free assays can be used to detect the ability of a compound to directly bind to a nucleic acid 
fragment or protein. 

The assay for viral nucleic acid expression can involve direct assay of nucleic acid 
levels, such as mRNA levels, or on collateral compounds involved in the signal pathway. 
Furthermore, the expression of host cell genes that are up- or down-regulated in response to the 
viral protein can also be assayed. In this embodiment the regulatory regions of these genes can 
be operably linked to a reporter gene such as luciferase. 

Thus, modulators of viral gene expression can be identified by a method wherein a cell 
infected with virus is contacted with a candidate compound and the expression of viral mRNA 
determined. The level of expression of viral mRNA in the presence of the candidate compound 
is compared to the level of expression of viral mRNA in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of viral nucleic acid expression 
based on this comparison and be used, for example, to disrupt viral replication. When 
expression of viral mRNA is statistically significantly greater in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of viral 
nucleic acid expression. When viral nucleic acid expression is statistically significantly less in 
the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of viral nucleic acid expression. 

The invention further provides methods of treating viral infection, with the nucleic acid 
as a target, using a compound identified through drug screening as a gene modulator to modulate 
viral nucleic acid expression in cells and tissues infected with the virus. Modulation includes 
both up-regulation (i.e. activation or agonization) or down-regulation (suppression or 
antagonization) of nucleic acid expression. Generally, viral nucleic acid expression is down- 
regulated to prevent viral replication and treat viral infection. 
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Alternatively, a modulator for viral nucleic acid expression can be a small molecule or 
drug identified using the screening assays described herein as long as the drug or small molecule 
modulates viral nucleic acid expression in the cells and tissues infected with the virus. 

The nucleic acid molecules are also useful for monitoring the effectiveness of 
modulating compounds on the expression or activity of the viral gene in a treatment regimen. 
Thus, the gene expression pattern can serve as a barometer for the continuing effectiveness of 
treatment with the compound, particularly with compounds to which a virus can develop 
resistance. The gene expression pattern can also serve as a marker indicative of a 
physiological response of the virus to the compound. Accordingly, such monitoring would 
allow either increased administration of the compound or the administration of alternative 
compounds to which the virus has not become resistant. 

The nucleic acid molecules are also useful as antisense constructs to control viral gene 
expression in infected cells and tissues. A DNA antisense nucleic acid molecule is designed to 
be complementary to, and therefore bind to, a region of the viral gene necessary for 
transcription, thereby preventing transcription and hence production of viral protein. An 
antisense RNA or DNA nucleic acid molecule would hybridize to the viral mRNA and thus 
block translation of viral mRNA into protein by the host cell's translational machinery. 
Alternatively, a class of antisense molecules can be used to inactivate viral mRNA in order to 
decrease expression of viral nucleic acid and inhibit viral replication or function. These 
molecules can therefore be used to treat viral infection. This technique involves cleavage of 
viral mRNA by ribozymes that recognize one or more regions of viral mRNA that attenuate the 
ability of the mRNA to be translated by host cell translational machinery. Possible regions 
include coding or control regions; particularly coding or control regions encoding or regulating 
proteins that play critical roles in viral function or replication, such as entry into the nucleus of 
the host cell or virion assembly. 

The nucleic acid molecules of the present invention can be employed to create transgenic 
viral resistant shrimp. Several possible mechanisms could be employed to impart WSBV 
resistance to shrimp using the nucleic acid and protein coding sequences provided by the present 
invention. One possible mechanism of imparting WSBV resistance to shrimp involves 
transforming shrimp cells with viral nucleic acids that express an attenuated virion coat protein 
such that when the transgenic shrimp is infected with WSBV, the expressed coat protein 
envelopes the virus and thereby prevents translation of the viral DNA. In this example, the 
virion coat protein can either be constitutively expressed or regulated by a promoter that is 
activated upon WSBV infection. Shrimp cells can be transformed with viral DNA under 
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suitable conditions known in the art. The WSBV construct in a vector can be microinjected 
directly into host cells using micropipettes, [Crossway, Mol Gen. Genetics, 202:179-85 (1985)], 
or using polyethylene glycol [Krens et al, Nature, 296:72-74 (1982)]. Alternatively, shrimp 
cells may be transformed by incubating the shrimp cells or tissue with an inoculum of bacteria 
that have been transformed with a vector comprising a gene that imparts WSBV resistance. The 
transformed shrimp cells are then grown and regenerated into shrimp such that the proteins 
expressed by the transformed cells impart WSBV resistance to the shrimp. 

Nucleic Acid Arrays and Detection Reagents 

The present invention further provides detection reagents and kits, such as arrays or 
microarrays, of nucleic acid molecules that are based on the novel WSBV sequence 
information provided in the present invention and particularly the transcript information 
(SEQ ID NOS: 2, 4, 6. . . 280, 282, 284, 286-293) provided in Figure 3 and polymorphic sites 
on these transcripts summarized in Table 1. 

As used herein "Arrays" or "Microarrays" refers to an array of distinct 
polynucleotides or oligonucleotides synthesized on a substrate, such as paper, nylon or other 
type of membrane, filter, chip, glass slide, or any other suitable solid, or semi-solid support. 
The development of arraying technologies such as photolithographic synthesis of a nucleic 
acid array and high density spotting of cDNA products has provided methods for making 
very large arrays of oligonucleotide probes in very small areas. See U.S. Pat. No. 5,143,854 
and PCT patent publication Nos. WO 90/15070 and 92/10092. Microfabricated arrays of 
large numbers of oligonucleotide probes, called "DNA chips", offer great promise for a wide 
variety of applications. In one embodiment, the microarray is prepared and used according to 
the methods described in US Patent 5,837,832, Chee et al., PCT application W095/1 1995 
(Chee et al.\ Lockhart, D. J. et al. (1996; Nat. Biotech. 14: 1675-1680) and Schena, M. et al. 
(1996; Proc. Natl Acad. ScL 93: 10614-10619), all of which are incorporated herein in their 
entirety by reference. In other embodiments, such arrays are produced by the methods 
described by Brown et. al., US Patent No. 5,807,522. 

The microarray or detection kit is preferably composed of a large number of unique, 
single-stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6-60 
nucleotides in length, more preferably 15-30 nucleotides in length, and most preferably about 
20-25 nucleotides in length. For a certain type of microarray or detection kit, it may be 
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preferable to use oligonucleotides that are only 7-20 nucleotides in length. For others, such as 
cDNA, longer lengths are possible and preferable. These can be of the order of lkb or more. 

The microarray or detection kit may contain oligonucleotides that cover the known 5' 
or 3' sequence, sequential oligonucleotides that cover the full-length sequence; or unique 
oligonucleotides selected from particular areas along the length of the sequence. 
Polynucleotides used in the microarray or detection kit may be oligonucleotides that are 
specific to a viral gene or genes of interest. 

In order to produce oligonucleotides to a known sequence for a microarray or 
detection kit, the viral gene(s) of interest (or an ORF identified from the contigs of the 
present invention) is typically examined using a computer algorithm which starts at the 5* or 
at the 3' end of the nucleotide sequence. Typical algorithms will then identify oligomers of 
defined length that are unique to the gene, have a GC content within a range suitable for 
hybridization, and lack predicted secondary structure that may interfere with hybridization. 
In certain situations it may be appropriate to use pairs of oligonucleotides on a microarray or 
detection kit. The "pairs" will be identical, except for one nucleotide that preferably is 
located in the center of the sequence. The second oligonucleotide in the pair (mismatched by 
one) serves as a control. The number of oligonucleotide pairs may range from one to two 
millions. The oligomers are synthesized at designated areas on a substrate using a light- 
directed chemical process. The substrate may be paper, nylon or other type of membrane, 
filter, chip, glass slide or any other suitable solid support. 

In another aspect, an oligonucleotide may be synthesized on the surface of the 
substrate by using a chemical coupling procedure and an ink jet application apparatus, as 
described in PCT application W095/251 1 16 (Baldeschweiler et al.) which is incorporated 
herein in its entirety by reference. In another aspect, a "gridded" array analogous to a dot (or 
slot) blot may be used to arrange and link cDNA fragments or oligonucleotides to the surface 
of a substrate using a vacuum system, thermal, UV, mechanical or chemical bonding 
procedure. An array, such as those described above, may be produced by hand or by using 
available devices (slot blot or dot blot apparatus), materials (any suitable solid support), and 
machines (including robotic instruments), and may contain 8, 24, 96, 384, 1536, 6144 or 
more oligonucleotides, or any other number which lends itself to the efficient use of 
commercially available instrumentation. 

In other embodiments, the array or detection reagent/kit can be produced by spotting a 
nucleic acid molecule onto the surface of a substrate (See Brown et. al., US Patent No. 
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5,807,522). In such embodiments, PCR primers to one or more nucleic acid fragments are 
used to generate nucleic acid molecules suitable for deposition onto a substrate. 

In order to conduct sample analysis using a microarray or detection kit, viral nucleic 
acid is isolated from a biological sample infected with WSBV and the viral nucleic acid is 
made into hybridization probes. Viral nucleic acid may be isolated from biological samples 
obtained from fluids, cultured cells, biopsies, or other tissue preparations from a shrimp or 
other organism of interest that is infected with WSBV. Viral mRNA is isolated, and cDNA is 
produced and used as a template to make antisense RNA (aRNA). The aRNA is amplified in 
the presence of fluorescent nucleotides, and labeled probes are incubated with the microarray 
or detection kit so that the probe sequences hybridize to complementary oligonucleotides of 
the microarray or detection kit. Incubation conditions are adjusted so that hybridization 
occurs with precise complementary matches or with various degrees of less complementarity. 
After removal of nonhybridized probes, a scanner is used to determine the levels and patterns 
of fluorescence. The scanned images are examined to determine degree of complementarity 
and the relative abundance of each oligonucleotide sequence on the microarray or detection 
kit. A detection system may be used to measure the absence, presence, and amount of 
hybridization for all of the distinct viral sequences simultaneously. This data may be used for 
large scale correlation studies on the sequences, expression patterns, mutations, variants, or 
polymorphisms among viral isolates. 

Using such arrays, the present invention provides methods to identify the expression 
of one or more of the ORFs of the present invention. In detail, such methods comprise 
incubating a test sample with one or more nucleic acid molecules and assaying for binding of 
the nucleic acid molecule with components within the test sample. Such assays will typically 
involve arrays comprising most, if not all of the genes in the WSBV genome, or rationally 
selected subsets thereof. The genomic sequence (SEQ ID NO: 1) and transcript sequences 
(SEQ IDNOS: 2, 4, 6. . .280, 282, 284, 286-293) of the WSBV genome of the present 
invention are provided in Figure 2 and Figure 3 and polymorphic sites on these transcripts 
summarized in Table 1 . 

Conditions for incubating a nucleic acid molecule with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, and 
the type and nature of the nucleic acid molecule used in the assay. One skilled in the art will 
recognize that any one of the commonly available hybridization, amplification or array assay 
formats can readily be adapted to employ the novel fragments of the WSBV genome 
disclosed herein. Examples of such assays can be found in Chard, T, An Introduction to 
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Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The 
Netherlands (1986); Bullock, G. R. et al., Techniques in Immunocytochemistry, Academic 
Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include, but are not limited to, nucleic acid 
extracts, cells, and protein or membrane extracts of cells infected with WSBV. The test 
sample used in the above-described method will vary based on the assay format, the nature of 
the detection method, and the tissues, cells, or extracts used as the sample to be assayed. 
Methods for preparing nucleic acid extracts or for preparing cells are well known in the art 
and can readily be adapted in order to obtain a sample that is compatible with the system 
utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising one of 
the nucleic acid molecules that can bind to a fragment of the WSBV genome disclosed 
herein; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting the presence of a bound nucleic acid. Preferred kits 
will include detection reagents/arrays/chips/microfluidic devices that are capable of detecting 
the expression of 1 or more, 10 or more, 100 or more, or most or all of the genes expressed in 
WSBV, particularly the genes provided in Figure 3. 

In detail, a compartmentalized kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers, strips 
of plastic, glass or paper, or arraying material such as silica. Such containers allow one to 
efficiently transfer reagents from one compartment to another compartment such that the 
samples and reagents are not cross-contaminated, and the agents or solutions of each 
container can be added in a quantitative fashion from one compartment to another. Such 
containers may include a container which will accept the test sample, a container which 
contains the nucleic acid probe, containers which contain wash reagents (such as phosphate 
buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect 
the bound probe. One skilled in the art will readily recognize that the previously unidentified 
ORFs that can be routinely identified using the sequence information disclosed herein can be 
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readily incorporated into one of the established kit formats which are well known in the art, 
particularly expression arrays. 

Protein/Peptide Molecules 

The present invention provides nucleic acid sequences that encode WSBV protein 
molecules (Figure 3). The peptide sequences provided in Figure 3, as well as the obvious 
variants described herein, and using the information in Figure 3, will be referred to herein as 
the WSBV peptides of the present invention or peptides/polypeptides/proteins of the present 
invention. 

Enzymes and other viral proteins are produced during viral activity and replication 
and can be used as targets for screening and quantitating a particular virus, or as antiviral 
targets. Like viruses in general, WSBV utilizes the resources of the host cell for production of 
viral proteins. Viral proteins can be detected using an antibody, or binding portion thereof, to 
the protein or a probe that recognizes proteins or peptides of the present invention. Viral 
antigens present either on the surface or within the infected cell can be detected by various 
antibody tests, including immunoflourescence or enzyme immunoassay (E1A). Virus or 
antigen released from infected cells can be detected by such antibody tests as enzyme linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), or latex agglutination (LA). 
Protein-based tests such as these for WSBV antigens are useful for detecting outbreaks of 
WSBV in shrimp populations. 

The present invention provides isolated peptide and protein molecules that comprise, 
consist essentially of, or consist of the amino acid sequences of the WSBV peptides disclosed 
in Figure 3, (which are encoded by the transcript sequences that are also shown in Figure 3), 
as well as all obvious variants of these peptides that are within the art to make and use. Some 
of these variants are described in detail below. 

As used herein, a peptide is said to be "isolated" or "purified" when it is substantially 
free of cellular material or free of chemical precursors or other chemicals. The peptides of the 
present invention can be purified to homogeneity or other degrees of purity. The level of 
purification will be based on the intended use. The critical feature is that the preparation allows 
for the desired function of the peptide, even if in the presence of considerable amounts of other 
components. 

In some uses, "substantially free of cellular material" includes preparations of the peptide 
having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than 
about 20% other proteins, less than about 10% other proteins, or less than about 5% other 
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proteins. When the peptide is recombinantly produced, it can also be substantially free of culture 
medium, i.e., culture medium represents less than about 20% of the volume of the protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the peptide in which it is separated from chemical precursors or other chemicals 
that are involved in its synthesis. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of a WSBV peptide having less 
than about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% 
chemical precursors or other chemicals, less than about 10% chemical precursors or other 
chemicals, or less than about 5% chemical precursors or other chemicals. 

An isolated WSBV peptide can be purified from cells infected with WSBV, purified 
from cells that have been altered to express it (recombinant), or synthesized using known protein 
synthesis methods. For example, a nucleic acid molecule encoding the peptide can be cloned 
into an expression vector, the expression vector introduced into a host cell, and the protein 
expressed in the host cell. The protein can then be isolated from the host cells by an appropriate 
purification scheme using standard protein purification techniques. Many of these techniques 
are described in detail below. 

Accordingly, the present invention provides proteins that consist of the amino acid 
sequences provided in Figure 3 (SEQ ID NOS:3, 5, 7... 281, 283, 285), for example, proteins 
encoded by the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ IDNOS:2, 
4, 6. ..280, 282, 284, 286-293). A protein consists of an amino acid sequence when the amino 
acid sequence is the final amino acid sequence of the protein. 

The present invention further provides proteins that consist essentially of the amino acid 
sequences provided in Figure 3 (SEQ ID NOS:3, 5, 7. ..281, 283, 285), for example, proteins 
encoded by the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ ID NOS:2, 
4, 6... 280, 282, 284, 286-293). A protein consists essentially of an amino acid sequence when 
such an amino acid sequence is present with only a few additional amino acid residues, for 
example from about 1 to about 1 00 or so additional residues, typically from 1 to about 20 
additional residues in the final protein. 

The present invention further provides proteins that comprise the amino acid sequences 
provided in Figure 3 (SEQ ID NOS:3, 5, 7... 281, 283, 285), for example, proteins encoded by 
the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ ID NOS:2, 4, 6. . .280, 
282, 284, 286-293). A protein comprises an amino acid sequence when the amino acid 
sequence is at least part of the final amino acid sequence of the protein. In such a fashion* the 
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protein can be only the peptide or have additional amino acid molecules, such as amino acid 
residues (contiguous encoded sequence) that are naturally associated with it or heterologous 
amino acid residues/peptide sequences* Such a protein can have a few additional amino acid 
residues or can comprise several hundred or more additional amino acids. The preferred classes 
of proteins that are comprised of the peptides of the present invention are the naturally occurring 
mature proteins. A brief description of how various types of these proteins can be made/isolated 
is provided below. 

The peptides of the present invention can be attached to heterologous sequences to form 
chimeric or fusion proteins. Such chimeric and fusion proteins comprise a WSBV peptide 
operatively linked to a heterologous protein having an amino acid sequence not substantially 
homologous to the WSBV peptide. "Operatively linked" indicates that the WSBV peptide and 
the heterologous protein are fused in-frame. The heterologous protein can be fused to the N- 
terminus or C-terminus of the WSBV peptide. 

In some uses, the fusion protein does not affect the activity of the WSBV peptide per se. 
For example, the fusion protein can include, but is not limited to, enzymatic fusion proteins, for 
example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions, MYC- 
tagged, Hi-tagged and Ig fusions. Such fusion proteins, particularly poly-His fusions, can 
facilitate the purification of recombinant WSBV peptide. In certain host cells, expression and/or 
secretion of a protein can be increased by using a heterologous signal sequence. 

A chimeric or fusion protein can be produced by standard recombinant DNA techniques. 
For example, DNA fragments coding for the different protein sequences are ligated together in- 
frame in accordance with conventional techniques. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers 
which give rise to complementary overhangs between two consecutive gene fragments which 
can subsequently be annealed and re-amplified to generate a chimeric gene sequence (see 
Ausubel et al., Current Protocols in Molecular Biology, 1992). Moreover, many expression 
vectors are commercially available that already encode a fusion moiety (e.g., a GST protein). A 
WSBV peptide-encoding nucleic acid can be cloned into such an expression vector such that the 
fission moiety is linked in-frame to the WSBV peptide. 

As mentioned above, the present invention also provides and enables obvious variants of 
the amino acid sequence of the proteins of the present invention, such as naturally occurring 
mature forms of the peptide, sequence variants of the peptides, non-naturally occurring 
recombinantly derived variants of the peptides, and orthologs and paralogs of the peptides. Such 
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variants can readily be generated using art-known techniques in the fields of recombinant 
nucleic acid technology and protein biochemistry. It is understood, however, that variants 
exclude any amino acid sequences disclosed prior to the invention. 

Such variants can readily be identified/made using molecular techniques and the 
sequence information disclosed herein. Further, such variants can readily be distinguished from 
other peptides based on sequence and/or structural homology to the WSBV peptides of the 
present invention. The degree of homology/identity present will be based primarily on whether 
the peptide is a functional variant or non-functional variant, the amount of divergence present in 
the paralog protein family and the evolutionary distance between orthologous viruses. 

To determine the percent identity of two amino acid sequences or two nucleic acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in one or both of a first and a second amino acid or nucleic acid sequence for 
optimal alignment and non-homologous sequences can be disregarded for comparison 
purposes). In preferred embodiments, at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or 
more of the length of a reference sequence is aligned for comparison purposes. The amino 
acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are 
then compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position (as used herein amino acid or nucleic acid "identity" is 
equivalent to amino acid or nucleic acid "homology"). The percent identity between the two 
sequences is a function of the number of identical positions shared by the sequences, taking 
into account the number of gaps, and the length of each gap, which are introduced for optimal 
alignment of the two sequences. 

The comparison of sequences and determination of percent identity and similarity 
between two sequences can be accomplished using a mathematical algorithm. {Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991). In a preferred embodiment, the percent identity between two amino 
acid sequences is determined using the Needleman and Wunsch (J. MoL Biol (48):444-453 
( 1 970)) algorithm which has been incorporated into the GAP program in the GCG software 
package, using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 
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12, 10, 8, 6. or 4 and a length weight of 1. 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using the 
GAP program in the GCG software package (Devereux, J. ? et al. Nucleic Acids Res. 12(1):387 
(1984)), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a 
length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two 
amino acid or nucleotide sequences is determined using the algorithm of E. Myers and W. 
Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated into the ALIGN program 
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be used as 
a "query sequence" to perform a search against sequence databases to, for example, identify 
other viruses related to WSBV or functionally related protein sequences. Such searches can 
be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et ah (J. 
Mol Biol 215:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous 
to the nucleic acid molecules of the invention. BLAST protein searches can be performed 
with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences 
homologous to the proteins of the invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in Altschul et al. {Nucleic Acids Res. 
25(1 7):3389-3402 (1997)). When utilizing BLAST and gapped BLAST programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. 
The results of one such analysis are provided in Figure 3. 

Full-length pre-processed forms, as well as mature processed forms, of proteins that 
comprise one of the peptides of the present invention can readily be identified as having 
complete sequence identity to one of the WSBV peptides of the present invention as well as 
being encoded by the same viral gene as the WSBV peptide provided herein. 

Variants of a WSBV peptide can readily be identified as being a WSBV protein having a 
high degree of sequence homology/identity (also referred to as "significant sequence 
homology") to at least a portion of the WSBV peptide as well as being encoded by the same 
viral gene as the WSBV peptide provided herein. Viral genes can readily be determined based 
on the WSBV sequence information provided in Figure 3. As used herein, two proteins (or a 
region of the proteins) have significant homology when the amino acid sequences are 
typically at least about 70-80%, 80-90%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present 
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invention, will be encoded by a nucleic acid sequence that will hybridize to a WSBV peptide 
encoding nucleic acid molecule under stringent conditions as more fully described below. 

Paralogs of a WSBV peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the WSBV peptide, as being 
encoded by a gene from WSBV, and as having similar activity or function. Two proteins will 
typically be considered paralogs when the amino acid sequences typically share at least about 
60% or greater, and more typically at least about 70% or greater homology through a given 
region or domain. Such paralogs will be encoded by a nucleic acid sequence that will 
hybridize to a WSBV peptide encoding nucleic acid molecule under moderate to stringent 
conditions as more fully described below. 

Orthologs of a WSBV peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the WSBV peptide as well as 
being encoded by a gene from another virus. Preferred orthologs will be isolated from viruses of 
commercial or medical importance for the development of broad-spectrum diagnostic and anti- 
viral agents. Such orthologs will be encoded by a nucleic acid sequence that will hybridize to 
a WSBV peptide encoding nucleic acid molecule under moderate to stringent conditions, as 
more fully described below, depending on the degree of relatedness of the two viruses 
yielding the proteins. 

Non-naturally occurring variants of the WSBV peptides of the present invention can 
readily be generated using recombinant techniques. Such variants include, but are not limited to, 
deletions, insertions, and substitutions in the amino acid sequence of the WSBV peptide. For 
example, one class of substitutions is conserved amino acid substitutions. Such substitutions are 
those that substitute a given amino acid in a WSBV peptide by another amino acid of like 
characteristics. Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxyl 
residues Ser and Thr; exchange of the acidic residues Asp and GIu; substitution between the 
amide residues Asn and Gin; exchange of the basic residues Lys and Arg; and replacements 
among the aromatic residues Phe and Tyr. Guidance concerning which amino acid changes are 
likely to be phenotypically silent are found in Bowie et al, Science 247: 1306-1 310 (1990). 

Variant WSBV peptides can be fully functional or can lack function in one or more 
activities, e.g. ability to bind to host cell receptors or ability to form structural components such 
as the viral nucleocapsid or outer membrane, etc. Fully functional variants typically contain only 
conservative variation or variation in non-critical residues or in non-critical regions. Figure 3 
provides the results of protein analysis and can be used to identify critical domains/regions. 
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Functional variants can also contain substitutions of similar amino acids that result in no change 
or an insignificant change in function. Alternatively, such substitutions may positively or 
negatively affect function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, 
inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in the 
art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et aL, 
Science 244:1081-1085 (1989)), particularly using the results provided in Figure 3. The latter 
procedure introduces single alanine mutations at every residue in the molecule. The resulting 
mutant molecules are then tested for biological activity such as DNA binding. Sites that are 
critical for virus/host cell receptor binding can also be determined by structural analysis such as 
crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et ah, J. Mol Biol. 
224:899-904 (1992); de Vos et al Science 255:306-312 (1992)). 

The present invention further provides fragments of the WSBV peptides, in addition to 
proteins and peptides that comprise and consist of such fragments, particularly those comprising 
the residues identified in Figure 3. The fragments to which the invention pertains, however, are 
not to be construed as encompassing fragments that may be disclosed publicly prior to the 
present invention. 

As used herein, a fragment comprises at least 8, 10, 12, 14, 16, or more contiguous 
amino acid residues from a WSBV peptide. Such fragments can be chosen based on the ability 
to retain one or more of the biological activities of the WSBV peptide or could be chosen for the 
ability to perform a function, e.g. bind a substrate or act as an immunogen. Particularly 
important fragments are biologically active fragments, peptides that are, for example, about 8 or 
more amino acids in length. Such fragments will typically comprise a domain or motif of the 
WSBV peptide, e.g., active site or a substrate-binding domain. Further, possible fragments 
include, but are not limited to, domain or motif containing fragments, soluble peptide fragments, 
and fragments containing immunogenic structures. Predicted domains and functional sites are 
readily identifiable by computer programs well known and readily available to those of skill in 
the art (e.g., PROS1TE analysis). 

Polypeptides often contain amino acids other than the 20 amino acids commonly 
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including the 
terminal amino acids, may be modified by natural processes during the course of viral infection, 
such as processing and other post-translational modifications by the host cell, or by chemical 
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modification techniques well known in the art. Common modifications that occur naturally are 
described in basic texts, detailed monographs, and the research literature, and they are well 
known to those of skill in the art. 

Examples of known modifications include, but are not limited to, acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 
lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, 
disulfide bond formation, demethylation, formation of covalent crosslinks, formation of cystine, 
formation of pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 

Such modifications are well known to those of skill in the art and have been described in 
great detail in the scientific literature. Several particularly common modifications, 
glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, 
hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as 
Proteins - Structure and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and 
Company, New York (1993). Many detailed reviews are available on this subject, such as by 
Wold, F., Posttranslational Covalent Modification of Proteins, B.C. Johnson, Ed., Academic 
Press, New York 1-12 (1983); Seifter et al (Meth. Enzymol 1 82: 626-646 (1990)) and Rattan et 
al. {Ann. N.Y. Acad. Sci. 663:48-62 (1992)). 

Peptides or protein encoding sequences of the present invention can be modified or 
mutated, either naturally, such as by host cell mechanisms, or by techniques known to those of 
skill in the art, to disrupt protein formation or protein function, and thereby disrupt viral 
replication and function. These methods can be used to prevent and/or treat viral infection. 

Accordingly, the WSBV peptides of the present invention also encompass derivatives or 
analogs in which a substituted amino acid residue is not one encoded by the genetic code, in 
which a substituent group is included, in which the mature WSBV peptide is fused with another 
compound, such as a compound to increase or decrease the half-life of the WSBV peptide (for 
example, polyethylene glycol), in which the additional amino acids are fused to the mature 
WSBV peptide, such as a leader or secretory sequence or a sequence for purification of the 
mature WSBV peptide or a pro-protein sequence, or in which the WSBV peptide has been 
modified or mutated, either naturally or recombinantly, to disrupt protein function, and thereby 
disrupt WSBV function and/or replication. 
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Protein/Peptide Uses 

The proteins of the present invention can be used in substantial and specific assays 
related to the functional information provided in the figures; to raise antibodies or to elicit 
another immune response; as a reagent (including the labeled reagent) in assays designed to 
quantitatively determine levels of the protein (or its binding partner or ligand) in biological 
samples; and as markers for infected samples in which the corresponding protein is 
preferentially expressed (either constitutively or at a particular stage of viral infection). 
Where the protein binds or potentially binds to another protein or ligand (such as, for 
example, a host cell receptor protein), the protein can be used to identify the binding 
partner/1 igand so as to develop a system to identify inhibitors of the binding interaction. Any 
or all of these uses are capable of being developed into reagent grade or kit format for 
commercialization as commercial products. 

Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include "Molecular Cloning: A Laboratory Manual", 
2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis 
eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", 
Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

The potential uses of the viral peptides of the present invention are based primarily on 
the function of the protein. For example, isolated WSBV peptides serve as targets for 
identifying antiviral agents, particularly for identifying antiviral agents that interfere with 
viral replication in a host cell infected with a virus that expresses the peptide. Specific and 
substantial uses for the molecules of the present invention are provided herein. Further uses 
can readily be determined using the information provided herein, that which is known in the 
art, and routine experimentation. 

The proteins of the present invention (including variants and fragments that may have 
been disclosed prior to the present invention) are useful for biological assays for viruses that are 
related to WSBV. Such assays involve any of the known protein functions or activities or 
properties useful for diagnosis of WSBV infection. 

The proteins of the present invention are also useful in virus screening assays, in cell- 
based or cell-free systems. Cell-based systems can be native, i.e., host cells infected with the 
virus, as a biopsy or expanded in cell culture. In an alternate embodiment, cell-based assays 
involve recombinant host cells expressing the viral protein. Cell-based or cell-free systems can 
be used in assays for protein activity, such as enzymatic activity. Cell-free assays can be used to 
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detect the ability of a compound to directly bind to a protein or nucleic acid fragment of the 
present invention. 

The polypeptides can be used to identify compounds that modulate activity of the 
protein. Both the WSBV peptides of the present invention and appropriate variants and 
fragments can be used in high-throughput screens to assay candidate compounds for the ability 
to bind to the WSBV peptide. These compounds can be further screened against a functional 
WSBV peptide to determine the effect of the compound on the WSBV peptide activity. Further, 
these compounds can be tested in shrimp to determine activity/effectiveness. Compounds can 
be identified that inactivate the WSBV peptide to a desired degree (antagonists). 

Further, the proteins of the present invention can be used to screen a compound for the 
ability to stimulate or inhibit interaction between the WSBV protein and a target molecule that 
normally interacts with the WSBV protein, e.g. a host cell receptor. Such assays typically 
include the steps of combining the WSBV protein with a candidate compound under conditions 
that allow the WSBV protein, or fragment thereof, to interact with the target molecule, and 
detecting the formation of a complex between the WSBV protein and the target or detecting the 
biochemical consequence of the interaction between the WSBV protein and the target, such as 
any of the associated effects of host cell signal transduction such as protein phosphorylation, 
cAMP turnover, or adenylate cyclase activation, etc. 

Candidate compounds include, for example, 1) peptides such as soluble peptides, 
including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et 
al., Nature 354:82-84 (1991); Houghten et al t Nature 354:84-86 (1991)) and combinatorial 
chemistry-derived molecular libraries made of D- and/or L- configuration amino acids; 2) 
phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide 
libraries, see, e.g., Songyang et ai., Cell 72:767-778 (1993)); 3) antibodies (e.g., polyclonal, 
monoclonal, anti-idiotypic, chimeric, and single chain antibodies as well as Fab, F(ab')2, Fab 
expression library fragments, and epitope-binding fragments of antibodies); and 4) small organic 
and inorganic molecules (e.g., molecules obtained from combinatorial and natural product 
libraries). 

One candidate compound is a non-virulent soluble fragment of the WSBV peptide that 
competes for substrate binding, such as for binding to shrimp cellular receptors. Other candidate 
compounds include non-virulent mutant WSBV peptides or appropriate fragments containing 
mutations that prevent WSBV virulence and thus compete for substrate. Accordingly, a 
fragment that competes for substrate, for example with a higher affinity, or a fragment that binds 
substrate but is inactive or non-virulent, is encompassed by the present invention. 
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The invention further includes other end point assays to identify compounds that 
inhibit WSBV activity. The assays typically involve an assay of events in the shrimp cell signal 
transduction pathway that indicate viral activity. Thus, the phosphorylation of a substrate, 
activation of a protein, a change in the expression of genes that are up- or down-regulated in a 
host cell in response to WSBV infection can be assayed. 

Any of the viral functions mediated by a WSBV protein can be used as an endpoint 
assay. These include all of the biochemical or biological events described herein, and in the 
references cited herein, incorporated by reference for these endpoint assay targets, and other 
functions known to those of ordinary skill in the art or that can be readily identified using the 
information provided in the figures, particularly Figure 3. 

The proteins of the present invention are also useful in competition binding assays in 
methods designed to discover compounds that interact with the viral protein (e.g. binding 
partners and/or ligands). Thus, a compound is exposed to a viral polypeptide under conditions 
that allow the compound to bind or to otherwise interact with the polypeptide. Soluble viral 
polypeptide is also added to the mixture. If the test compound interacts with the soluble viral 
polypeptide, it decreases the amount of complex formed or activity from the viral protein target. 
This type of assay is particularly useful in cases in which compounds are sought that interact 
with specific regions of the viral protein. Thus, the soluble polypeptide that competes with the 
target viral protein region is designed to contain peptide sequences corresponding to the region 
of interest. See Hodgson, Bio/technology, 1 992, Sept 1 0(9), 973-80, for a review of competition 
binding assays and other receptor screening assays. 

To perform cell free drug screening assays, it is sometimes desirable to immobilize 
either the viral protein, or fragment, or its target molecule to facilitate separation of complexes 
from uncomplexed forms of one or both of the proteins, as well as to accommodate automation 
of the assay. 

Techniques for immobilizing viral proteins on matrices can be used in the drug screening 
assays. In one embodiment, a fusion protein can be provided which adds a domain that allows 
the viral protein to be bound to a matrix. For example, glutathione-S-transferase fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell lysates (e.g., ^S- 
labeled) and a candidate drug compound, and the mixture incubated under conditions conducive 
to complex formation (e.g., at physiological conditions for salt and pH). Following incubation, 
the beads are washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly, or in the supernatant after the complexes are dissociated. Alternatively, the 



35 



WO 01/38351 



PCT/US00/28888 



complexes can be dissociated from the matrix, separated by SDS-PAGE, and the level of a viral 
protein target ligand, such as a host cell receptor protein, found in the bead fraction quantitated 
from the gel using standard electrophoretic techniques. For example, either the viral protein or 
its target ligand can be immobilized utilizing conjugation of biotin and streptavidin using 
techniques well known in the art. Alternatively, antibodies reactive with the viral protein but 
which do not interfere with binding of the viral protein to its target ligand can be derivatized to 
the wells of the plate, and the viral protein trapped in the wells by antibody conjugation. 
Preparations of a viral protein target ligand and a candidate compound are incubated in the viral 
protein-presenting wells and the amount of complex trapped in the well can be quantitated. 
Methods for detecting such complexes, in addition to those described above for the GST- 
immobilized complexes, include immunodetection of complexes using antibodies reactive with 
the viral protein target ligand, or which are reactive with viral protein and compete with the 
target ligand, as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the target ligand. 

Agents that modulate one of the viral proteins of the present invention can be identified 
using one or more of the above assays, alone or in combination. It is generally preferable to use 
a cell-based or cell free system first and then confirm activity in a shrimp, or other organism, 
infected with WSBV. 

Modulators of viral protein activity identified according to these drug screening assays 
can be used to treat shrimp infected with WSBV. These methods of treatment include the steps 
of administering a modulator of viral protein activity in a pharmaceutical composition to an 
organism, such as a shrimp, that is infected with WSBV, the modulator being identified as 
described herein. 

In yet another aspect of the invention, the WSBV proteins can be used as "bait 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 
Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J, Biol Chem. 268:12046-12054; 
Bartel et al (1993) Biotechniques 14:920-924; Iwabuchi et al (1993) Oncogene 8:1693- 
1696; and Brent WO94/10300), to identify other proteins which bind to or interact with the 
viral protein and are involved in viral protein activity, and therefore are targets for inhibiting 
viral protein activity. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one construct, the gene that codes for a viral protein is 
fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., 
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GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that 
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If the "bait" and the "prey" proteins are 
able to interact, in vivo, forming a viral protein-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to a 
transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription factor 
can be isolated and used to obtain the cloned gene which encodes the protein which interacts 
with the viral protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an agent 
identified as described herein in an appropriate animal model, such as a shrimp infected with 
WSBV. For example, an agent identified as described herein (e.g., a viral protein-modulating 
agent, an antisense viral nucleic acid molecule, a viral protein-specific antibody, or a viral 
protein-binding partner) can be used in a shrimp, or other organism, infected with WSBV to 
determine the efficacy, toxicity, or side effects of treatment with such an agent. 
Alternatively, an agent identified as described herein can be used in an animal or other model 
to determine the mechanism of action of such an agent. Furthermore, this invention pertains 
to uses of novel agents identified by the above-described screening assays for treatments as 
described herein. 

The viral proteins of the present invention are also useful for providing targets for 
diagnosing viral infection. Accordingly, the invention provides methods for detecting the 
presence, or levels of, the viral protein (or encoding nucleic acid) in an infected cell, tissue, or 
organism. The method involves contacting a biological sample with a compound capable of 
interacting with the viral protein such that the interaction can be detected. Such an assay can be 
provided in a single detection format or a multi-detection format such as an antibody chip array. 

One agent for detecting a protein in a sample is an antibody capable of selectively 
binding to a WSBV protein. A biological sample includes tissues, cells and biological fluids 
isolated from a shrimp or other infected organism, as well as tissues, cells and fluids present 
within the infected organism. 

In vitro techniques for detection of viral peptide include enzyme linked immunosorbent 
assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence using a 
detection reagent such as an antibody or protein binding agent. Alternatively, the peptide can 
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be detected in vivo in an infected organism by introducing into the subject a labeled anti-peptide 
antibody or other type of detection agent. For example, the antibody can be labeled with a 
radioactive marker whose presence and location in an infected organism can be detected by 
standard imaging techniques. Particularly useful are methods that detect fragments of a peptide 
in a sample. 

Antibodies 

The invention also provides antibodies that selectively bind to one of the WSBV 
peptides of the present invention, a protein comprising such a peptide, as well as variants and 
fragments thereof. As used herein, an antibody selectively binds a target peptide when it binds 
the target peptide and does not significantly bind to unrelated proteins. An antibody is still 
considered to selectively bind a peptide even if it also binds to other proteins that are not 
substantially homologous with the target peptide so long as such proteins share homology with a 
fragment or domain of the peptide target of the antibody. In this case, it would be understood 
that antibody binding to the peptide is still selective despite some degree of cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized within 
the art: they are multi-subunit proteins produced by a mammalian organism in response to an 
antigen challenge. The antibodies of the present invention include polyclonal antibodies and 
monoclonal antibodies, as well as fragments of such antibodies, including, but not limited to, 
Fab or F(ab') 2 , and Fv fragments. 

Many methods are known for generating and/or identifying antibodies to a given target 
peptide. Several such methods are described by Harlow, Antibodies, Cold Spring Harbor Press, 
(1989). In general, to generate antibodies, an isolated peptide is used as an immunogen and is 
administered to a mammalian organism, such as a rat, rabbit or mouse. The antibodies 
generated by the organism in response to the immunogen are then isolated. The full-length 
protein, an antigenic peptide fragment or a fusion protein can be used. Particularly important 
fragments are those covering functional domains, such as the domains identified in Figure 3, and 
domains of sequence homology or divergence between WSBV and other viruses, such as those 
that can readily be identified using protein alignment methods and as presented in the figures. 

Monoclonal antibodies can be produced by hybridomas, which are immortalized cell 
lines capable of secreting a specific monoclonal antibody. The immortalized cell lines can be 
created in vitro by fusing two different cell types, usually lymphocytes, one of which is a 
tumor cell. 
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Antibodies are preferably prepared from regions or discrete fragments of the WSBV 
proteins. Antibodies can be prepared from any region of the peptide as described herein. 
However, preferred regions will include those involved in function/activity and/or virus/host 
interaction. Figure 3 can be used to identify particularly important regions while sequence 
alignment can be used to identify conserved and unique sequence fragments. 

An antigenic fragment will typically comprise at least 8 contiguous amino acid residues. 
The antigenic peptide can comprise, however, at least 10, 12, 14, 16 or more amino acid 
residues. Such fragments can be selected on a physical property, such as fragments 
corresponding to regions that are located on the surface of the protein, e.g., hydrophilic regions, 
or can be selected based on sequence uniqueness (see Figure 3). 

Detection of an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable substances 
include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples 
of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of 
a luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin, and examples of suitable radioactive material include ,25 I, 
13, I, 35 S or 3 H. 

Antibody Uses 

The antibodies can be used to isolate one of the proteins of the present invention by 
standard techniques, such as affinity chromatography or immunoprecipitation. The 
antibodies can facilitate the purification of the natural viral protein from host cells infected 
with WSBV, as well as recombinantly produced protein. In addition, such antibodies are 
useful for detecting the presence of the viral proteins of the present invention in cells or 
tissues in order to determine the pattern of viral infection among various cells or tissues in a 
shrimp or other organism over the course of viral infection. Further, such antibodies can be 
used to detect protein in situ, in vitro, or in a cell lysate or supernatant in order to evaluate the 
abundance and pattern of viral infection. 
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The antibodies can also be used to assess subcellular localization of virus particles in 
host cells. The diagnostic uses can be applied, not only in genetic testing, but also in monitoring 
a treatment modality. Accordingly, where treatment is ultimately aimed at preventing or halting 
expression of the WSBV protein, antibodies directed against the protein or relevant fragments 
can be used to monitor therapeutic efficacy. 

The antibodies are also useful diagnostic tools, such as for use as immunological 
markers for aberrant viral protein analyzed by electrophoretic mobility, isoelectric point, tryptic 
peptide digest, and other physical assays known to those in the art. 

The antibodies are also useful for inhibiting protein function. For example, antibodies 
may bind directly to viral peptides to block binding of the viral peptide to a binding partner 
such as a host cell receptor. Antibodies can thereby serve as antiviral agents. Antibodies can 
be prepared against specific fragments containing sites required for protein function or 
against intact viral protein that is associated with virulence. 

The invention also encompasses kits for using antibodies to detect the presence of a 
WSBV protein in a biological sample, such as a shrimp cell sample. The kit can comprise 
antibodies such as a labeled or Iabelable antibody and a compound or agent for detecting viral 
protein in a biological sample; means for determining the amount of protein in the sample; 
means for comparing the amount of protein in the sample with a standard; and instructions for 
use. Such a kit can be supplied to detect a single protein or epitope or can be configured to detect 
one of a multitude of epitopes, such as in an antibody detection array. Arrays are described in 
detail above for nucleic acid arrays and similar methods have been developed for antibody 
arrays. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the above-described modes for carrying out the invention which are obvious to those skilled 
in the field of molecular biology or related fields are intended to be within the scope of the 
following claim. 

Certain aspects of the present invention are described in greater detail in the non- 
limiting examples that follow. 
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Examples: 
Infected prawn 

Dead and moribund PJaponicus with evident white spots on the inside surface of the 
crust were collected from a prawn farm and kept at 4 °C. 

Isolation of nucleocapsids 

Hepatopancreata gill and intestine were removed from PJaponicus and placed in an 
ice-bathed beaker, homogenized as a 10% suspension in TESP buffer (50 mmol/1, Tris-HCl, 
pH 8.5, 10 PMSF), then centrifuged at 6500 x g for 10 min at 4 °C. The supernatant was 
recentrifuged at 30,000 x g for 30 min at 4°C. The pellet was suspended in an approximate 
two volume of TESP buffer containing 1 % (v/v) Triton X-l 00. After centrifugation at 5000 x 
g for 10 min, the supernatant was centrifuged again at 25,000 x g for 20 min. The pellet was 
suspended in TESP buffer and differential centrifugation was repeated, and then the 
precipitate was resuspended in TMP (100 mmol/I, Tris-HCl, pH 7.5, 10 mmol/1 MgCl 2 /l, 1 
mmol/1 PMSF) buffer and treated with DNase and RNase. The mixture was incubated at 
37°C for 15 min and 30 ml of TESP buffer was added. Differential centrifugation was 
repeated again and the pellet was resuspended in 1 ml of TESP buffer, 1 //I of suspension was 
dropped on a copper grid, negatively stained with 2% (w/v) uranyl acetate, pH 7.6, and 
observed using a JEM-100CX II transmission electron microscope. 

Purification of viral DNA 

The nucleocapsid suspension was lysed with 2 ml of GTE buffer (6 mol/1 guanidine 
hydrochloride; 50 mmol/1 Tris-HCl; 10 mmol/I EDTA; pH 7.0), slightly mixed, and then 
centrifuged at 25,000 x g for 10 min at 4 °C. The supernatant was collected and 0.02 vol. of 1 
mol/1 MgCl 2 and 0.6 vol. of isopropanol were added. After centrifugation, the pellet was 
picked out with pipette tip, washed twice with 70% ethanol and then dissolved in 1 ml of TE 
buffer containing 0.5% (w/v) SDS and 0.5 mg/ml proteinase K and incubated at 55°C for 2-3 
hrs. The DNA was precipitated again with 0.01 vol. of 1 mol/1 MgCl 2 and 0.25 vol. of 
isopropanol and dissolved in 0.1 X TE buffer. The viral DNA obtained was quantified by a 
spectrophotometer. 

WSBV genomic DNA library construction 



41 



WO 01/38351 



PCT/US00/28888 



Construction of a random "shotgun" library: 

WSBV genomic DNA was sheared with sonication. Mung Bean nuclease was used to 
blunt the end. The DNA fragments between 1 .8-2kb were recovered from an agarose gel 
following electrophoresis. The blunt end DNA was cloned into pUCI 8 vectors. The vector 
was subsequently transformed into DH5a cells and plated onto LB plate. 
Construction of a restriction fragment library: 

WSBV genomic DNA was partially digested with Sau3Al restriction enzymes. DNA 
fragments between 5-10kb were recovered from the agarose gel. pBluescript vectors were 
digested by the restriction enzyme and the ends were dephosphorylated. The fragments were 
cloned into pBluescript vector and transformed into XL-blue competent cells. Subsequently, 
the DNA plasm id was prepared. 

Large scale DNA sequencing 
PCR reactions: 

PCR reactions were carried out in a 25 //I volume containing 0.2mM dNTP, 1.5mM 
MgCb, 5 //M of each primers, 2.5 unit of Taq polymerase, and a single white colony as 
template. Take out Iwl as glycerol stocks. PCR reactions were done in a PE 9700. The 
cycling profile consisted of an initial denaturation at 95°C for 12 min (one cycle) followed by 
30 cycles of denaturation at 95°C for 15s, annealing at 58°C for 20s, and extension at 72°C 
for 2 min. Then 5 [A of PCR products were visualized on 1% agarose gels stained with 
ethidium bromide. Excess primers and dNTPs were removed by digesting the PCR products 
with exonuclease I and shrimp alkaline phosphatase. 
DNA sequencing: 

Sequencing reactions were carried out in 5jil volume containing l(-il of "BigDye" 
premix (PE Applied Biosystems), 3.2^iM sequencing primer and 30-90ng PCR products. 
Sequencing reactions were done in a PE 9700. The cycling profile consisted of 30 cycles of 
denaturation at 95°C for 30s, annealing at 50°C for 30s, and extension at 60°C for 4 min. 
Excess Dye terminators were remove with ethanol precipitation, and sequencing was carried 
out on AB1 377 automated sequencer. 
Sequence analysis 

A total of 5795 sequences were assembled in a UNIX system using InnerPeace 
software designed based on the "Phred, Phrap and Consed" program originally developed by 
the University of Washington. Sequences were edited and finished as follows: a) for bad 
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sequence quality, sequencing was repeated; b) for regions with repetitive sequences, which 
may cause misassembly, primers were designed for walking on the original PCR products or 
on plasmid DNA; c) for mapping gaps, clones were sequenced that cover the gap; d) for 
physical gaps, PCR primers were designed between the gaps, then the PCR products were 
sequenced that cover the gap; e) for gaps that can't be covered by PCR methods, walking on 
WSBV genomic DNA was applied. 

WSBV cDNA library construction and WSBV cDNA clone selecti on 
Poly(A)-mRNA was purified by using the "PolyATtract System 1000" kit (Promega). 
Double stranded cDNA was synthesized and cloned using the "SUPERSCRIPT™ Plasmid 
System for cDNA Synthesis and Plasmid Cloning" kit (GIBCO BRL). cDNA clones were 
transformed into DH10 cells and then plated. WSBV cDNA clones were selected by DNA 
hybridization using Dig labeled WSBV genomic DNA as a probe (Dig labeling kit, 
Boehringer Mannheim). Finally, the plasmid DNA was prepared for automatic sequencing. 
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Claims 

That which is claimed is: 

1 . An isolated peptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .281 , 283, 285; 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .28 1 , 283, 
285, wherein said variant is encoded by a nucleic acid molecule that hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 

6... 280, 282, 284, 286-293; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in SEQ 
ID NOS:3, 5, 7. ..281, 283, 285, wherein said ortholog is encoded by a nucleic acid molecule 
that hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule 
shown in SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293; and 

(d) a fragment of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .28 1 , 283, 
285, wherein said fragment comprises at least 10 contiguous amino acids. 

2. An isolated peptide consisting of an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .281 , 283, 285. 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .28 1 , 283, 
285, wherein said variant is encoded by a nucleic acid molecule that hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 

6.. .280, 282, 284, 286-293; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in SEQ 
ID NOS:3, 5, 7... 28 1, 283, 285, wherein said ortholog is encoded by a nucleic acid molecule 
that hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule 
shown in SEQ ID NOS:2, 4, 6. . .280, 282, 284, 286-293; and 

(d) a fragment of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .28 1 , 283, 
285, wherein said fragment comprises at least 10 contiguous amino acids. 

3. An isolated antibody that selectively binds to a peptide of claim 1 . 
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4. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ ID 
NOS:3, 5, 7... 281, 283, 285; 

(b) a nucleotide sequence that encodes a variant of an amino acid sequence shown in 
SEQ ID NOS:3, 5, 7... 281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 
4, 6... 280, 282, 284, 286-293; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7... 281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 

4. 6... 280, 282, 284, 286-293; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence shown 
in SEQ IDNOS:3, 5, 7... 281, 283, 285, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of (a)-(d). 

5. An isolated nucleic acid molecule consisting of a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ ID 
NOS:3, 5, 7... 281, 283, 285; 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7.. .281, 283, 
285, wherein said nucleotide sequence hybridizes under stringent conditions to the opposite 
strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7... 281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 
4, 6. . .280, 282, 284, 286-293; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7. . .281, 283, 285, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of (a)-(d). 

6. A gene chip comprising a nucleic acid molecule of claims 4 or 5. 
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7. 



A transgenic non-human organism comprising a nucleic acid molecule of claims 4 or 5. 



8. 



A nucleic acid vector comprising a nucleic acid molecule of claims 4 or 5. 



9. 



A host cell containing the vector of claim 8. 



1 0. A method for producing any of the peptides of claim 1 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

11. A method for producing any of the peptides of claim 2 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

1 2. A method for detecting the presence of any of the peptides of claims 1 or 2 in a sample, 
said method comprising contacting said sample with a detection agent that specifically allows 
detection of the presence of the peptide in the sample and then detecting the presence of the 
peptide. 

13. A method for detecting the presence of a nucleic acid molecule of claims 4 or 5 in a 
sample, said method comprising contacting the sample with an oligonucleotide that hybridizes to 
said nucleic acid molecule under stringent conditions and determining whether the 
oligonucleotide binds to said nucleic acid molecule in the sample. 

14. A method for identifying a modulator of a peptide of claims 1 or 2, said method 
comprising contacting said peptide with an agent and determining if said agent has modulated 
the function or activity of said peptide. 

15. The method of claim 14, wherein said agent is administered to a host cell comprising an 
expression vector that expresses said peptide. 
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16. A method for identifying an agent that binds to any of the peptides of claims 1 or 2, said 
method comprising contacting the peptide with an agent and assaying the contacted mixture to 
determine whether a complex is formed with the agent bound to the peptide. 

17. A pharmaceutical composition comprising an agent identified by the method of claim 16 
and a pharmaceutical ly acceptable carrier therefor. 

1 8. A method for treating WSBV infection, said method comprising administering to an 
organism a pharmaceutically effective amount of an agent identified by the method of claim 16. 

19. A method for identifying a modulator of the expression of a peptide of claims 1 or 2, 
said method comprising contacting a cell expressing said peptide with an agent, and determining 
if said agent has modulated the expression of said peptide. 

20. An isolated WSBV peptide having an amino acid sequence that shares at least 70% 
homology with an amino acid sequence shown in SEQ ID NOS:3, 5, 7... 281, 283, 285. 

21 . A peptide according to claim 20 that shares at least 90 percent homology with an amino 
acid sequence shown in SEQ IDNOS:3, 5, 7... 281, 283, 285. 

22. An isolated nucleic acid molecule encoding a WSBV peptide, said nucleic acid 
molecule sharing at least 80 percent homology with a nucleic acid molecule shown in SEQ 
ID NOS:2, 4, 6... 280, 282, 284, 286-293. 

23. A nucleic acid molecule according to claim 22 that shares at least 90 percent 
homology with a nucleic acid molecule shown in SEQ ID NOS:2, 4, 6... 280, 282, 284, 286- 
293. 

24. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
1 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS:2, 4. 6... 282, 284, 286, 288-292 and 293. 

25. The detection reagent of claim 24, wherein said reagent is a nucleic acid array. 
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26. The array of claim 25, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

27. The array of claim 25, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 

28. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
10 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS:2, 4, 6. ..282, 284, 286, 288-292 and 293. 

29. The detection reagent of claim 28, wherein said reagent is a nucleic acid array. 

30. The array of claim 29, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

31 . The array of claim 29, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 

32. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
100 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS: 2, 4, 6... 282, 284, 286, 288-292 and 293. 

33. The detection reagent of claim 32, wherein said reagent is a nucleic acid array. 

34. The array of claim 33, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

35. The array of claim 33, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 
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36. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
all genes from WSBV, wherein said genes are selected from the group consisting of SEQ ID 
NOS:2, 4, 6... 282, 284, 286, 288-292 and 293. 

37. The detection reagent of claim 36, wherein said reagent is a nucleic acid array. 

38. The array of claim 37, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

39. The array of claim 38, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 



49 



WO 01/38351 



1/201 



PCT/US00/28888 




& 
o 

CO o 

&= 



II 



s 
s 



f 

•i 







A.. ,_t 

£ 






lulu 

ice 






«s > 

~ Q 


« 

1 




vable 

rage! 






Sto; 









s 

I 



J- 



i 



FIGURE 1 



WO 01/38351 



2/201 



PCT/US00/28888 



GGATCCAGACATCACCCCAACAATCGCAGACCCGATTACAGTAGCAGAAGTTGGAAACC^ 

AAATTCAT^3AGATTCTCTACTAATTTCTCATT^TAAAATACGTCAAACGGTTT^ 

TCTTATTAGATTTGATTTTTGAAAAATTGJITAGACT^^ 

CAGGACGGCAACTTGTCGTCTAGCGCAATCTTATTACCCCCTCCCTGAAACrGTTGGTAT 

TTCCCTAACAAT GAAAGGTATAT AT AT GAATTTC AAGCT ACAAT G T AAAT AATT GGTT AAT AAAA T AAAGGT AT ATTTTTAAAAAATGTGTTTATTTTTC 

C C CAACCTT AAACAGATCAT TGC C AGG AG AAAAT CGCAT ACTT AACAGAT AAT G C CT ATTTGT AT CATCGArGT C TT CGTCAAACGTTGCTCCAAAC A CA 

AG7GTGTTGAT C CTATTT TTCTTC AAAAC TGTTGG TTTG AAATAT AAT AC AAATT TGTT AGC T G AAAATTCCCT TCCTGT AGCATTT A GACC T GGAGACA 

GTGAAACAGAGTCCACTGTTATATCCCTCTTTGTATTATTGTATACTC^ 

GTCTCCGTCTCCAAGTAAAGATGTGAGGATTATCTCTCCTTTCTTTTTTArc 

TTCCTGGTTTOTAATCTGCCGTTTACAAAGTTAAAGTGGTT^ 

CTTTATCCTTCTTGTCCiU\TTTCTTGTTAAGTTCTAT^ 

AACCCCCCACATGTGCATTO AAG AAJW3TTGTACAAAGGTC AT GAGAGAAAJUUIACAGAT ACT AAAGTTTGTT ATATTTT ATT^ AGTCGTAGAAATCAC 

CACTGAGATATATTGTGAGCAGCAACTGCCGTTTCAG^TACTCGAAAGAA^ 

TATTTCTGCTAGTTriOTTGAAGTACTGGATAGCCTT™ 

CTGGCAATTCACAAAGGAGTGCTGATTATTGTTGTTATGATGGTACATTTTGAAGGTGAGAGC^ 
CTCAGGTACATCCACGCTC ACCT7T T AT AC CAACCTCTT ACACTT CCTTCA^ 
GAATCTAATATCTtrCCTCAGATCTGTTCAAAACTTTCCACGACTCCATTGGGTCCAG^ 
GGTTGAATATATGCAGATGATGCATCAAAATCTACTTCTTCGCCG<n\^TGCCTG 
CTTCTTCCATTTCTTCTCCACTATCTAAAAATTCGTCTATATCATTAT^ 

AGGAGATGGGGT TC GTGT CGGGGGAGGTG G AGGAGT A GGAGT AGG AGTAGGT GG AGGAGT AGG AGT AGGAGT AGG AGGGGGTGGAGG TAACAAACC TTCT 
CCAGGCACAGTTCCTGTTCCTCCACCTCtTTGTATTGGACCCAGTATT^^ 

CAGCACCTTCCT CGTCTCTTCCGATCGGC CTATTT TC AGGTTCCCTTCGCCC TAATACAGCTCCTTC GGTCTCATCTTCTTTTTCCTTCrCTTT ATCTTT 

CT 'fTbT r i TTT l' ATTGGATGTTGT RGTAX5T TTTATTTGGTCGTGACATGAAAATCACACC AACCACC ACACCCACAJUACACAC AAAAAGT ATGAGAGAT 

AATAAAAGGAACATCATTATTATCCTTATTTTTTrrACATTCA^ 

CTGACCTCTGCCGCGAAraCTTCATAC^TTCCATATTCTTTG^ 

AAGATCCATCOiCAACAGCGGCCGTGGATTCTTCTTCA^ 

CAAATCTATAACCTTCTGCCTAAAGACAGGCCTCTCTGCTAAAAGAGAAGGATTTTCTtrTAAGCTGGT^ 

TTTTAAAACCCAAACTCTTCTAAAATATCTACAGATTCCAGGTACTCTGCCGCCACTGTTGTAATCA CTATCACTGCTT GTCTTT 

TATCTTCCACATCTCCCACCTCGTCTTCGTGTTTTTTCCTITCT7CTTCTAGAA 
TACCTGAGAAGCTTCCTGTTTTACATCATTCACAAATTCG^GCGATTCTAGTACGAGCCT 
GTTGTTTAGACCTAGATAGAGAGTGGGAGACGCTATTAATTTTTTTGT^ 

AAGTCTCT ACAAAGCAAT AGAAGGGGT AAAAACAAAACAC AC C ATGGTGCT AAC TCTTTCCT GT ACG ACT AGGC GC GTAGCGTC T AGCAAGGGGAAT TTC 
TCTAAGGAAGATGCGGTGTTGCGGAACCAGTTCCCCATTTTAAAGAAATCAAACAAC^ 

C AGT GGAAAAAAT ATTCAGGGAAT GG AAC GAAAG TGGGG GAG AA AAAAT TT'TC G AC ATAT CTCAG AAT GAAGAAG AATGG ATGGAT ATCATATCC T T AGT 
GGAAAGTGTATATGAACCTGTATTTTCTAAATCACT^ 

GCCGTGGATGJU^AAATTGACAATCTTATCAGGTAGTGATGGGAGTGTGCTTCAACGTACAAC^ 

CTCTTTTAAATAATGAAAAATGGACATCTATT^ATTGGACAGGTTAAAAACAGCCAAGA 

AGTAGAAGTACTTC^TCGGrTGAATAAACTCAAGCAGGCTCCTCTTGCCCACCA 

TCTGCTOSAACACTCATCGCATCAGATATGCATTTCAJiATTGGTTGA 

AAAGCGGGGAAGAAGiyVGATGAGGAAGAAGAAGAAGAAGAAAAGAAACATTC 

TTATCATAGGCCCGAAAGTGCTTCTGTTTACTTTTGTGATTATTATGACTA 

CCTGGCACATTCAATATGGAGGATTTACCTTTCCGCCCTTTCGCAGTACCrrTCAACrrTATA^ 

ATCrTCCt^AGCTAAGTTTCGACTATC^X^AGTTTTTATGTT^ 

AAATAGTTCCATGATATt^TTTGATTCTCAGACATTGAGTGGTGTGT^ 

GATTTGGCCGTTAGGAGTATGGCAGAATTTATCCGCACTGAAGCGCATAAGG^ 

AAGAAGAAGCGATGGACCAGGAGCCTGCAGAAGTAGACTTCCTTTCAGTGCCTC^ 

TGTC^SAGAACCATCTTTCTATATTTCCTTTCTAACTTCAAGAATC 

GAATTTGAAGCATTATCTTCCCATrTTATTCCTTTCAAGAATAT^ 

ACCTGTCTCTGTCGGATAAATCTCCCCCTAGCCGTGTCCGTGGAAGCA^ 

CGGTATAGAAGAGITGCCrGAAAATGTACTAGTCGGTTTGTCCGGAGGATTTGAAGATACCGACATGTAT^ 

GG TTGTGATGGAGG AAAAGT GC? AAGT GTC AC CTTCAATT GT GGT GAT AATT T T AT CCAGCT C C ATG AAAAAACAGCAGAAACTT TT AAGGATGAT A C GG 
ATTTAGTTGAACGAAT AAGAGATGTC<ITTC AGACTGCA CT ACGCTGTTTTGC G 

TGAAAATGGCATTGAGCGCCCCGGGGACGATTTTACAGAAAAGGGGATTGC^ 
ATAACGG^rGAAGGAGTCAAGGGATTTTTCAGOBGTTTTCGTGACATTTT^ 

GCCAAGGCATAGTAAAAGAGACCGAAGGACTGACAGCTGCAACAGTGGCAGAAACATCCTTCTCTGAAGGTTTAGCTGAAAG^ 
TCTAGGTCTAGAATTYTCAGAGGACGCCAAAACGGTTGTATTCAAA^ 
ACTTCTTTTTraTCGTTTCCAAGGGACATGGGCGTCCAAGTTAGT^ 
CCCTTGAACAAAACTTGAAAGATCTCGACAAATTCGAAGAGACTATACCAGTU^GT^ 

TCCAGAAAGGGCTGGCAAAGAAATT AACGACACTGAACTGTC AAAGGCTACAGAT TCAGT ATTGGGGAAGAAACT AGGCAAT GC AGTT ACAGTGTTG ATG 
AACAACTTTG^^GGTTACAATTGTAGTAGGGGCTTCTGT^ 

AT C T C AACGTCGTG G ACC ACAC C AGCC CT AAA GGT GT CATC A G TT A T AAAATTG T GGACTTT TCTT G TGCAGATAGAAACAC CGG AT GG GCT AAGCC AAC 

CAAGCACCCGTTCAGGGAAGAAATAGACCATGTTATCGCACTAGATGCATCAT^ 

AAATCGAAATATAAGGCCTACG<^CCAATCTGTGGAACAAAAGA7GCTGCTCA^ 

TGCCTTGGGTGGCAAGCATGAAAGATTTGCCTAAAGGACAATCCCTCTCCTGCGATAAAGGGATGTCCACTT^ 
C^TAGGAAAGGATGTTCCAGAGGCTATTTTTGAGGTTGCAGAGG 

TTCAT ATTTGGAGTGCCTCTTGGATTTG^GT AT AGCTGCTACACG C CTCAATCCATCCAACTGGAAAACT GGCCTCATTGTATTTTCAAT ACT ACTAGTGG 

TCATACTGATAGTTCGCTTCTTTGCAGG^TCGGGCCCGCTAACCCTGAATTGGTT^ 

AGACGGGGGAGGAAATCGTTCAAAAATAGTATTGGCAGAAAAG^ACAACGCCA^ 

GAGGAGCTTCCTGGGCATGAAG AT T TGCGCCCAGTTT TOTTCCCTGCCACAACAAATTATTCCAAATCTGCCAAGATTCTGG^CTACAAATCTAAACCCT 
TCAACGACTTTTATACAAAAATAATAAACACAGACATCA^ 

CAATAAAGATTATACATGTATAAAAATT'GCATTG-TTTTATTCCTATACACCAACAATAGATATAGA 
TTTCCATCACCCATTCTGATTTTATTCCTATTCTTCTCCAAAACAGATTGCATTCCGTGTC 

ATGGCTCCGACATATCTTGT ACTGT CGATTGGATAGT TTCAGCTAGGGAATX5TTTTTCTGGTCCAAATT CTCCTCTCCC AT AACGGATGCGACGAT AAGC 
TTCAGCAGAAGCCAGCAGGCAAAAGTGGGCCCAGCCAGCCTCGTCAGTACAC^^ 

AT AAAATCG GAATGGGTC AAGT GC ATGTC CTTT AATT TACCCGT TT TTGACAAGT AA GTGAC T G GC AAC TGT T TGGCAG T AAATT GTTC ATT GAT AT GA G 

AAGTGGAGTATTTTTCTTGTTTAGAAGCATGTCGTATATAGAGAGCGA<^GCTAAGGCAGTTATTGCTAGA 

ATTCATTCGTTTAGTAGTTATTACAATTTTGTTCCACATGGTCATAATTTCTC^ 

TCATAGGCACAGGCG^TArTCCGACGGTCCTCTTCCGAGAATATTTTCTCTAGATTGAAGCATTGCAG 
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atcatcacgftggcaaaagattgtcctttgatc cttttg ataaagc ttcx3atgtttat aaaggaacct aaaagtttccac att at agcaatgtgat aaaag 

ctgtccaacgaagcatatacagcagctgggaaagttgtccatattaattgcccactttgtaacgcagaggctaaattttcaaccccgat 

ggcagatgttcgccatccaatctagttttgttatcgtatctagaactggggtattcttagccagtttgactgcctcccctttcggt 

aactggctccttttcagtagagttggtttttargtttttctctgatgc^ 

tcactcttcactaattctatgtccctttcgattagttggcacacgtc1ttatctagcctcatcct 

cagacgcagttgctaitagtccatcagatgcgrccagtatggaattgggaggag^ 

gtagtccagttctaatacatcaaaatcctgrtcttccccttcctccattttgttgct 

atgccgtacaccgaatgacgagaggaggtaggagcaggccctttttatagccctaaagggccattatcgacagtact 

tgatattatacajigcagagaaaaatttcagtctacttgcaggagacatcttttta^ 

tgtttcttgtctgttggttttttgagtcttttgttgtttagcttatgtatt^ 

gtaaaat atccttttcgtacttgat aaatct atc aacgtact tt t tt attagcctgt ttctggt cgtggaattccgtgaagaag atcgtagaaagtt tgc 
atcagcagatgtggattctgtatctgaatcttcttcctcctcctctccaagaagagaatcgac^ 
tcttcttcctcatcgcttgaagggatggcx:gaagacgtgggagaagaagaggaggatgaggaa 
ttgttacc^gcccatgttcctcctcgacatcatcttcttcttcaatct^ 

GCTAGACCATAAAGCCCHXXTGGAAGCAGCCGATGCTCCAGGGTCCATTTCT^ 

TTTACTTTCAGCCAGCCGTGTTAGTCATCTACGGAATGGTGTCTTTGTCTT™ 

TGTGCAGTTATCAAAGCATATTTGTATCAAAGAACCTTCCGCACCTATTAAATTTCCAACCATAAA^ 

TCTTTAGTGAGTGTTAGTTTCTTTCTATTGCACAGTATTATGTCCACATCGTG^ 

TTTCTCCTTCTTCTTCCCCCCACTCCCTTTCTGCTTGT^CCTCCTCCTCGTC 

CTCCCTCGTTTCrrCCTCTTCGTCATrTATTATATGTCTCCTAAAATTCTTGTC 

AAGAAACTTGCATCCATCATGCTGTTCATTCTCATATCACAAAGGCCATCATACCTATCATCTCTGGCCATGATCGC 

AC AT AAT CTT AGAC V C T TTTTT AGG TGAGAAAAGT TGGTT TT GG AAC TGT CT AA AAT T AATG AAACC ACGGGACAAT TT CTT AGCCAAAC CCAGAAGAT G 
AGAACCTCCGTACCCTTCTCTGTGGGAATGTGCTAGTGCAGCATTATGTATAGTAA 
GTTGCCGTCACTACAGGTATGAGATTTGTCATATCTAATTCAGGTACATCTAAACA 
TTCTCACCGTCACTAGTTGATGTGCATCTTTTATGGCTCTATC^ 

TAAACACCCTTCGTCTTCATCATTGTCGGGAACATCTTGATCAAAGTCCATGCCGTTATTGTTATTATA 
CTATTCCTAGGACCAGAAGGGAGCATTAGAGCAGTTAATGAAGGTGCATCACTTTCTATTTTATTTG 

AAGTGTCTATCAGGGCAGATGCTGCATCACGC ACTCGCTT CT CGC GTGGGAGAAGATC GCAGACAC AAATT 

AGAAAAAACGAATTGAGATGCGGATTT AAC AATGGACAATAT TTG ATT ATGAGGAAGT AC AG^ 

GTGATTTTATTTTGGGCACCTATAAC TACT AGCGCTATGTGC ATC GCGAGGACAATT ACTTCAGT GAAGATACCATTTTCTAACGTCAAACCTTCTACGA 
ATGCAACCCCTCCTCTATTTCTCCTCATCATCATCTTTGAAAAATGTTC^ 

CATAGTCTTTAGGACTAACCCTTGATCGATAAATTGTTCCCTCATAACTGCGGCAGCAATATCTAT 

GCCTTTGAATXTTTTGATCCTCTTATCCAAACTATCAAAAAATCTTCTCTCTGCGGGC^ 

CGTTTCTGTGAGrCCTACCGATAGTCTGTAGGAAAGCAATGGCGTTATAAGGTACATCTAAAACGGA^ 

AGAGTCATGTAAAGAGAGACCGGTATTGCCTTTAGGTCCCAGCATTATAACGTCGACTTCTTTC^ 

GCAGTTTTAGTGTTGTTTTTAACC^GAAAC^TCCCCTATTCGTTATAC 

ATTGT GTGATTGAATCT ATAGGGTTGCCTGCT AGCGCCATGAGCGGTTCCGTGTC AGGGACTGAAGGCACAAGCATAACGTGAGCTGGGG AAGTTCTTCT 

ATTGGTTTTATTCTTTAAGGAGATTGTGAAAGCTGAAGCCATCATGACTGC^ 

GAATCGAATGTCCCCACGTCTACGATGCCGTATTTTGCGTCACTTTCATCCTT^ 

CCCCAGTTTGCTCTAGTGACATGACCATCTTTTTTGATTCGTTTGTTCTCCTT 

TGCACCTTTACATGCTGCCACAGAAGTACTTGCTGTATTAATGAAGTATTGTCT 

TCTTCTTCATTAGCACCGTCGTCATAATCTTCGTTAAAATCAGTTTTACCTTT^ 

CCATATCAATGCTGCTTATGTCAACTTCCGGAGACGTTATTATTCCTCCTXXrrcCAT^ 

TTGTACTATACTTCTTAGTGTTGTTTTCCCATCATGTGCGACATCCCTATCGCGCACAACCACCACCTTAAAACG^ 

TATAAATCATCATCATTTGCAGCACGTTTCCTCTTTCTGGACGGGGAATTAATCTC^ 

CTTGGTGTTCTCGTATAATCTTTGTGAAGGCCCrrTCTGACATGTC 

CAATTCGTCTATAGCGTATTTTTGAAGAGGGGAGGCTTTACAATTAGTAGTAGAGCAGT^ 

CTATTTCTTAAAAGTTTAATAACCTGTTCCAGAAACACTGTCACGTACTCGCT 

CTTGTATGCTCOTGTAGGCTGGCGCACTTTTCCTTAGTATGTGGTCAATCATGT^ 

GAACGTATCATACTTTAAGATCTCTAAGAAAATACGGAACGCATCAGCTTGAGATAATAGTTTCAAGTAGTTG 

GATGATTTCATGCGCTTCACTTTAAAGTTTCTGTCGTCACTCATTGCACTCTTGAACCTATTA^ 

TGTTGCGCAAAACTCTAGTATCTTCTTCCTCCCAAGTCTTTGCCAGTTCCTTTCT 

TAATGCTGTGACAAAGTTGTCAATGGGCATAACCTTGTTGGAATCGAGGTCTGTTCCTC 

TCAGAGTACGTCATAAATAAAATGGTGGGCGTTTCTGTTAAGAAATCTCCT 

TGAAAAATCTAATATCCACCGAGTCTTTAGCAGACGATACAGAATTTTCCACATCTACTGTCAT 

TTTATTAGCACAAGGCACTTGTAGTTTTCTCCATGAAAATTTAGGATCACGAA 

TTGAAAAGAGGCTTAGAGCACGTCAACCATATAAAAAAGGGGTGCCTTTTCCAC^ 

AAATTGATGGCCCT ACGTGTTTTTGT ACATCT AAGAGTGCCTTTTCTTGT AGTATG ACAC TC ATG AC GAAGGCTGCCAATTC TCTTGTTTTT CCTACACC 
AGTACCATCTCCGATGATAAATCCTCCACTOn^TTCTTGTCCACTTTAAAATTCCTTT 

TTCCACCCTCTCCCTTTGTAGAATTGAATCAGGGCTAGACGCACTGCTAGTGCCTGCACAGAATTAATGTC^ 

CATC GGTTACATTTTTGCAGAAGCCCACCGACCGGAATGTTTCACTCTC AT AAAGAGCGAAT GAAGGTTCTGGGTGG^AGAGGGTGCCAATCTCTTTAGC 

ATCTTCTTCCTCCAGAAATAAAAGTTTGTGAGCCATGGTAATTCAACCAGAAAATC 

GTGTACTTTTTTTACGATACreGAAACGAGTACGCCTGCACGGATCAAA 

TTCCCATGCTAAAGAGAACCAGGAGTTACGTACAGCCATACTAGTCTCTACAAGTAATCGTCT^ 

GAAGTTTCTTGCACGTCCGCCTTCTTATGCCCCCACTCGACGAGTGACTTCTTGAATAGAGAGAGGGGTGTTATTGTAGCTCCTTCGG 
TGTTTGACGCATCAAACGCTAGACGAGGGTCTGACTGTACTACCGGGCCACTCTTGGGGTTATGATCCAA^ 

GTCAATATCCACTCTGTACT CGGATGAGGAAG ATG ATGTTCCG AAC A ATTTTTCACTTC C GGGAGTT AC ATCGGAGC AATC AAAATAGCTGGGGATTGTT 
TCGCTGTCAAATATCCTATTGTGGTCTACTGAAAAAATATCGTCGGTAGCTCCTTC 

GATT GGACACTTTC AATGT ACTGGAA GTT T TTGATGC AGG1 M 1 ' TT CCTTTT AGGTAATC TAAATC AAAC AT ATCTCGCGCT AGC ACAGTAGCAGC ATAAGG 
TCCGTATGAACGCATTAATAACCCCTCTTTCAATACAAAAGTATCCTCAGATTT^ 
CAAGAACTTGATTTGAGATATATTTCACTTGTCCTCATCCCTCTCGCCT 
TACGTTCAAAAAAGTCCCTTATTTCTTCCAGCCTTTGGTCACCTGTAGAATTGTACT^ 

GGGTTTATTTTGACTCAAAAGTGTAGAGAATAAAAGTTCAGATCCGCCGTGAACACTACCCGAAACCATGCGACT 

AT ACT ATGTTCATAGGGTTTTTGAGGT ATGTTTC CGGCC AAAAC ATT AGC TGCTGGCTC AATGT AAAAGGGTGCAGTACAGAATGGCCTATGCTGCCCCG 
AGGAAGGTTTGTTCAGGACGGGAGCTAAACAACTTCTATGTAACAATGAAGACGGGCGGATGATTACA 

GCT AG GT ATTGGTACAAAGAGCACATTCTTAGAAGT ATTTGTTGCAT CCAGGTTTAAAG T TGGCA 

GTAGTTTCGTCGATACTTGATTTAGGTCTTTGCTGGTGAATCTGGTCCCCTCTTTTAGATTGGTCGGCCAAGAA 

CTAACGGTGTCCTGTCATCGCCTATTATTGCATGCCTGTTTGAGTCTTGGGGATTATT^^ 
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AAACGAAGXSGTCTATGTTGGGCATCATAACACAGGGGTTTTCCATACJVTTC^ 
TCGTTATTTAGTACATCCTTCTTATTATTAC^GAAACAGGTGATGGGGTTTT^ 

AAGAAGATTGACGCAATTTGGAGATGTAAGAAGAGTTGCGGAATTCTGTATGATTCCGTATTTGTTGTTTCTT^ 
AGATGAAGGAGGTGGTGGTCCTTTGTGGGTTGAAGGATTGAAAGCGCAAGCAGT 

CTGCATTGCCAGCCCCCATCATCTGTCAGAACGACTATAGATGTGTGCACAGAACACTCTTTAACTACACGTTCTCCAG^ 

GGTCACCGGCC^CGTTTCCTGACCAACAATAAGGTTTAGGATCTTCTCCAGATTTATGTGTGATAAATGTTC 

AACTGCTCTCGAGGTATAACACACAGATGTGCAAGTTTGTCCCTC^ 

ACATCTTC^TCATGTCCTTCTGATGTAGATGGAGGAG^AGGAGGAGCGTCACCACTTACGCCT^ 
AAGAATCGTACAATTCAGTATCGTCTTTTTCTATATTATCCTTCAAATTGACCA^ 
AGATTTTTCGATGGTGGCCTCACTCAATCTATCTATATGGTGTTCAGG^ 
GAATTCCTTTCCTTCGTTTCTGACGGAACGTTCCATTTATTCCCTTTAACACT 

CATCTTCCACCCCGTCATGCCTACCTCTG^rTCTAGGTATTCTTCTAGTGTCTTTTCAGGGTTAGCTC 

TAATTCGTGTATAGCTTGAAAATTAAC^^CATTGGTTTGGCTACATTATAGGCCATTGCAGCG 

GTCACGAATAACGTTATTTGCGCGGTGAAGCGCATACTTCGGTTGGTAG^^ 

TGGAT AT CTTCC T AG ATAAATCTC AT AOlGGTG^CACCCTTTTAG<K3TCTTAT AAGACGCJVG ACTGTGAAAAAT AAT TCT ACGTTAAAAAAGCTTAAAAT 

GGGCGCGCCTACCJVACGCTGATTTTACACGCACGGTATC&GGAGTAGCTTCTT^ 

GTTTTGGCCTCTTCTTATTCTGACTCTTTTGTGTACAACTACAAGGATGCAGTGGTGACCGCTGA 

TTCATGAGCACATCATGAACAGACTTGl^AAAAG^TGGTCTAATTAACAGATCTCG 

TCGCTATTCTGGAGGAAGTACTCCCCAGAACTTAATTTTCC^ 

ACAGTGGCAGCATCGGCCAAAGACGCCATCGAACGCATTCCTGATCTAAGAGAAGGTGGTACAAGTAAAC^ 

GTGTATGGCGAGCCTTTAACTGGATAGCGGAAGCCTCCAGATCAGCGGGC^TGATTCGCTACGAAC^ 

GGTAAGAAGAAGGCAACTAAAAGGAAGTTATGAGCGAAATGTACTATTTCTITGGTGACCATTTCTGTAAAG^ 

GGGGGAAGAAGTrTCTGATTTCTGGACGATCGTTGAAGCTGTCATCCGGT^ 

AAGACTCTTCTATAAACTTGGAGTGGGAGGATGTTCTAGTAAAAAACCA^ 

TGCCATCAAAGAACATGAAAGCATAGGAGAAAAAAGGAAAAAACATATCCTAGAGTTTATTAAAACATGTCT 
AAGGGAGTTCGAGGAAAAGGGAATTTGTCCCCGGCCCATCTAACAAATCTGGCCGATGCGATACTG^ 
TGCAGAGTCTrTTTAJ^GCAAATCAATTTCTCTATACTCCATTTAATAGG 
TGCTCTITTTATATCTCAGAGGGGAATTGGTGACGTGTTTCTGMTG 

GACATGGTTTCTCGAGACGCCTACAAGAACAATACTAACGAGTCTAATAATTTGGGCAATTATTCCCAATTCGATATCG 

CTGATTGTAATGCAATTCCTCTATCTATTAATATTGGAOTACCTCTC»TAAATCGAGGATGAACATGCAAGACATT 

GGATATGGGTTTGGCAAAGATACGTGTAGCXSGAAGAAAATAGAGTAGCTAGACGTCTTT^ 

GCAGCTGCGAGACAGAGAAGACTAAGAGGTGAAGAAGAAGAAGAAGAAGJUXiAGGAGGAGGAAGAGG 

GAACATCAGGTGTTAATGGTTCTGGTTATGATCAAGAAGAAGAAGATGATGGTGAG 

GGGTGAAGATATGAATGGGGAAAATAACTCAAGAAAACGTAAAAACACTG 

AATGCGCCCATTTCAACCAAGGGAAAAAGGTTAAAGGAAAGAGATAATATTGGAGGATTTT^ 

TGGAGAGCATACAAAAACTTTTGACGGCCAGGCAAAGAAAATATGTTAAAGGT^ 

TACAAATCTACTAGATTCAGTGTTCCATTTCAGAAAGGGGTCTATTCTTAACAGCATACACGCAAATAG 

GCAAACTGTATTTGTGATTACTATGAAAGAAACGTATCAAAAGACAGTAGTAATAATACACCCCATTCATCT 

ATGCTTCCTGTGCCGAATCTAACAAACGCCCTTGTCCTGTAGACTCTAACAACCCTGAAGATGTGGAACAACCT 

TTCATTATCAG^GGTAGAAGATTCTCTCGCTATCGAAAGAGTACTTCAAAATGAGATATTAT 

TTAGGTGCTGAAAAAGGAGATCTMGGAAGATTTATCGTAT^AAATAACATTGr^ 

CTATGCTTCTAGACTCAAGAGGCAAGACAAAAAACCTTCTAGAAAAGGGTACAGTGAAAAACAC 

ATTGAACGTGGCCACTGCTCAATCTGCAGGACACCAATACATGAACG 

ATG AAACTGAAATCC AT CGCCATGG AAAAGGGTC GAGGAG^GAG ATCGGCCCTTC AC AGAC AGAAATGTGACCATGC CTTTTGCAAGATGCTGAAGTGTT 

T ATT C TTCAAT ATT G AC CC TT C AAATGCTGCAGAT AC ATTT ATTG AC CCTGCGTC AC GTGC C AC C TT ATTT AGGC TTGATGATCTTTGT AGGGAT AG GAA 

AAAATACAAGAACArTGACTGGGTAARGGATTTACTAGACCCTGTAATGAAGGGAACAJ^TA 

GACTCAAACGTGGC CGCCCCTGTTG AT TTTTACACAATTTTG AAATACACAATC ATTGATG ATGGTGTAAT A 

TGTATTACAGTACTATTGAAAGAGCGGATGACTTACTCACCGAGAGCAG^SGAC^ 

GTTGGAAGTC^CGGAGACGGACGTGTaXTTACCCTTCCAGTGAAACCG^ 

GACGACAGTACTGAAATAGAAGACGTTCAAGTACAGGACAGTAATCTATTCGATGTGGAATTATTCG 
GAGAAGAAGAGGAACTTCCTTCAGCJ^TATCTGAAGTGTTTGCTTCATTA 
TTCTGAGGAGGGGGAAAAAAGTCCAGAACCGTACAATATTTTTC 
CCTAAAAGAGTCGACTGGAACACTGTCACCATTCAAGAATGAAAATATTAC^ 

TTGGTCAACATT^TGGAGCTCCTCTAACrTGCTGAGAAGAATCCAAGCGCTATCCCAGCTAACGAACTGAAT 
GGAACCCACTCTTTAAATTGGAGGACCACATAACGAGGGATGAAGATACT 
ATATCAGTCTGCCGTTTTAATGCTGGCTCTCGATTCGTTCGTTTCGCTTGCATCTTTTTCC 
TTCTGCGTAAAAATTCCCCACGACACTCGCGCTGAATCTTTACTGAATAATGTTGGATTCCCCGCTGGAT^ 

ACTAC AA GG^TGC AAACTTGAGCG^TAAAAGCGGTAT AGATGGCCTTTCAGGTTCCATGTTGAC AGT ACT AAAAAAT AACACC AAT AAAAGAGCAACTGA 

TATTTTACATTTGGTGAATAATGTTTCTGCCTCTGCACAACAACTTGACGATTCTG 

TATGATATAAATGTGTCCAGTTCAAGGCAAGTGAACCAGCGTAATTTACTCCACCATCAGAAT 

AACTTGAACGCGCACAAAATAAAAAAGTCAAGGAAGAGGAAAATGGTGAGCATGAAGAAAT^ 

AGGAGGTTGCCTATCAGATATTGACGAGGAAGATTTCTATGAGGATGGTTAC^TGJ^ 

GAAGAAGATGAAGAGGATGAAGAAGAAGAATATGACGATGAAGAAGATGAAGAAGAGGCAGAAACTTGTGGTGCTAATGCT 

ATGCAATCATT^TTCCCCAATGGACAAAATTCAAAAAGGAAGAAAAATGGTAAA 

TAACACrTTATCCTTTGTGGAAAAATACGT^GGJ^TTGTAAGAGCCTAGGTATAAAGC 

TT T AT GAAGGGGAG TGAAGC TGAC AGC TGTT AT AAT AC T T GT CAGTC CAC C AGAGGGGCT AGC C G T AT AA GGTC ACT AC T C AAT AAAT ACTC TGTTAAAG 
ATTTGATGCAGGTAAACAGCCCTTCGAGTTGGAAATGGGCTAACCCTCCCGACCGACGGTTCGTGCT 
TAAGTTTGAAATTGAATGTGAAAAATCCGAGTATTTTGATGTCGTATCTGAACTCCCTAGTAATATTAAAGTATGGTT 
AAACATTTGGCTCTGArTGAAGACTTTCTTCCAGCTATGGGTGCTGCTACCCCAAAAATTCC 
TTAGAGATATTGTTGGATTTAAAATACCAGAAGAAGTGCTCAGTTTTAT^ 
* ATTTGATCGTATAATAGAAGTGATAGATTTAATGATAACTAATGGCGCCTTTGCGACGTCATGCTTGAACAACGCATTCTTC 
CCCAGAGATGGGAGTAACACGTGGCTCCACACGGACCTTGTGCAACTCTCCACCTCCATATTC^ 
ACAACAACACTGGTAGCAATTCTTCTAGTTCTTCTTGTGGAGGGJUITAAGGG^ 
CCTGAAACCACCACCTGC^GCGATGACTAATTCTTCTTCCCCGTCATCCTCGX^ 
ATAACGACAATCIATCCAAGATTTCTCAGAAGTTTCTGGGAAAT^GAGGCTTAATGGAl^ 
ATGCAATATACGACTCTGGCGCATTCAAGGCG^TCCTAACATGCAC^GTCAAO^TAA 

AGAGGGTGTGGTACGAAGAAACC7GATGGTGAGTCAGGGCAATGACGTCAATGATGCCCACCAGTTCCAGGAAGAATGCG 
GCTTCTAGGGTGTATAAAAGAGCCCAGCGCCGAGGTTCGGCAGTCAGTTCCAGAAGAAGAGTAAGGAACAAACCCCAGTT 
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AAGACGACGJ^TGCGAAGAAGAAGGCGACTTTTCTTCC 

AGAAGACGACGACGTTTTAGTGTCTGTAGAAGAATATAACAACAGAGTAAGCGGTTCTTCCACCACAGCCGGAGACAGAGTT^ 
TCTACTGTATCTCCGAACGAAAAGAGGAACTCTGCCGCCCTCGCCGCACTCACCATATCCCGGCACTCTCTTTTCAACGCTCTA 

TGGGAGAAAATG^CGTTTCTTCCTATAAGAGCACTATTGACTACCACAACATTGAAGATATGGACGATCTCCAGCGCGCCACCTACAAGGATCGTATGG 
AGACGGAATTGGTCCTCGAGATGGCTAAGAAGGAGGGAAGGTACGTCCG^TCGTTGGCCACCATGGACGAATTGGAGGTAC 
CTACACTTGCGGCTACACCTTOATTAGACGCAGGGCACCCCCACCAAJWVCGCAAGTCAATATT^ 
GCACCATCCCCCGTCCGTTTAGAAGAGCTTGTCGACGTGC 

CACAAGCCGAATGTGAAGATGATT ATCCTCCACCAT ACGACCCAT CAGAAAATCCACAGAGGTCCCAAGTGTGTG ATTATTGTACCACACGTCAAGTCC T 
CAGTTCTATGACGGATCACGCCAGGGCCAACCTCATAAAAAATCTGAMAGGGAGA^ 
GGTTGTACAAGAAGAAAATAGCATTGATTGTATCGATGACGAACTTTGCATGTTTTTGA 
CAATAAAACAGACCCCGAATCTATTTTTGTCTATTCTTTATTACT 

GAAGGGAAAAAGAGGGCGCTAGTTCATATACTGCCCAATAGGTGGTCGGGTCCAGAAACGTCTGTCTGTTCCAGAAATGT 

GTCATTTCTGGAAAGGGTTGTAATTTATTATAGTTGGTATATTATTTCTGGTATACATTTCTAGCTCCrrCT 

GTCCACCCTTCGAACTTGACATAAGCCCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAAATT^ 

GGGT AAAAAGAGGGTTGAAAGG AT AGCTGAGGCCACCTGGGTCAGCCGTGTT CC AGAAAC GTCTGT CTGTTCCAGAAATGTGTTTAGAGATTTCTGG AC A 

AGTCATTTC7IXK5AAAGGGTTGCAATTTATTAAAGTTGGTATATTATTTCTGGTATATATTTC 

GGTCCACCCTTCGAACTTGACATAAGGCCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAA 

GGGT AAAAGGCT AGT AAT AGAAG GTT GTGT AAC T C GG C AT AACC C AGAAACATAT ATCAAAAGGG ATGGAAAC AA CT ATTCTTTTTATACAGTAGTTTTA 

TTGTACCAAAAACTCAGAAATCTCATCACATGACAAATCTGAATGTCCCATTCCCT 

CATGTTTTACATACAGAAAATTTTGGCTTAGATGCATTATCAACAAACACTTTTACCACATTGG 

AGGCCATATTCTCCATAGAATCAGATCCATCAACAAATATATAGAATGTTACACCCTTGTCGATCCACT 

AACACTGTTTCCGAAGATTTCTTTGTCCCCATCCATGTATTGGGAAAAAGTACAGTATGGTCCACA 

TCCCTAATAAAGAAAGACAGGCAATTCGACTTGATCTGGAAACC^U^TGTTGAG 

GTGTATCTGCTTCAAATGCTCCACACTTCCCTT/TATGGATCAGGCGGTTCTCCATAATC 

AGAGGAGCAAGTAGTAGTGTGGT^ACTAGCAAGGAiXT^CAGTCTTATATATAGACTAACACCCCCCCCTC 

GGGTCGGCT AGAAC CC AGAAACGGTTTGACTCACATCTGGT ACATT ATTTCT GTT ATACATTTCT AGCCC CTTCGTGT AATC TGACATTGGGTC GACCCA 
(^GGTCCACCCTTCGAACTTGACATTCGATCGACTCAGCGGTCCATCCCCT 

GAAGGGGGAAAAGGGTT AGTGAATAC AGGC AGTTCTGCCTTGCTCCAGAAAT GATTCTAAAGATTTCTGTGCATT ATTTC TGGT ATACATTTCT AGCCC C 

TTCGTGTAATCTCACATTGGGCCGACCCAGCGGTCCACCCTCGGAACTTGACATT 

AATTTTTGAAAAGTTTTTGAGATGAAGATGAAGGGGGAAAAAGAGGGTGCTAGGGTGCGCTCC^ 

AACTAGTT/TCTGGGACATTTTTTCTCCCACTGACGCAGACTATATAAGCTAACCACTAAGCATATTTTGCACACATTT 

CACTCTTGTGTTGCTCCGCATAGCCTAACAATGGCCCCAAACTCCTTCCAGAAATT^ 

AACATGACGAOXTTTACGGCAGATTGATTTTAGAJGATAGAAAGACATT^ 

ACATGATCGAGTATACAAAGTACTACGCTACGGGAACCCATACAAGTACCGTCGCCCCAA 

GAAGTAGGAACATGCCTCCCTCTGCGACCCATGGAAGAGACTGAAGAAAAC^ 

GCGATGGCATGACCCACCTTTACAACGACGAAGAGTArATAAAGAAGTGC^VAAACAATTGAAGG 

ATACTTCAGACAAGCTCTAGAGACATTGATGATXrrCCCATTCTTATAAAACAATATTCCAATTT^ 

CACAAACTCCACAC^TTTATTAATATGGTACACCCTAAAAAGGTGTCTGTTTTGTGA 

TACCAAAAAATTTATTATGATGTATT7CATTGGGATGGCTTTTCACTCACCAATCTGGCCTATG 

TCCATCTCAAAAATTTTTAGAAAATTTTTCTGGGTCGCTCG^ 

GTCGACCCAATGTCAGATTACAGAGAGGGTCTAGAJUVTATGTACCAGAAATACTGTACCAGTTATAAGAAAATGTGACCCTT 

AGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGACCCAGATATCTGTTGGCGGCTGTCAATAG^ 

TTTTACCCTCTTCTCCATCTCAAAAATTTTTAGAAJ^T^T^TCTGGGTCGCTCXSAGTTTAGAGG 

GGTGGAC CGCTGGGTCGAC C C AATGTC AG ATT ACAGAGAGGGTCT AGAAATATGTACCAGAAAT AGTGT ACCAGTT ATAAGAAAATGTGACCCTTTCCAG 

AAATGGCTTGTCCAGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGGCCCAGGCAGCCT 

TACTAGCACCCTCTTTCACCCTTCATCTCCATCTCatfiAAATT^ 

TGTCAAGTTCGAAGGCTGGGCCGCTGGGTAGGGCCCATGTCAGATTACAGAG 

AACTTTTCAACATGT<^ATACACCGTATGAACGTATAGCCAAATATGTACCAC^GTCAAAACATTTAG^ 
AAATACTTTATTTGCATACATAATAAAAAATCATACAGTTTCGGACTCA 

CT ACGTTT ACAAGATACTT GTTC ATCAGGC ACTGCX3ATTCTTTOACAAGACACT ATTGGTTCAACC A GTCAGGTC TTA 

GG GAAGT AC GTTCTCTGC A T AC AGG AC AC C GGGT AATTTC TC C AC GT TGCGT ACT CAACAC C AATC T GT CCC AAC ATTTTTCATCAA T AC AATGTCC ACA 
GCCTGTCG?CCTGATGATATTG<5TACTACTGAAATCTTCTAAACACAC^GG 

AGAGCCCTTTTAATTTCTTCCTTATCCCCAGAAATTATGTG^CAGCCCAAGGATAATTTGAGGATGCAATATACCCCAAGCGA 

AAGAACGAGGTGCTTTATTGACIATCGAGACCGGTGGCAGCAACAAGXSGGTTCTTCT^ 

ACTAGGCTCGTATCCTCTCTGGGCAAACTGTGTTAATCCGOTATCATTGAACATCAATAATGCTGCTT 

AACATATTAATCTTTGCCCCATGGGCTCCAACTAGTGCATCAGAGACATCGTGTTTGTGATTGGCGATGGCAATCATTAG^ 

CCTTCTAGCATTCACTATACTTCTATCGAAATCGTATATTATATTCTTGTATT^ 

GCATGGAATACGTTTTCATTATTTTCATTAATAAAi^AAAAGGCCAAGCTGTTTGTT 

TTGCAGCGCAAACCATTAGAGGrGACATGCCTTTCGCAGAAGGCTTCAGAATGTTAAGTTTTT 

AATGATACAATAACACAATAGAGTATTCCCCTTTTCATCAAAATAAGTGTGTTTG 

CACGAGACTTTTACACACAATCTAACATAGCCTTTAACTTCATGAGGGTCTGTTATCACAACGTTTTT^ 
CGCACAGTTTGTCGGTCTTTTTTCTTACCATAATCT^ 

CAACAGTATTGCACCACACAAACCACTACCCACTGCTTCGTTTATTGCAACATTTTTTAGAA 

TCAAATATACTGTAAAJGGGTTTGAAAATGCTGGATTGTTTCAGCCTTCTCTATAAACACATCAACA 

TAGTCTCACAGTACTCTTCTACCATT^CTTCTTCTATCGAGGAAAGAAAAGACTTGATAATCACA 

AT^AATCTAGCTATAGATCTAGCGTCCATGTTCATTTCTTCTCTGAAGTCAGOTGACACGGATGATGCTGATGTTTCT 

ATAATAAAATTGCC^TTGGATArCAGTCGTTTTGCCTTTGTAACACAAG<^GATTCGTCCAC 

GTGGTGCAGATTTTTTCATTTCAGCCACGTAATCAGAGGTGATATTGACGATTCTTGAAAAGAG^ 

GAAAACAATACCACTTCTTGCAGAACTAGTAGACTTTTTCAGGCTAGCCAAAACACCGTCCAACT^ 

TCAGCCTGTTCCTTTGAAGTAAACTTGAATCCAGTTCTGCTGTCATCACCAGTGCCAAACTTGAT 

TAGAGACCAGAAGAGAATATrTTACGAACAAAAACTCGTCGTGGATGTTTTCGTAAAGGCCTC^ 

AGTCATTCCCTCGCAGATGGGGGAAGAATCAGACTT<KTATTGTTGTTGATAAAGAAGTAGATAATATCT 

AAACTACTTGAAGGAACAGGAGGAGAATTTCTGGAGGTAATTATGTC^TTCAGAAGG<K:CAATTCCCTTrCTG 

GTTCAATGTTTTCAAGTACTTCTTCAAGCACCTGACGGTATCGTGGAGCTG^ 

GCGTACGCCCGCTTTTATACAAAGATCCCGTGTAAGAAACTCCCTCCGGTTCACT^^ 

TTTTTT AAT ATAACAAAAAAATCGTACCGCTTATTGGCTGCTATAAAAGAGGGAGCACCTGCTCACTTGGACATCATT AACC ATCATCAATAT 
GAACATCAATATTTGAACCTAGTCAG<MAGATCCTAGAAAGAGGAGTGAAGAAGGACGATAGAACTGGA 
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TGAGG TTCTCTCTTCGAGACGACAC T ATTCCAGTT CTCACTACC AAG AAAATTTTCTGGAGAGG AGTTGTGGAAG AACT CTTGTGGTTCATC AGGGGC AA 

TACAGACGCCAAAGMTTGGCCAAGAAGAAGATACACATCTGGAACGCTAATG^ 

GGAGATTTGGGACCCGTATACGGATTCCAATGGCGTCATTTTGGTGCTGMTATGATACCTGTTCT^ 

CCAATATACTAAAGACCCTGAGAGAAAATCCAGATGATAGAAGGATGATTATGACGGCATGGAATCCTATGGATCTTCA 

CCACATGACTGCTCAATTTTATGTGGCTAATGGAGAATTGTCGTGCCAGTTGTATCAGCGAAGCGGAGATGTCGGGTTGGGCGT 

TCATACTCTCTTCTGACTCATCTGATGGCCAGTATGGTGGGTCTAAAACCGGGAGAGTTTATCCTCACTCTTGGTGACGCACACATTTATAAT 

TTGAGGTGTTAAAGAAGCAGTTGTGCCGCGTCCCTAGACCATTCCCTAAGTTGAGGATTTTAATGGCTCC^ 

GTTTTATCTTGAGGSGTATCAACCACACAGTGGAAACTTGCAGATGAAAATG^TGT 

GAAATGGTGTAATCGCTGTTGTGGGCGGAGCATATTTGTGTATATAAGAGCCCGTGTTAGCrCCTCGATTCAGTCACAAGAGCGCAC 
ACTAGCTCTCTCTC TCCAC TCAAGATGGC CTTTAATTTT GAAGACTCTACAAATCTCTTTGCCAAT ATGGACTTGACGGCTGGC ACAACAACAGACC CT A 
CCCGCCCCAATATCATATTCTTTGAAAGTCTACTCCCCAACTCTG^ 

AGCAAGTGGAGGTGXrTATGTCGTATTTCTGGCTCGAAGATAATGCAGAAGATATGGAGAATCTCAACAGT 

TTATTCCTTCAAGAGTTTATCAGCAACTGGATTGAAGAGACTGATCGACATGGACAGT^ 

GTGGGGGATATTTTACTTCGCTAG^CATGAAATGGATGGCTAGGGATGTGACTTTCTTTGTGTTTGT 

CATATGGATGTACCAAAAACT ACTAGC AAT TGGTGC AAAGGT AGT AAAGGTGATTGTTGACAATGCATC AAACCC AATGTTTTCT GT ATGTAATGCGTGT 

AGGTGCAAGTACCCAGGCCCAGTGTCATACGTTATTGAAGGCCATGGAGTGGGTCATTCTGATTTGACATGTGATGAGAT 

AAAACCCCATAAGAAACAATAATCTTTTTTATTCAACACCCAT 

CGATAGCTCCTTCACCTATATAGAGGAGGTGGATAAAAACAGAAGATATGGATTCTAATACTTCTATTTTACCGCCAAGCAAACGGCCA 

GTTACAGGTTTTAGGGATTATAATAACGGTAGCATTAATAGCTTCCGTTTCATCCTrTATATTTTATAGGGTAGGTAAAC 

TCCTCTTCCTCAGAATTATCTGATGTAGATAATGGGGTAGAAGGAGGAGGAGGAACAACAACGACACCAAC^ 

ACGTAGATCTTTCTCCTCAAAAGAAGGCTG AACT AAG AACTAGAGTTGCAAACGTC ATCTTTCAAGAAGT GTCAAAGGAT C AAGGAGTGGCCTTTAGACG 

GGCAATGAATGATTCAACTGATAAGATAATGGAAGAAACTGAGGCGAGAATCAATAACTTTTCAGAGCCATO 

TTTAAGGATGACACAGACAAAAACTTTATCCTTTCAACTCTAGATTTAACAGAGGAAO^TTTAA 

AAAATTTTGACTATGAAGACATGACCCGTCTC^TCTTTGATAACATCCCAGAGACTGATTATTTATGGACAACT 

GTACTCTGAAAAGGTATTAGGGTTCTCAGATATAAATAGTATAGAAAGAATATCCTCTACATTTTATAAAGGTAAAAAA 

GT AG CTGTCCTC GTT GATT TTGAATCT G AAAC AAT AAAAGAGAAGGC AGGAAAT AGTCTCAT CC GT AAT GTCGAGTT T ATTGTT GTGGACGAAC AGACCT 
ACAAATCTTTCTTCCCTG^ATTCAATCAAGTTTTCT^ 

TATAGTGGCCAATATTACTCCTCTAACTACCXGAGTTGGAGCAGCTTCCGG^ 
GTAATAGACAAGTACGATACCACTGAATTTGTTTGTGGTCTGAGTAACAAGTCAACTC^ 
TCCCTTCATTTGACGAAGCAGAAAGACCTCTGACGGATG^ 
CAGAAATACTCAGTCAGATGGAAGGGGTTATTTTGGATAAGATAGAAACAATTC 

TATT TTGCGCCGTC AATT CATGGAAAAAAT TAGGGGTT AAAAT AAATGAAGAAAC CAC AATGG AG AAG ATT ATGGGX^AC^ GAGAGGAC AC T 

AT AAG AAGT ATAGTG GCT AATGTT ATCAAAGAGAAT ACTGTT AAAGAAAATGTAAC C GAAAAAAT T AGAGCAATGACAG AT AAGG AAT T AAATGACAAT A 
GGGAATTTATGCATGATTTTGGAAAAATTTCAACTGGAGATGGAGGAACC^CXIATCTCTT^ 
ATATAAAAACGTTCCAGGAGCAACTACTCCAAAATACGTATCTATGAACACrTTTACGTATCGA 

TCACCT ATCATXSGGT AT^AG A^aAAT ACGGC ACCATTCGCAGGGGCAGGT ACGAAGAAAATGC AGGTTCGAAAGAATTGGTTTTTATGACCAAGATTGAAA 
AAAGACCCAATAATCTAGCTGAAAATCTCATTATTAGAGTTGCCAACCA<^^ 

GGGGGTGTCCAAGGAGG AAATGTTT ATAC C AT ATAAT GTTCAGAAAACAAAGGCTCTTAAAGGGCCT AGT ACTOACTTTTCATTCGT AAGGAAAATTCCT 
GATGAACCAGAAGX^ACTArCATAATACACGCACTAG<Kn'TTTATTGAGGAAATAATAATAATAATAATAAT 

ATAGGGAATGTTTCTGC TGC CCTACGAGAGTTGATGT ACTCACCCACAC ATATX5CAGCATCACGATAAGCT AJACAC ATTCCTGGAC AGAAATGTT 
CATC TTCAGAAGAG AAAAT AAG AC AAA TT GTGG AT AAAAT AC GATC C CAAACAAC AT CT G AC AT ATCTG AAACAGTC AAT AATGTC ACAACT AATGGGAC 
rGCATTTTCCCTTTTCGAAGATACCTTAGAAGGTATGGTGAAAAAAAATAT^^ 
AATGACATGAAGAGTCTAGCTACTGGAGCCATCTTATCTAGACAGCGAGATTTTGTTGC 
TGGGTTGTGGTATTATAAGGTATACTGTATTTGTCAATAACCTT^ 

TAT AT AT GGCGGGTATTGT AAAAT GGC T AT AAAGGACTATG AAAT ACC AG ATTCG T AC^G^IAAGGTCGAAGCGGJ^C AT ACAGTTT G AAGG AAG AAAG AT G 

ACCTTTAATATAAAATGGAGAGGCGATACCATAAACAACCTAATAACAATCATCCCTTCAGTGACAGGT^ 

TGCAGGCGCCATTACTTTTAAACTGCAACJlACTGTTT^ 

CAACTTACCGGAAATCGAAGGAGCTGATGCGAATGCAGTCTATGAAATATGTATAGTAGTAGTTTGATGGAGGAC^ 

GTAGTTTTATTTAACTCTTAATCTTATATACATTTCATGTACATTTCATTAAAA^ 

ATCCTCCTGGCCATGACGGCAGATGTCCTCCGTGAAGTGTTGCGTCTGCCGTCTCTGACC^ 

TTGGTG^ATCAAAGGGGCTGTGAAGAGGTGAAGAATAAGGAGAAGGTACAGGACTAGCATCATAGTC 

TGTGGATGCGGGAGGGGGCATATTAGCCGTGTCTAGCAACTCCAACATCTGTGCCTGGTAT^ 

TCATTGT CAT^ ATTAGGAAT AACACCATAATCTTCCATG AGGGCGTC ACGCCTTCCTGCC TGATGCCTCT CAAAGAGCCGTTTTCTTTCCTCAGGGTCAT 
CAGAGAAGAGTTTGTGAACCCAGCATGGTTTCTTAGCCrCCCCTACAGCCAAGTCAGGCACAA 

AGGGGGC AACACCTTGACAG G AGC AGGAG T GG GG GGT AAAGGAG TGGT AT ATTCACC CT TAT AT GAACT AGGTTT AT GGAACTC C TGGCAAGGT ATTT TG 

CCCTCCCTGTCCTCTTTTAGAGGGGCGGCCTTACAGCCTCCAAAAACCTTCTTAGCTATTTTA 

ACATCATGAACCCATTCTTATCAAACTGAAAACCCCTCAAGAAAAACTTATA^ 

TGCACTC AAAACATCCCCT ATC TC CCCTTCCCC ATACTTT ATTTC AGGAAACCT ATTGAGTG CT G AGCAGGTACTTTCT AT AGATGCTC TCAGAGCATC A 
TTTTCCAAGGTTGATGAGCTATAATCAACCTTCTTACAG^TGGCACACTTCTTAACCATATCCT 
CTGATGAAGAGGAAGTTTTGATTGATGTCTTAAAGACGATATTCTTCCCro^ 
GCTGCTGCCGCTGCTGCTGCTGCTGATGATGTCAGTTCCGGCCATTTTTAATGACTCTAA^ 

GT AAC ATTCTTTTATTTT AATAAAAGGCAGTAGAAAATTT GT AC ATCATT^ACAG^CAGGCAT AT ACAAAAACACATCACTTTAT AGATTGG*^ AGGGAA 

GATATTTGATCAGCTTCCGAAGGGTCAGTATCAAACCCATTCAACAATGAACTCT 

ACTTAATGAGACAGGAACCAATTAGCTGCTTArTAGCACTCCACTTTGAAGCCTACTT^^ 

TACATCATTGACAGTTTGTATG^AACAGGTTACCTCTAGATATACAATCTCCAAGOTT 

GCCAGATACCTATTGGCAATCTCATTATACTCCTCATCAATATTATCGAGTC^^ 

AAAT ATT AGA CATC CTCTCCTCGTCACGCCCATACCTTGGCCTCTTT CTT ACT GTTACTACTA 

TGATGACGAAGAAGCTATTACTTCTTCGTCATCAGACTCTGAATCAACATTAAAACAAAGGTT^ 

gaggtaggg^ggatgacgtagaagcactaggcatattcaagtcagcx:aaagacattc^ 

tactaataccactaatagtatcaaaggtaatattgttacttctagggggaacagtcaagtcaggaagt^^ 

ggggacaaacactac^^tatcctcatcatcttcaatgaaattgaagxsatgtcac^^ 

tcatcatcattactcaagcattgagtaatgtcaatagtattgggagctacatacttaagccattcacct 

tacccattaaagctgcctcatgattatcttcaaaggtctcttcactga^ 

ataagccctctccatataaacatcctcattctcattaacaccttt^ 

tcctttacaccttcatcattaacatagtcagcagacaacaaaaatgagtgaaaagaggcatacttgtac^ 

CAGGGGGTATATAGGCCCTAQACATGG<1ACATAA(^GGTGATTTTCATTCACCATATGCTCTTCA^ 

TTTCCTGCAAC^CGGCATCCCTCCATTTGACCAGTTATAGCAATCCGTGTCCGAATCGTAATCACCCATACAGA 

GCGACGGTTTCAGCCATCTTGTTGTTGTTGTTGTTTACT^TAAAG 
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ACAGTCCCCTTAAAAGGGAATATGTGCTTCATGGCAAGGGCATAT^ 

ACATCAAAAAGTGCATGTTTGACATCAAATTAAAGGCAGGGCCTTCCATGTTAGAACAAGCAACAACATC 

ACTGTACCTCATTGAATAGGCACCTAC^CACTGTCCCCTAAAGAACTCGGG^C^ 

TCATCGTTGTAACAATAACCTAGAGGCTATGAGAGTCCCAAGTCAATCATTTTAGGTTGGCC^^ 

C GAC ATT AAT AATTCCCTC ATTTGCAACT CTTGT AATT AC ATT AC AA GT C TCAGC CAAAACAAAAGGC ATGTTC AC C T TT AGTTT T T CAAACAT ACTTC T 

TACAACCTTTCGGGCAATGAAAGAGTCTGT^ATTGGTGCGACTG 

TACAACAAGCGTGCGTCCCTTGACTCATAATACTCCCCACCATC 

TCTCTAAACATAACCCCTGAGGACACACACCAGTCACCCCAACTACCCCCTTAATATTCTTTCTCT 

CTCrrCATAGTTCATATCCTT<ATAAACTTGACAACACTCCCCCTATTCACAATGTAATA 

ATGAAAGTGTTAAAGrCCTTCACATCTTCAGX^ACTTCGACCAAATCAGACAAACAACCA 

AATCTGGGAGGAC^GAATATTACGACTAGACAACTCCTTAAGAATGTGTCTACTGTCAAA 

AAGCCAACTTTAGGGTC CT TTCCAT AG AAACAGGTATCAT GG AC AT AC TT GCT AG ACTT AAC AATTG AC CTATGGCCAArTC TCAAC TGTTGACAGGT AT 

CATAATTGTCCTCAAGACTACAACTCTTCTCAATGCCTTTAAAGTAGTAA 

CACCTTGGACGCACAC^CGTTTGAAAAGGTGCGAGTAGTGGGAGGAGAGGGAGCAACTTCAG^ 

AC^TTGATGACGTCArCCCAAGTCTTGATAGTGATGTTGTTGTTGTTGTTGTTGTAAGCGGGA^ 

TGTTGTTGTTATAT^CTTCCACAGGCCGTTTTTTATTCCCCTGATG^TA 

TAC^GTGCGTCCCCCCATTGCCTATG^AAGAAACAACAACAAATACACGATTAATTCAAC^ 

AATAGCCCATAATATTTAACGTTACGTTAATGAAAAAAAACAACAACAAT 

AGTAAATCCAGAAAACAACAGTTTTACTGTTGGGCACTACACTGCACTGGTGACGTAGTAG 

CTGCACT AGTAGT AGTAGTAGT AGT AGT AAGAGCGTTGG AT AAGC TC GGGTTGGCTGCT AGAGCTGGAGCTGGAACTGGAGCTTGTATCCTT^GCTGTGC 

CTCTG^TGAGACTGATGCTTCTGGCCTGCCCCGGCTCGCTTATATACAAGTT 

TCCAACAGAGTGGGGTCCAGATCTGGCTACGGGTT<^CTCCTGGCCATACCC 

GGGTGTTATAACAACATGTAAGCGTTGGGTAAGAAGATCTGGATCTGG^TGACCCACC^ 

ACCACCTCACCC«TATCTCTCACCCCATCTCACCTATACCCCCATCTCACCCATCACCTC^ 

CCCTGTTCTTGCCOVGCCAACCCATAACCAGATCTGGAC C CC AGTCTTCCCTTTT ATCCCTAACCGGCACCATTTAT GCCCCCAGGCGCT AGCGGTGTAT 

ATAAGGCGGCGCGGCCAGGCCAGAAGCATCAGTTCTCTGCAAGCCAGCAGAAGAGCAACACAACA^ 

CAATACrrCAAGCTACAAGAATGGCCTCTCCAGCCCCCGCCGCACCAAGTCCTTACACCATGTTC 

CTAACTTCATAO^TCGACTAGCTCTCGCCGGTCTGATATGAAGAAACACT^ 

GTAAGCACAAGTTCTCACTArACACTTTGGACTTTGAAATTTTCTATGTTATGCT 

TTTACTCTTTGACAGAAATCTCCAACCACTACGGAGACTATGGATGTTTCACA^ 

GATGTCCGGACCTCTATCTCCCGAAGGCGCCCAAACTTCCT^^ 

AC CCG CT TCACCACCAACA TTGT G ATG AG AGAAGTT AAAGCC AT GG AGT T C CAAGGAG AC GA TTTT AAC T ACTCTGC C TT GT GTGCAAGT AT GC CCCAAC 

GCCCCGTGACTGJW3AGGCA6ATOTTCGCCCTTATGAAGAGTGAGGACGAAGAAATGGGAGTGTC 

CCCTTCAAGCCTCXXCTCTGGACAAGAAGTCGACTCATCAACTTCCGCTCAAATTTC 

AGTGGCTCTAATAGTAATAGrrCCCCTGTCTCTAGGACAGTCTTAGTTTGCACCCTGTTTATAATCCAAGT 

ATGTGAACGTACTTAACCAGrrGTTTCGACAT ^ 

CAACAACAAACCCTCGACCTCTAATAATAGTAACAACATCAGTAACAAGCGTGTTGX7T 

GCCACAGCCAAAAATCCCTTTAATAATGTAGATGGGGACAATCATGGCATGTTTGCCGGT 

AAGTTGAAACTCTTACAAGTAAGAGCACCATCCCTAAAGAAGAGGTAAATGTAGATX^VAGAT^ 

AGAAATTCATACCTTTAATGTGTTCATCTCTGAGATTAACCCCTCCAAATATGACC^ 

GTAGAGGGGGATACTGCTGGCGTTAAACGCTTCXGTAANCTATATCCTCACTCGCTCA 

AGGGGACTGTXTATTATAATGACAAGTCAAAGTTTCTGTTCCACGATAATGTTAACCCT 

TAGACTCCCAGCTAACTTGATGGCATTCATGGGTATTGACATTGTAAAGGTGTGCGCTAAGGGGATTCAAAAGTATATGTTTGC^ 
CCGGAAGTGGAAGAACT^CTGCCTCCTATCGCTGTATACGCAAAGGTTGCCGC^ 

ACGAAAA7TGTCAGTTTATCAAGTATGACACAGAAGGGTGCAAAAAGCACAGTGAGTTATACGCCAAA 
TAAACTGGAAGAGGGACAGACTGCATTCCCATTTGCAAATATTGTG 

AT GT AC AAGACAAAGGATGGCGTAAAGGG AGTCTCAAAAATT G AG G ACGAAAAC ATCAT C AAGGT GAAT C CAGCAGAAGAAAAG AAGAAT AAT AG AGT AC 

AAGCCGAGAAGACXICTGTATTTTGAGATTGATTCCX^TGATGAGGTGTGTGAGAGAACAGAGGAAGA^ 

GACAACACCCCTCGTACCTTCTAATGTGGAGGAAGAGGAAGAGGAAGAAGAGCAGAT^ 

GATAAGGAAGATGACGGASACGCACCAGCACAGGAAGAAATGGAGGAGGAGAAGGAAGAAGAACA^ 

AGAACCAAGAAGAAGAACAACAACAACAACAAOIACCAGAAAGAGAAGAGGAGAATAAGGA 

TAGTAGCAGCAGTAGCAGCAGTAGTAGTAGTAGCJUX^GTAGTAGCAGCAGCAGTACTAGCAGTGAAAATGAA 

CC TGC CAAGATTCAGAAGAGAAAG AGGCT AAGTGAAAGGC C ATC AGAAGCTGCTTCCTCTCCCA 

TATCACCATCATTGGACArACTCCAGACTGCAGTTGATGAGATGATGGAAGAAATTCCTGCGCCTGAGCCTATC 

GACACTTGCACTOUiGACAGGATTTAGTTACTCTTCATTCGTAAGAGGAGATGACCTT^ 

CCCGCTGCTGCC^CTTGCACTTCCGATGTTCGAAATGACTTTTTGGACATGTTCGACGGTTTACCT 

TOACCGCAAAATTCTTTGAGGGCATCACCCTACCAGATGGTACTGATAATGAATGCA 

TATAACCACCAC^TGCmACCTCCCCGAT^CACCCTTCTAGCAACTCAGCCCCCAGAAAGGATATTGATAATTGC^ 

GGTTCACTTTTTGACACTGATGATGATAGTGAAACAAATGAGGTTGAAAAGGAAGCCCCTAAACGTAAGAA 

ACCGTGGTTCCTCTGGTTCTGCTTCTTCTTCTCATTOTATGAGTAGTGATOAAGAATCAGAGGATGA^ 

GTCACCAAAAGCTCATGTTAAACATTCCCCTAAATATGATGCTGTAAATAGTGATGTAAATAACTCATACAACAA 

TCATCAGATAGTGATGCAGAAGCACAGCCTAAAAGCCATAATAAAAGCCACTCTCGTAAACACTCTTCTTCCTCCA 

AATGCTCAATCAATACTCAAAATGTCAAGAAGACTGTTGTACAGTCTCCACCTAGTTTTAGA^ 

GTC^CGCAAGCACACAAAGCCAGTTAGGCCCTATAACAAGAAGCGTGATAATGTTAACACCACTAATAATCT 

ATGATACTCAATCAATGT'ACAATAATAATCTTAGTACTTAACAAGAAGAACTATATTTTTATAATATTTTTACATGTCT 

AAA\ACCAATGT ATTATATGTTT AAAATC AACCCCATTTGC ATGAT TAAACT AACT ATAGTGTGT AAGGAAAGAAAAAAAAC ATGATT ATT TCTGCCATTA 

AAACAACAACAAAAATTCPAAGCTTCTCCrTTCTTTCTGTGTCTTTGCAGATTCGGGAGTC^ 

TCTACTGCTCTCCCTCACCAATCCCCCTATCGCCCTTATGCT^ 

TTCAACAACAATGCCACCAATACAATTACTACCCCACACCCCCT 



gataacaagggagaagaagaatcttttggatcaaactttgatatgtttggtgattcag^ 

ctacaacaacaccacttcctactcctcgatccatcatggatactgattcggatgaatgtt^ 

cgcctcttcttctatccctgtcgggatctctgaagctgaattgaaaaaaatg^aaaagaaaaagagga^ 

gatcctctccctcacctatatgtaggaggagaacctcctgtcgcagcagattataa 

atatgtgcggtgtcgcccctcctcaattttgcgctgaattgcc^ 

catgcataataagaaagggtccaagaaatgtcaattccttaag 

aaaag^gagctgtatttttggctc^cttggaacaaagattcttggctga^ 
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ACAAAGCTTTCCCCAATCTGATTGCTCGTGCAGACACACTGTGCAGTGATATGACATTCTATACT^ 
CTTTGATAAAGATGATGGAGGAATACATGGCGATGCGTCAGAGTATGCAACAGGAGAAAATTTTG 

AC CAATGG<aAGT GCCAGT AAGAAG AGGCGTCTCAC GC CTGACACT AGTAATATGGGAAC AAGCACTGAT GTGCAAGAATTCCAAACG ATGGGAAC AAAT A 

CTGATATGCAAGAATrCCAATCAATGGGAACAAATACCAACCCCATAGAG^ 

ATTGGTAATTACTCCrTTAACGAATGATGTACCAGAATTGGACATGATGTGGCT^ 

AC AGGAAC ATCTCCCC TGTCTAACACCCCAATTCCTACC TGCTT C AC AGGAG6TGCGAATGT AGT AGTGCCTAATGGATTTGTCCCT CCCACGT TCC CCT 
TAGAATGTGACGAAGATGATCCAAGTATTCCCAATTCTTAGAATTACGAAGAGGAT^ 

CC CT C TT GTTCCATC AT AT AAC AAGGGACA GACT TGT AAT GTTGTC C AGGAGTGG TT C AAGGGAT C C TT CTCTCTTGCAAAGC GT AGAGGAACAGTC CC C 
AAATTCTGTAGTAACATTTCCCACGCTTTCT/TTTGTAATATGGA 

GTAATAGATGTATCGTAAGGAGGTCATGTACATCCATGCTCGCATATCACTACATTGTTTGCAGAGACGCT^ 

GGTTCGCAACGACATGGATGACTGATTGATTGCTTGATATGTGACATTTTCICTATArTG 

TCTATTTTTTTAAAAACCTTTAAAAATATACATATAAAATGATGTATTTTTGAAACTACAC 

CACAGGGGAGTCTTAGAGAGGGGTTGTGAATCTGGCTAGGGGTTCATCCCTCAATGTTAATCACACGCAAGTAAA 

AGGGGTGACTTGGTGATATCATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACGTCT 

TATCATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACCT 

TTTCCTCTCCAGATATCTGGGTAAGAAAAAAAACTAGTGATATCATAACTGGG^ 

ATAACTGGCGAATTTCCTCTCCAGATATCTGGCTCTACACATGTGAGTGTTCT 

TTTTCCTCATTTCTTACTCCTCTTTGTTTGTATGrrCCTACTACCAGTCTGCATATAAGAACTGGA 

CC TTTCATGTTT AT ATTT AAT AAATAT AATCACXZATGGATATTTC AAAT AAGACATTATTTT T AGTAGTCGGTACCTTTTTTCTGAC CACTTGTGC ATCT 

TGCAGTCCAACTC^U^TCGTCTGGAACCTCATGCTRGCTTCATTTGTTGGATTTC 

TTCTCGGAAAAAGCTTCGTATTCTCTGCAAGTTTAACCATCTCA 

T T TT G AAGAGT ATG AAGAAGAAGAAGAT AGTGGT ATT GAAGAAT AAT AAT AAT AAT AAT G GT TT GAAT AAAATAT AG AGATACATTT AT ATTGTT TT ATT 

TGCATTATATATAAAAAAGCACTACAAAATTTCGTACACATACATTGAG 

ATATTCGAGAGAGTTAGTAGTAGCAGAAGAAGTTAGCACCAACTCCGGCAGCAGAAGTTGTA 

GC AG CTTC ACGGCG TTGT TGT T GT TC C AGGATAAT CT TTGCGCGTTT ATTCATTT C AGT GGT GC GTGT AAAGAT GCCTCTGTTGT ATTGGACC ACCT T AT 
TCCTATATTCTCCGATGTCCTTATCGCGCTTCTC^AAAAACTTCGCCATTTCTCC 

TTGGTCAGAGCTGATATTATGGTTGTTTAGAAGGTCCACAGCGTAGGCCATACAGCTAGCGGCACGTTGAA 
ACGGTATGGATCACCTTACTTCCATCATTCAAGJATTCT 

CAGGTCCAGGTATGGGTGT^CGCAATCTCCCTAAGJ^GTGTCAGCAACCGTGAAGTGGGAAGGATATAGT 

AGCGCTATAAACTTCTCCGGTGATTTGGCGAGTGACTGGATTACTTAGCCCTGCGTCAATTTCAGACATATTTC 

CTAGGTGGGGCGAGAGTGTCGTTGTTTCTTT<3K^CGTAGTAGTAGCAAGAGA^ 

CGTCGATAACMGAGATGGTTCGTCCTCCTOVCTGCTACGGTTGCCATTCTGGOT 

GGGGCCAGGAACMGTATAACATCCACATTGGGTGTAGAAGAC^ 

GT AAC AG AG ATACC T TGTGGT ACTC GG AAC ATTGC AG AAG AAGAC GT AG AATTAGAGTT G AT ACTTGT AACAGC AGAAGC AG AAGT CCGAGAGGCGATGG 
CAGC AGC TCTGGCGGCAGCAAT C AT AGGGGCTGTAGTGGTGC AAAT C GGC AGAGTGTTGGATGAAGTCGTGGCAGCAGAGGT AGAGTTGATGTGAGACAT 
CTTGATTTCTTG^TGGTTGGTTGGTTGTGGACTATTGTGITCTCA 

CGATTTTCGGGGGTTGCC^GATACTAATATACTCCTCCACGCCCTCTTCTTGCGCCATATCTATAAGG^ 
TATAACCGCCATTGTTG^TCTTTCTGAGAATCGAGAGTTGCGATCACGGCATGG^ 

AT TGTTGGAGCGGG GT TGGT TAAAGTGC C T CTGGT GTTGGTGGC GGT GTT GGTGGTGGCGGT GGT GGTG AGC AGCAGC AGGGGCGTC GT AGTGTTCGT AT 

TGCTCACAATAATAGTCC^GGGCACTTGCATTCTTCT^GGATGAATATATCTTTCTCTT 

TTTTTATGTGAGTTACTCTGTCCATCTCCATTTTXrrCAGAATTTCTTCCCCCTCCATTGACGTCATCCGAA 

ACGTCTTATTCTTCGGACCTTATTATCCCCTCCTTCTGAATCATCAGATGATTCAGAGTCAGATGAATCAG^ 

GAAGAATCGGATGAGTCAGAAGAGTCCATG^SAGCCACTCCCAATTCCACTACTAAArTCAGCT^ 

TTTCTTCC^CATCTTGATTCGTATTAGGGTATGTCACACCAGTTTCGTTGCT^ 

AAAACCTTCCATTTCTACGATCTCCCCTTCTTCCAATTCTTCCTTTTTGTTCT 

TCTGTAGATCCAAATCCACCAGTTCCACGCACGCTTCTTGCTT^ 

C ATT AAT AT ACAC AATCT GTTC TT CC T CGACATCAC AAT ATC TT AAG AAAAT C AACT GGGC AATGCT GGT^TCCCTTTCTG AT TGGCACAC TTTTTTC TGC 
GACACAATTTGTCCATAACAGCCGTCGGGAAACrrTCTCTATTCCATATCCAGTAGAG 

GAAATAGGTCGTAGGCGAUGAACCGGGCGTGGCAC£TCTGGGCGGAAGTGX:AGTTTCCTCCCCGGGAGGG^ 

TGAGTCCATGGCTGGATACAGGGAGTACACAGTGGAGACGGGATGTAATAAGTGTTCAACGTCTGGAGGACAAGTGAGTACCT^ 

CCTTTTATAATCGTGAAACCCCCACACCCCCGCTAAGG^AGTAAAAAAAATTGACCCACCATCACCCCTTGTC 

TTTATCATGGCAAC^TTTACTGAACAGGATCATAAAAATGCGTTTTTATATGCTAATGAGAAGCTGAG^ 

AGCCTTCAGTTTATGCTTTTATTGACATAAAAGAAATAGAAAATGGTTGGGAAAAAG 

CAGGGATATTTCTTATGACTCAAGCJUUVCTTGATTGTGACGCATTTTCTTGCATACCTTCAGATATACTTCA 
TGCAACTTTGCCGJ\ACACACCTCTGTCTCATTTCCTGTCAAGAACCCT^^ 

GGTACAAGCAAAGAGACCCCCATACTGGTTTGCCAGTAGCCAGAGGCGTCCTGATGCAAGATCACGTTGACCATGAAACTGGAAATAAAA 

TCTGAACCAGAGCOTAGTCATGTGGGCTGCTGTCCCTTGGATACGACCTGGAGACCTTACTGAAGGTTACAACA 

AAGGAGGATGACGAAAGGGATTCAAAAAGAGTTAAATATGAAAATGTGGTCATTTCAAAGGCTTATTGTG 

CTGGCTCATGTTATCGATCTGGATGGATGAAATTTCTCCATTTAATGTTTGGGCAGTATTTTACTAATCTTT 

CAATTTAACTGGAAATACATGGTCTGATGTAGTTTCTCTATTGACAGATAACCCTATTGTAGATGC 

ATTATCACCAAGAAGAAGTTCAACGTATTCCCTTCTGAGCAGAC^TCTGCTCGCCAGAA^ 

AAAT AG ACC CATCTTCTGTGGATGCTTTAATGCAGTTTGTT AAT AGGGAAGGTGT AGT AGGAAC AGAG AAAAAATCCGAC CGCCTCATGCGAGTGGCAGA 

CGCTGTTATGGATGa^TATGCGCCTTCAAGTCATGGGTCTGGACGACAGTCAATCTAGACG 

AACCCAGAATATGCAAGACATTTTTCCAGTTCTCTCAAATTAATTGGAGTAACTTTCGCCATAAAAAGATCTGTT^ 

GAAAGG AAAC GGCCATT AAT AATGGCGAAC AGCAT AG AAGAAGCAGATGGT C ACCCGAGACGGTG AC AGAAG AAG ATGC ACTTTT AT TTGCAAGAGAAAA 

TATCACTGAAGACCCAAAACACCCTGCGCCCTT^GTGGACATTTTACACTCACCAGATATCAATTC^TCTATCAAGAGCGGTTC 

AACGATATACTTTCAAGAATTTCTTCGACCAGAAAACTG^AAGAAAAGGCGAG 

ACATTTTAGAAGGTAAACTGTTTTCGGACGGATACGAATGGGACGATAATATTCCTCTGATGATAGGCGTGGACCAAA 

GGCCAGTAATATCTGGGCTAGATTTGCCTCCTCAGCTCTGGAGTCT^^ 

GTACAACTGGCAGCCAGAACGTTCTCCGTATTCTTG^GGAGTCTGTTATAG 

ATCTGGCAACTCTTG£CGCGTCAGCCTTAACCGTTATrTGGAATTGTTATATTCGTA^^ 

rTGGTACGATCACATTTTTAGCCCAGAGGATTTAAAGAAGCAGGTTCTAGTGT^ 

CAACCAGTCACCCCTGAAGAAATCGTCGCTATCAAOnTTTCCTTCAAACTC 

TTGATTTTCTTCAAAAATACTTCCATTCTACTCCTTTGATGGGAAAGAAAAGTAAGTTTGT 

aacaatgaacgcttttgggcagcgtataataacagctgctgatgatagtgacaccj^caccaccac^ 

AAGAAAATGAGGAGTATTATTCTAGAAACAGGTCAAACTCTTAAGGATTACTCGTCT^ 

AATGGGTAAGAAATACTGCCCTAAAAGAAGAGACJU\GGAGCAACACTACTAGTGATAACCTATTCAAGAAAACTGTTTC 
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ATTTCTGGTTCTAGGAATAGCrrCTATTGGTAGCGTCCCATATTACACTT 

TTTATTGCACTTATGAGTATATCATATATAAACATGAATGCTATGGCTGTA^ 

AAACAGACCCCAGAAGAGTAGGAATGGTCCAGCGCCACGTAGGTGTC^GGG^ 

CGAGAGTGACTAAAAAGTGAGGGTTATAACCCCCCCCCOUVCAAAGGCTTTAAATTAGACTCAAACCAAATTCATA^ 
GAGGAAGGTCCGTGCTCAAAAGGTCCCAAAAATATTCCTCAATCAAAAAAT^ 

TTGGAAATCATCGAACAAACAAACCAATCACCGAACAAGATGTGGAAAATACTCTTAACCTTAACAGTCTAGAAAGGG<^ 
TGTTTTCATCAAAGAAATGCAGTCCTATTCTGGATGTATACCC 

TGGAGAGATGGAACAAAAGTACACAGATCAGTTTCACCAAGTTCCCC^ATACCTCTT^ 
CGATGCCTTCTATTAAAGAAGAAGAATTTGAAGAAGAATTTGAAGATGAT<3AAGAAATATACGAAA 

tggagaagattcagaagaggaagaagaagaagatataattgttgatgacgaagaagaagagaatgaagagggaga^ 
aatcatctgaggcgccagactgccgccgctgccgccgccgc^ 

atgactacacactgtccgccctccagcaacagcaacaaaaactgctccagcaacagcaacagcaa 
ctccacacccaacaaattcaacaagtttttactgccgagtaatggcttctcagaacagaccgagctctttg 

tat aatggactc gtggagct ag acatcttacccattgttgctgaat acattatcaacg gccttggttctgaaatgcagtatggaaact cccccagtgaaac 

cgtgcaggagaaaggaagtgaaagatgtgtggtgtcagcctaaaactagctttga 

tatacttcaaaggcctagagatttccctatccctaaaaaaatc^^ 

ccgcttttgaaaagtggatcaaactttcgagtgtccgaatattcgcgtc^ 

cttgctttatatactctcaaaaaaaccccaacattgaaattctatcaaaattaaat^^ 

agatttgttcgagacgggcattttgagcgattcttcattagctacggctatgg<:attttgccaccca 

TTTTCTGTATATTTACCATTCTCTAAAATAACTCGCAAAGAAACTATAAAAT^ 

CCCCAAGTGATAATCCTAATATAAGTGCTCACCAGAACAATAACAACAATAATAACAATAATACCAGTGTGAATATCGAG 

TAATATAAGCAGAAAAATGACCATCACAAACTACCAATGTATGGCATGCAAGGAAAGATGCACAAACAATTGCACTAACGGTAACTATC 

AACCAGCACTTGTCACATAGTGTAAAAGGGGAAGATTTCTTTAAGATirPrAAATAATAGTA 

CCGCTCCTCCCTCTGGAAATTATACATCTAAGTTTTGTGATAGAAGCTOT 

CTCATCGGACAGTTTTGAAAAGACTAAACTTGTTTTGTATGGTAAAGTCGnrTGAC 

ATCAGGGTCTTTTTT AACTCTG AGGAAAAAGAT AATAAGACTAT CCCCTCTAGAGCC GAAAGTGCAAAAAATGC ATTC AAGGAT ATACTTGTTCACGAAT 

GTAATAAAGAACGAGCTGTTTCATATT^GAGCAAAACAAATTATCCTCTAA^ 

CATTATGT*TT<»GAAACACGTGGAAGATTTTTACAAGA 

tgtgataaatacaaaactgccaagagggcaattattggagcacaagacccttctacttctactccctctaaaaa 

gtac att atccgaatttcatt c aaaag at gaagctacagtaagtgcccttctcgacaaaacaatgctcttgggatcg agg ac aat aatgtctggtgt tag 

atgtgttatacgtaacaaragtgtgttttcgggctttgaaaataagaacactaataataarr 

ggtgctgcagtgacaaagatttccgatgaagat/ttggaacaat/fcacgcctgta^ 

taggggcacatcagacatggaaggatgaacaaacactaaaaacaaacactaaacg^ 

taataataaaataaaaaatcgttcattaaaactatcagattttaattggagaacacccaa 

aacaagaagaggtac gc ag aagtcgtggcgtcagctgct c caaagtc accttcaccaac gagc agcagcagcagc aac agc aac agc agcagccctcctc 
tttcaccgctctcaccaacagtgaagaatagtaataataaaccattgtatattcctcccc^ 

gttatttt ataat gt tgtgt gat atttt at acaat aaaaatat at aag aaaac aac caaatg at t tt gt at atc c ttgtt ct gt gtt gggcggaaattcc 
tccgcgcccattatgatgaaaaaattgctagaggccttgaatcctg 

ccagtcccacgcatgagaacctgttattgcaccacgattcgtgtaacaaagtcgtatctgcacatgtgcataaat^ 

aattgccgggagcaatgggcgtgagagccggttcctggaaaaccccacctccgttccacgcactacctcactattgcaccattt^ 

gcacggagagctagctaaaacatccactcgcagagacatcaccgtcaagatx^cgtctcctact^ 

gctgtggtggttacggctactgtgatttctgaagctgggggaactaagatagacgaccgctgcgccttcccgccatgtgactac 

gacgtggaagaagatgtgttagagctgctcgaaggg^gaacaacaacaataacaacggaacccgtgatggagccaaagtctttggcga 

actg attgggcgt aagaggc gtg aagc agt atctctaacaccagtccatx3aagat atgccagatttcttcccct caccctgagc atcccattgg a 

gggcgtct accacgtgaagc agctc ctc c acttaaaggtgcacttgg acg^aaaaggcgt g aagc agctcctcc actt aaaggtgcgcttggacgtaaga 

ggcgcgaagcagaatccttggaggaagaactttctgrrctgctgaagaaga^ 

aaagcgcgaagcagctcccccacttaaaggtgcactoggacgtaag^ 

gcagctcccccacttaaaggtgcacttggacgtaagag^cgcgaagcagcrrcccccactta 

tggaggaagaacttgtgtctgctgaagaagaacgtgaaaagcgcgaagcagc^ccccacttaaaggt 

tccacttaaaggtgcgcttggacgtaagaggcgcgaagcagctcccccac 

gaacttgtgttrtgttgaagaagaacgtgaaaagcgcgaagcagctcccxicactt 

aaggtgctcttgx5acgtaagaggcgcgaagcagctcccccacttaaaggtgctcttgg 

gtctgctgaagaagaacgtgaaaagcgcgaagcagctcccccacttaaaggtg^ 

cttggacgtaagaggcgcgaagcagcagcagcagctatgcctccccctgmgacgatctcgacttcttttacgcacctgt 

tatggaaagcaccagaacctacaggttaaaatgti^g^tgaacatcacatatacaactta^ 

ttattaattgaaacaattggactctttcctacctaaggtttatt 

taacgctttgtttaacgtgctttttacaattgatattggtacatgtttgtgacctct 

ttttcttgtgtxgttagaattagaaggagatgtggagg^itgttgatgtagt^ 

tgtcctcttcaacatactcctctaatccaaagcgctctcgaattttctcctttacaccctcaccacaatctcctatc 
tgtagtagaagtagagaataatggtacaacgttattgatgttgttacttctgcccatagctggt 
tctttgtccttttctatacaaaaggataatcgttcatcccc^ 
cttccagtatttataccagaccctcttgccccatagtttcgtatct^ 

cc at aat tt ctg atgttt t a t c aa c aac aat atcagc tgaaggggcaaaaggccccc cat cttccagtgc ac t attc c cat t ag aagg aagtggact agc 

tggcataggtgacaataatgaagggagtggtgaaaacacaggataatcatccaca 

aac^taatttccatctgcatttcttcccccgacgtggacggttgctgttgttctactccttc 

ctgcttctagttccttcattaccatttccacgcctgttgctagtctcattttcttattgttattttttat 

tgtaatatcagatgagtagtatttgcctggacaaataaggttatcactcatggctx7tat 

AACTCTTTGGGTGGTTTATTTGCACC^GCCGTGAAACTATTTGCCAGAAGAGCCGAGG 

TATTAAATAATGTAACTAAACTGGCCGAACACTTACTATCCCGGCAACTCGW 

AGTTTCAATTAAAGCTGATATTTGGCCCATGATTTTAAGGCTCGCACACAGTGCCATTTCCTGTCC 

AATTTTTCTTGTAGAGTCAGGAGACTACTCTTCAGAAAAGAGTGTAAAAGTGT^ 

CGAATTTGATCTGATTTCTTACACTATCCACTAAATCCTTGTTTAATTCTTCCGT 

AGAAAAGTGAATATTTTTAAGTACATTTCCGTCTGTTGAAGAAGGGTGCTTTAGATC 

CTTTCTCCTGTATCGAGATTTCCTACAACATTTTC^ 

TTTGGTTGTTGTTATAGAAAGACCCATAAGTCTCAAAAATTCTCCTCGCAGTCTCATTATCTGCATCAT^ 
ATTTAAAAGATCTAGGTAGCTACTCATCTGGGAGGGAAGGGGAGGGTTAAGGGTT 

ACGATGGACGTCGAGTTCGGTTTCTTCCACGGTCTGCTCTCCAAGGCCCT^CTCCCAGATGAAAAACATCAACCCGTTATAAGGCG^ 
ATTCTAGAAATAAGGGAGAGGATGGCTGCTGCTCCTTCTGTGGAAGAAGAGGAACM 
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TTCCTTTATAGGAACTGTCTCATCTATTGGTACAATTATCAATTCTAAT^ 
TTATCCCATTCTAGTTTTCTGGATGTGGTATATCCAAGTTTGAAGAAAACAACTGAAGACG 

TTCCAAAGT^GTATGAAAAAACTCT TC AACCC AT AGAAGAAGAGG AT AT T GGTTATAAGGATT ATGTTGTTTC A ATT GAAGACGAC G AC AATGTTGATGA 

TGGTGiACCAACAAGAACAAATGATTATTGATGAAGAATCTTATAAAACTATTGGAGAAAAATCAACCA 

TTTGGTAATGAATTTATAAGGATTCCTTTAAGAGAAACTGCGTTGCACGCACAATCTCTGAGGTACGACACTGA 

CTATACCTCTATTTTATGAAAA.CAGCACGTGCACArGTAAGGAACGTCTTATTG^ 

TAAACCAACGGACAAGTAGAGAGGATCTTTTCAACACGTATACAGGGAATGCAGTAATACGTTCGGC 

AGCAGAAAGAAGAGGAGAAAAGGCATGGACGACTTCAGCAGCAGCAGCAGCTTCTTCTAATT^ 

ACACAAGTGTCTATTGCAAACACTGTTTTGAATAACCCCTTT^ 

TAGAGGATGATGTCAITATACACACTAGAGATGCCTCCAACTCTACAGTCAGAGTAG^TGGAG 

TGTATACCCTAAACAATATATATCTGATAAAGTGCTGTGTGGTTCTCTCAACCGGGAAAAGGCACTGTTCT^^ 

GGATGTAACCTAAACTTTGATATCGTTGACGCTGCCATCATGAAACACCCCGACTACAAGGAAGAGACTACA^ 

GTATCGGAGCATCGGAAAAACTGAACATTACTCACTATTTAAACTACTTTATCCAATAAATCACCATGTCTATATACAGCCCAGA^ 

GAATGAGAATGAAGAAAAAAACGTTCTCAGTAATAGGAGGGCTGTAATATTGATCrTTT^ 

CTAGATATAGATATCAAATATGGGAATGTGGGCGAGAGAGCCACTATTGTAAAGAAGACTAGTCCGATT^ 

TTAACAACAAAAAGATTTCCCACATTGAAGATGGAGGAGAAGAAAGTGATGTTATAGCTTCAAGGACTGTTAGTTCTCTCCAGAGAGATCCT 
TTTAACCAGTTCATTTGGATTGTATAATAGGACAGATTATATGAAAAAATAAAAGTACCAAA 

CTTTTTAAATCC^CAATCAGAGCTAAAACATAGTGCAGTTATACAATAC^TAATACATTTCTCCCTCCAATGATACAATG 

TACTTTTCAATCAAAGCCGAGTCTTGTTCCTrGGATAATTTATCCATGACCAGCTT^ 

GATTCGTATCATTGTGGGAAGGGTTTATACAACTTCTTGATGTCTTTACATCCTC^^ 

AACAGAC TCAGAATCAT ACTTTAAATCTTCTATG C ATTC ACCTCCT ATTTTAGATTT AT T AAAGGGTGCAAAGAAAGGGGTTTCTTCCTTCTTCTTCTT T 
TGGACATCATTTTTACCTAATACACTTAAACCTATCTGTTCTCTTACAAAGGACTGAAAArTGATATCTATATAATGTACA 

TGCG ATT AACATTTTTGT AT GAT T C TG G AACAGGGATCTT C T TT GCAGTC GTCATCATT CTGGAACACGTGCAGGC AGGTT CTG GGACAACACCCAT GG A 
TAGGGCGATAGCATTGGCCAGTTTCACTAGCCACATTTTTGTTTGTGG 

CCTTGCTCGTCACCAATTACACGTATTGTGTCCCTTTTGTTTAATATTTTGTAG^AAATTCCAGTAGTACTTCCTTTCC 

GTTCTTCAAAATCCTTTTCTAAATAATCCACACTCTCCTCTAAAGACAAGGAAGAGGATGTGCATTTGGGCAG^ 

ATTCAAACCTTGAGAAGAGGACATTTTTCTCTTTACTGTTCAGXJACGAGGAGAATTG 

GCTGGAGGTAGMTGAAGACCACAGCGTGAGGGGCGACCATGGCTTAGATATGGCCACGCCGAGCACTGGCATGAATTGCCCCTGAGAG 

AGTGTTTTCACTTCGGTATrTTTCCCTCTTTATTGAGAGTTGCCAAAAGTTTATTATAGTTGTCTGTGClATGTCGATA 

ACTTTTGGAATCCrCTCCCAGTTCTTGCGTTTTATTTGCTTGATTCTGGGAGTCTTTTTTAGT^ 

CAGACATAAAATTAGTTCTATCTGTGCATATTTTATCCTCTTCTGTTTTACCACAAAATTCACTATGTATTATGG^ 

ATTATCCTCTAGACCGCACATGTTCATATACACGATAACTAAAAGGTTCATGGTTTTTAC^ 

TGAACTGGTTCATAGAGCACCACGTTACTATTGTTCCCGTCTTCTCTCTTATTX5GCGTCCATTA 

TGTACGTTATATTGTCAAGAGAAATGCATTTGTCGAATTCGTTAACAATTT^ 

TGATACAGAACATCTTGAATGTTCATCCTCTTCTATATTATTTTTAGCGAGCGATGACG 

GAATCTTTATACTCGAACGCTTTCCTTGCCTCATCTAACCAGTCGT^ 

CCTCCTCCTCTACATCATCCAT1TCATTGAATTTTTTAAAGAAATCTAAAAAGTCACTCGTGOT 

AGTAGATAGACCTGGTATTGACGATACTATTCCTCCTCCTCCTCCATTTTCTTCTC 
ATCCTTTCTTTAGTCAATATTCCCCTATCATTAATTATACCACCTTCAGTGT 

TT^AGTAACAGGTGCACTTTTTCTGGTTGCATTAACCTTACTTTTCTTACGGCTAGAT 
TGGGATAAGATCOTCAGTATCAGATTCTACATCCGACCCTGTATTTTTACTCTCATTAT^ 
GTTATTGTACCAAAAACACGTTGATTAAAGGACCCCATTGTCAAATATGACATGACATTTGCCGCTGC^^ 
TCAAACTAACGTCAGTTTCCAAATCAATCAAGTGTGTTGGAGGAAAACACAACrG 

TTTGATTGTCCAGAAAGGAAGTACTAGTTTCTTATCACATTTTCTACAATTACTTTTAGACAAATTCCCCATAGA 
TCTTTATTATTCTTCCTAGCTTCTTCTTCACTCAAAAAAATCTCCCATCCGC 

CC CAT GTGTCGT ACTCACAGTT ACT AAC AATTGC ACTATTGT AGT CTGATCCTTPTAGTT AGCAT AATCAATAGTGTACTTTC AGGTG AAAGC AT GAGCGG 
CGATTCTGCCAAATTATATATCGTATTTGGCTTTGTTCTTCTTTTAGCTGCAAAACC 

AT GC ACC T AAAGAACTT GAC ATGGC CAAT CTTT AAGAAC G TTTC AGAGT C CTTAAATTGTGT ACC AGT AAGA CGACACAC GTTT TT ATG ATGT C CAACC A 
TC AG GTT C CATT TCACC AGC AC^T C AG AA TCGTGGGAG AAAAGATTC AT AGAAAGTGTTTGAT TTTC ATCGTTGGTrGTGT T GCT T CAAAA C ATT CAAT AA 
ATCTTCCCACCTCTTGGATTCTTTTCTGTTACC^GACATATTATAAGAATCGTACTCCA^ 
CTCTCCAGATTCAAAAGGCATGATGTTAAACGTACGATATCATCCTCTGCTTCG^ 

TTAGAACAGGGATCCTCATTATGTCCAGTAATGCAGATGCCATCAGACCTTGAG^AAGACGCATCATCAT 
GGTCTTCCTATTGAGGCAATTTTCATTCCTTGATTTTGCCTTT^ 

GGAGCCTTTC CATCCGTAAAGAGT ACAGT ATACCTCCCTTT AAGGT T TTT AG AC^TK^GG AGATTTTTCATCACTCGGATC ATCC TAAAAGC GT AGTACA 

AGATGACCCCCTCCATCTGGCTAGCAAAAAATGCATCTTTGAATCTGGATTTCAAAACTGCTGTCGGGG 

GAACAAGAAACTGTTAGTCAAAAACTTCCTCTTCTTCATAACATCTGTC 

GAAAC TGGAGGAGCCAT CAT C ATGT C TTC GAT TGG AGGTT CA GTT AGC C AATCAACTTCCTC TT GGG ATGAAGGGATT AT AAT AG GC AC TC T T T C AATTC 

TAG^CTTAATTTTCTCGTGTTTATATTrrCTAGAAAAGAATTCAGAAATAACCTTCTTTTCATCCCTGCAGTA 

CTGCATCCGTGAATGCATAGTATCTTCTTCCCTGATAACAT^AAAACCGTCATGTAAAACGTCT 

ATCTGACCCATATTTTTCTGTCCCTTCAACATGGTGGTTGCAATTTGCCTG^ 

CGAGTGTGTAGTC^CAGGTAGGTGAAGAAGAGXSAAGATTTCTTGCCTCTTCCAAAATCCAGGAAG^ 

GAGAAGTGAGCTAGTTGTGCAATCTTCTTCTCCTGAACAATAATCCTCGTCGTCTTCGTC^TCGTAACAATCTTC 

CTCCCATTCTCTGCACCGTACGCATTTTCGTACACTTTTTTGGCTTCTGGGACACCCATTTCTGTTGATGTTACTCCC^ 

ATGCTCACTGATGGGCGCCGTAATTT^ACACACTCTGCGTAGTCTGGTGTTTTTAC^^ 

GCTTGGAATCTGCAGCAAACGCAGCTCTCTACAGAGACTTCTCGGAGACTTGCCAGAAGCGAGTTTCCTCTTCTO 
TGTTGTTGTTTGTAGAGTGTTATAGATTCAGTAGGGAAGTAGTTTCTAGATCGAAGAA^ 

ACACCTCGTTGAACACTTCTAGGGCCAATTCCCACACACCTTGTTCCTTAAGTGTCTCGACCACGTAATCAAATTCGGATTCACTCAATAACTGTAAG^ 

GGAAGAATCTCCATCCATGATACGAATCTGGAACACCTGTCTCCTCCTTGACTGTTAACTGATGTCTGACCCCT^ 

CCCTCTCCCGACGGiUVGATGTATACGTCACAATTTTTTTTAAAAAATAACGTTATACACCAA 

GCTACCCAGAAAAGGAGACCTGGAGATGAAGAAGAGGAAGAAACTGGTAGTAGTAATGTTCCATATTATGCCAACT^ 
TGTACACTGGAGAAGGAAAAAGGGGTAAATTTGTATTAGAGCCACCTAAAGAAAGAAGTGTACAAAGGGTGCAAAAA 
GGAACAACGTTCTAATGTTCGGACACGGAGACCTGGTCAAGAGTTTGAACAGAAAGTGCTACAAG^ 

AATTTGGCAGAGAAAGGATTGCAAGAACGGCAAAAGAAATATACTCCAAAG^TAGCACAAACAATGACAAAAAAAATAATCAG 
GAAAATTCAAGGCGCCGCAACAGCAGACAT<HX3ACAAAGGTGCAGCAACCAATGTTCTTGAAAGGG 

AGTAGAAATTACAGGAGATACTATATTAGGTGX^TAGGAGAAGAAGATGACGAAGATAT^GGAGAGGATGAATTAACTATACA^ 
GTATCACAACCCGTTO^CAAATCGTTGTCAGTTCTCCTATACCGCCAAAGCCCACTAGGCCCGCTCCTGATATTC 
AAAATATTAGCCAGTTACCACC^TTACCACTTGATGATTATGAGGACGAAGAAGACGAACATTTGTACG^ 
AACAGCAGCAGXTAGCAGCTTCCACAAGACCTCCCAGGCCTAATATTCCTCCTCGACCTCCTC^ 
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GCTTCAATTGC^GCCTTGGAAAAGGAGGCCGAGGAACAAAGA^^ 

CTGCTGCTGCTGCTGCCGCCCAACGGGAAGCAGACGAAAAAAGGGAAAGAGAAGCAGAGGAACAAA 

CCAACGGGAAGCAGACGAAAAAAAGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGCTGCT 

AAAAGGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGAAAGGGAAATACTTGCCCAACAACTTC^ 

AGGAAGAGGAGAGGCGGAAAGAACTAGCAGATAAGGAGGAAGAAAAACGTCGAGAACTAGCAGCCJU\GGAGGAAGAA 

AGAAGAGCAACTTGAAAAATTGAATTTCCAGTTGGGTACAGAAATCACGTCCAAAAGAGCACTCGAACAAATGTTAGAAGAAGAG 

CGGTCCGCAGCCAGTGCACAGTTAGCGATCCAAGCAATAGAATATGAGGAT 

CGGATTTGTACG<SAAAAATGTACGATCTCAATAAGAAATTAGAAGTAC^GAATAATAC*TT 

GAACCAATTGGTTGCTCAATCCCTTGAAAAATCCGCTAAAGCCATTGAAAAATTAACTAGTCAA 

AGAATAATAAC^GAGAGGGATTTTTCTTTAAAGAATCTGGGAAATGTTTACAAAAGAGTT^^ 

CGAAGGCATTAATTACAGATAAAGAATCMGGGATCTGGAGGTGCGTCTAACAGATGTATCGACAGATCTC^ 

AAGGTTGGATGTATCCGTTAACATACGCTCTGGTGGAACATTATACACTAAATTTACAGAG^ 

GAAATAAGTAACAGATC^GATCTGCCTTATTGCCATTTTCATCTGCAGGTTTGGATACTAATTTTACT 

ACCAACTAAGCAGTATAAATGAGGCTATGAATATTTTGAAAGAAAATATTGTCCCAACATTG^ 

TTCAAGCTCTCGTCAATATGCTATTGAAGAAAGGGTGTATTCTGATGTGTCCCGAATG^ 

ATATCCCCTTATTTTAAAGGCGATTGGACGGATGAAAGACAACGCTCTATTGCTGACAGTATTT^ 

GTCTTGCTACACTACACGATATCAATACAACXTTCAAGAATACGTAGTAATCCCCTCC^ 

TGTTAACGACTTTAGAAATTTTCTCGATATCCAAGGAGGTTCTCAATTTACTTATGATGTCCTTTCAGGCCAAAATATTGATGAC 
AAAACCACTGAAAAGGTTACAGAATTGTGCCTCGAATTATCCATAATTTTAGACGT^ 

ACCCC GCAGGGGAAAC ATCTATGG AAG AAAGTGGTTCAT TGGC TGTTGAC ATT AGAQWtfSAAAT TGGTAAGAACATATCAGATTCTAGCGCTGAACTTAG 
TCGCACACTGTCAGAGGCGTTAGAAATTTTTCAGCAAGAACAGCAAGAACAACAGCAACAATTCCAA^ 
AATCAACAACAATTATTACAACAACAAATAGAAGAACAACAACGGGTTCAGGAACAGCAGCAACAGCAG 
AAAGGGAACAACAGCAGCAACAGCAGCAAAGGGAACAACAACAACAAAG^AAGAACAACAG 

ATT AT T GCAACAAC AGC AACAAT T TCAGC AATTAC T ACA ACAAC AAGGAAGAAG AAG AGGGGGTG AC GATGGTGATG AAGAAAG AGAGG AAAQXGAAGAA 
GGGGCTGAAAAGGATGATTGTGTGCGTAAGGTTGCAGAATCAGTAGCGAC^AAATATACTGCTGACTTGACTAC 

TCCAATCT AAAATAGCATCAGCAAAATTGGGAACCCTTGTCTTTGC C ACCCCTC CTTCACCTATCATGAACTTGACAAAATTGAGAG AGGAATATTCCAC 
AT TC ACAAC CCAGTGTTTTTC AAAACT AAC AGCT G AAAAT AAT AGT AT T ATGCG T AT TTTCCC C GAG AGGATTGT AGAAGT ATGCAAGAGT AAG AATCTC 
AATTTAATGGGAAAATACTTGTACATTATAACTACCGGACAAACAGAAATGGAAGATCGAGTGAAGAATATATTGTC 
AGTTTTCAAACAATGTAAAACAACAACAACAACAACAAGCTGCTTCAGCTT^ 

TACAAGCATGCAAGTTTGTGAGTTGGATGATCAACGTACCCTAGAAAAGGCTGCCATAGTAGAGGCAATTACTCT 
AAATCCGCTTCAGCTCCTTCCACGGCGGCTGAGCGAGAAATTGCTCTAAAGCTAGAGAATGGGAAAAC^ 
CAGGAGCTACTGCTGTTTCCGACCAACAAAAATGCATCGACGAAAGTACTTCCAAACAAGAATT^^ 
TGCACATAATGAAATGGATATTGGATTAATTTTGGATGCCAAGAAGAACGAT^ 

GT GC AGTCTTTCO^T ATT ACGT ACTC CGCAC ATGGCTAGGAG AAAC AGATATATTGGATGAAGAT ACT GTACATCCTGAAT ATTTCCGCCAATAC ATTG 
ATCGCAATTGGAAGGTGGAAGAACATGAGCGTCAAGATACATTAAAGGC^CTGGGTGTTTCT 

TC CCA GTGTC AAAAATGAT GCATC AAAAT CAGTAC CATT TGC GT TGAACACTCT ATTGT AC AAC AT ATT T GCT ATCGAC GGAGG AAT GATTTCT AGC CTT 
TCAAGAACAGCCTTTATTTACCGTAAATTTTTAAGGXIAATCTATGACA 

TGTAGACACGTCAACCGGAGACATGTTTtrGTATACGAGTTTGTCATCTCATGTACAATTTTATAGTGGCTTATOT 

TATACGTTAAATGTATTGAGAGCGTCGGGTCTGGCAAACAAAAAAGTCGTGGCCGGTAAAA^ 

CTTATGATGTCACTITATGATTTTTCAGTATTGTACAAGATTCTTCAACTAC^ 

ATCGTGTGTTGCCGCCATGGCAGCCTTCACTGCCGACCC TTCTCT CTCC ATCTCAGATGCCGACCAATCCATACTCTTCCCACTAGAAGGGGGAGAGATT 
GTTATTGAAAAACATGAAAACGATGCAGAAAAAAATGTC G AT ATGGTTC AAGAATTGTGG AAGGAAACTG CACTCACACTCATGGCAAAGGAACT AAATT 
CATACTACAACTGGTTCATATACAGTAAGGATACCGATATGGAAAAGTTGGCGAGAGTGTGCAGG^ 
AACAAATAAAGCCGAGAGTTTAGTAGACACTAATGCTCTTTCTGATATTTTCAAGCTACCTGTGAT^ 
ATTGTGGTGTTCACTTTGAATAATGTAATTAAACCATGGATGGTTTCGTTCAAGCAAATGTT^ 

CGTTTCAAAATATCCAACAACAAAAAC ATC AACAAACTGCTT CC ATT TT AGATGC ATGGGCTTGCGCGCCCGGTAJ^TT AACAAAGGCAGCACACGTGTT 

TATTTCAGGATATGAAAATCATATCAAATTAAAAAAAGATGATCTTTTATGGGGTGCTTCAATGAAATT<: 

GGATGGGCCCJACAATACAATAACGAAAGCGTATTGGAAGATTTTACCGACTTT^ 

ATCCTTTACTTTCTrCCATGTTCGGTAAAGGAAATGGTGGAAGCAGCAGCAGCAGCAGCAAGGATAATACAATTAT^ 

CCGCCAAGTTGTTGGACAAGAACAAGCACCACCAATAAACACGTCGTCGGATACTAAAAAAATA^ 

CCTTATAGTGTAATTAAGGCAAGTAAAGGAGTATCAATTTCAGTACTGGATGATTTCAATGAGGATAGTCC^ 

TCAATGATGCCATACGAGAAATAGGGCAACGCATGACTTATACAAGACCTATATTTGATCATCAAACACAG 

TATTCTCGAAGGATCAGATTTAAAGAATGGACAACGTTCAGGACAATCTTGGGCTCCTTCTTCATCATCCT^ 

TCTCTGGAGCTTTTATATAGAGAACTTGCCACAAAACAAGTAGAGAAGGAAGAAGAAGAAAAGAGCGAAAG 

AAAAATTATCATTTGTCGTGAATAAAGCTATCGGAACTATCCAACAACAACATC 

TGCTGATCAAGCTAGTAATGGTGGCATAGATGATATAGAACTTATGAATAGTAAAGAT^TACTT^ 

AATAAAATTGCAGCAGCAGCAGCAAGGGATGGGGAAAATTCATCAGCTAAACCGTCAAACTTTGGCAATAG^ 

TTTTATTACGTAGAGGAGGAGGAGTAAGAGGAGGACTUyVCACCCCAGAGTTCAATGCTAACAATGTTCCGTCCT 

CTGGACTACTAATACACCCCTTATTCAACGCACAACrAGTGTTGGGAATAATTTAGTTGTGCTTGTACCCAACCT 

AATTAGGAATAAGGAATTATACACATTTATTTGTATGTTTTACATGTATTTTATTCAACAATAAAGGTTATGATCATACATAA 

TACATAGTTTCCATTACTGACCCTTCCAAAAATGAATTTTGTACCTCCGAATACAATTTCTTCCCTCG^ 

CTCACCCATTCACCAACAACACCCTCATTAATCTTCCCATGAGTGATCTGAGGATTCAGTCTGTTCT 

AAATGTGTT CCATTATTCGTCT ACGC AAAT CGAGTGC ATC AT CTC CAAC ACATACTTTGGAT AC GAGAATGTTGTTGTTCATCCCAAACAACTCAAAATT 

ATCCCCAGT GTCGATTTTG AT AGCTGGTGC AAAACAACTCCC AAAATCTGTTTTCATCAGAGTATTGT ATTCTTCCAT ATTTCGCAAAT AGAAAACAGAA 

ATGGTCACATGTGGAGGCAACTGAGAAAATGATTTGGAGACGTGTTTCTCAGATAATAATTTTCCCGTTTCTGAATAGAAAGGAAGT 

AGAGAGCAAAGAACATTTTCATTTCTTCACCTTTAAAATCAACTTGATTTATAGTGAAGATTCT^^ 

AAACATGCCATTGATTTGTTTGTACATTTGCTTGACTGCGCATCTTC 

TGGTATCCATTCAATATGTTGTAGCCGGCACAAGGGATAACTGGGTCCACAAACTTG 

AACGAATTTCGTCTGCGTCTTCTGCCTTGTCCATTAAGATCATATTTAGATGC^^ 

GCAAATAACATCATCCAC^TATTCCCCATCTACATAAAGTCCAGTAACAACAATCTTTAACGCTTT^ 

GCCTTGTTTACTTTGGTGGAGAGTTGTTCGTCACTGGGAATGGGCGTAGAGGCC^ 

GCCGGATATTGTCTACCATGTTAGACATAGAAGTATTATCACTAAAGTCAT^ 

TTCCATGGTGGTGGTGGTGGTTGTGATGGTTTGAATGAGGTGTGGTACAGTCATTATATACACGT 

GTTGCCATCAAGCAATATCTTCGTCCAGAAACACCTGGTCCAGAAATGGCCATAAGATACTTCCTCTATTTCT 

G ACATTGGC C CGACCC AGAGGT CC AC C CTCGG AACTTG AC ATTCGGT CG AGCT AGCGGTCC ACCCCCT AAACTGGAGCGCCCT AAAAAAT TTTTGAAAAG 
TTTTTGAGATGGAGGAAGAGTAAAATTCTCTAGTGAAAACAGAAGGGTATACCCTCTCATTTC^ 

CACACAAAAGTTAATGTACGTTTCTGGAGCTATTACATTTCrGGTGTAATCTGACATTGGCCCGACCCAGCGGTCCACCCTCGGA 
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GAGCTAGCGGTCCACCCCCTAAACTGGACTGCCCTGAMUVAAT^ 

TATACCCTCTC^TTTCTGGGTCGACCACTCTCCAGAAA 

ATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACAT^^ 

AAAAGTTTTTGAGATTGAAGAGGAGTAAAAAACTCACTATATGAAAGGTGTGTAGAACACCACATCCTTTCTAGGCACGGCCA^ 
TCCAGAAACACGCAAAAGTTAATGTACGTTTCTGGAGCTAT^ACATTTCTGGTGT 
ATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGCGACCCTAAAA 
CAGAAGGTTATACCCCACCTTTTCTGGATGCGACTAGATCCAGAAACGTATGTTCCAG 

TTCTGGTGTAATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTrc 

AAAATTTTTGAAAAGTTTTTGAGATGGAGGAAGAGTAAAATTTCTTGCCTGAAAGGTGTCCAGAAGTGTAAACACA 

CCATCATTAATAAAAAATGTCCCCTCCCGCTCCCCCCTCAAAATCGGGTAT^^ 

GTACTGAGAACATCCTCTCCTCTTGTGATATTATTCAAGAAAACCTACTGAAATCGTCCCTAAAATGGATTTTC 
AAAATGTCCC^GTTGTATTCATCAGTGAAGAAAGTTGCAGAGCATTC 

TGG AAGGGGAGAGG AAGAAAAT GTCTACC CCAAAG TCCTCTT CTGAT GG ACAAAAAC T GG AC AA GGC T ATGTTGG ACGAT ATT ATC AACGAGT ATCAGGC 

CGTTAAGAGCACTGOlGATAATTCCATTGAATCGACCArCAAGGA 

AACAGTGTAACTTCCAGCCCAGAAAAAGTGTTTTCTGTCGAGGATTTAGAAATCT 

GTTCAAGAATTGGAGGCJAGTATGC^GTGTCAATGAGTATCAAAAAACACAACGTCTCCTC^ 

CAGGGATTGTTTTATGCTTGAAACAACCTATCCTCTTGTTGGGTTCGAA^ 

CTAGAAAGATCTCTACCTAAATATGTACCAGTTTTCGACATCAATGCAGGTATTCAAACCCTAAACA^ 
TGTGCJITGTTTGGACCTACAGAATATCACAACTTGGAAGATT 
GAGGTGTCACACCGAAGATTTGGAGAGGAAGAAGCGCATGAATGACGCJ\CC^^ 
GAACAATTGCTCTATATGATAACCGAGTATGAAAACACGGCTAGTTGGTGOACGC 

CAAT T AGTGATATG AAT GACTT GT GCTGTTTTGC TGACT GCATCGATGTTACTGT T AAT AAT G AAG AACATGAAGAACGTTCTATGAAT ATT GTGGTCGA 
ATCTGACAGGCGTCTTTTTGATGCTAGTCCTTCCCCCATCAAGACGGAAGAAGAT 

CCTACACCATACGAAGGTAACGCAGTTG7GGAGGGGGAGGAAGAAGAGGAAGAAATTGATGAAGACGAAAGTAG 
TCGTTATGAAGAAATTAGCCAAGCTTTCTACTATGAAACAAATGAGAAGGGTTAAG 

CAGTAGTATCAATAACGAAGATGATGGTGATGATGACGATGCCGTTGACGCTACTGCATTATGCCCCCAAACEX3A 
GCCCCAAAOSACGAGAGAACTGAAAATATTTOGTATGAAACTATGCAA^ 

TATTCACCJVACAAACTCCAAGAGTGTTTGAATACCATGG ACGAT AGT ATC CGTCGCCGTCCAACCATTTGGA 
TTTGTTGTT/TGATGAGGrTGTCACATCAATTGTGGC^ 

ACCAACATTAAGKSGTAAATATGAAGGACAAAAGAAGAGTCTGTATGGAAACAAGCACATTTCTTCCTC^ 

ATAATGCACTATTTGCGTGGGTGAAATCGAAACTCCATTCTGGCACAGTCATACCTAACGTATTCT^ 

GAAGCGCAAGCGTACCTCTAGTGCTTHIATCATCTAACGATGA^ 

GAATCATCATCTCCTTCTTCATCGTCTAC^CCAGAACAACAACAAC^ 

TGCCACGCAGHT^TGTCAATGGCTCGGCTTCATATTTGTCTGAAATATTC 

GT GC AAGACTTTTG AAGAT C TT G AAAATGAAATC ATG AGG AGCT C ATT CAC T AG AC T GAC T AG AT AT GAGAGGGAGGT AACTC G CTTGTATGAGAAATGC 

AGGTCTCAAGCTGTAGATATTGAGGAAAATGAAATGGATGT^TTTGTCTCACCAAGGGGAATTGTTTGC 

AAGAAGTACTGGAGAArATTAAGAGTTGGAGCCTAGAAAACGTTAACACCCCTAAGCGCAA 

TCGTAGGACATATGAAGAATACCATGCGTTTAGCAAGTTTGTACC 

GACGTTCACTTTTCTTCCACTTGCCTGTGCTACCTGACTGTAATGACCAG^ 

AC GAAG AAACCGAT ATT GT GGAGGAAGAGGAGGAAGG AG AAGGAG AGGAGGAT AAAATG^ AAG AAAGT A TGGACGT AGAACAAC AGAAGCAAGTT CG C AA 
GGGA GGG AGAAAGAAGG GT C AAAAATT C AACAGTATT GGG GATC AAGTCATT AG AAAAT T TGTGAAAAG TTTGTGTGAAAAT T C GAT GGTAGT TT CT AT T 
GCAATTAATAGTTTGATCTCTGGAATAAGCTGGATGAACAAGAAAATCCCTCCCGGTT^ 

CAAGG T TTGTGTTT AGC GATGT CAAAAT CAAT AGGAAAAT C AAT GGAAC AGAT G AT AAAT AT GAAAC TG TTTTT/GGAGTCAGT ACGCGTGTGG AT TCAC A 

TATTGTAGGCCCCTTTAGTATACCTGTTGATTTTTCAAGCGCAGGACTAGATAAGGCCTCATGTGGCAAATTG^ 

ATTTTGACAATT^C^CCCAAATATGATTCATTAAACGATGAGGATGTTGATTCTACTACAACAGACAAGCTAGAGAAGG 

ATGACACCTTTTTCAATATTAATAAGAATAAGGTTCTTCCATTCTATAATATTTCTCCTAGCTCTTCTCTCACTG 

GAAGAAGATeTCATCTGGTATGAGCAATAATAATGGCATGTGTGTACAAACTCCTTCTAG 

TCATCTTCTGTTCTGGCTCTATCTTGCTCCCTTTCTTCTACAAAGAAAAAGAGCAT^ 

CTGGATTTGTTGTACCACCCAAACTTTGCAGTTTTATTGTTAACCATAGACACGCTGT 

GAATATTATOGATAGGAATAGGAAGAT^AGATTTAACGGTCTAAAGAAGGTA^ 

AAGAATATGACTGCAACTTCACCTAGTGATTTGAACCTATGTATTTATACTTCATCnTTAAATGACCCATTGTATACT^ 
ACCAGGACGGAAATGCIOTGGATGACTATGGTGCAGTTTTTGTAAACTATACA^ 

CAATGCTGCTGCTGCTGATGATGATGGTTCTACTACOTCTACCTCTTCTTCAACTGATACTGATGCT^ 

AAGAAAATTGATGCCATGAAAGACJITTAGAGGAAAGTACAAGAAATCTTTGGCA 

AAAATTATATGTTTTACATGTTGTGTTTTATTCJ^TTCATATGGTTCAT 

GCTGCTGGTTCAATGTCTGAGCCATAGATGGATTC^TTGACATCTAAAAGTTCATCTCT AAAATTTCCTGGTCTACGCGCTTGACGAATAATTGGTGCAA 
TAAATGCCTCCTCTTCTTCTTCTTCTTCTCCTCTGACTCCCGT^ 

AGCGGGAGGAAGATCAAAATCAATACTGCCAAGCTCAGAGTCTATATCAGTAAAGGACAT^ 

G^AGGAAGAGGAGTAGGCTTGGGTAGAACTGGTGTAGGTGGAGGTTGAGGGGTAGTAGAACGAGAGTCAAATCTTACACGGGAAA 

CAGT ATTAGGTATAGG AGG AGGAGGAGGGGGACT AG AAATGACT TG AACCGGAAAGGGCT GCTGCTGC T GCTGCTGGGG AGG AGGAAT ATTACTGGGTG A 
TTGTTGTTGAGGAGTAGGGGGAGGAGGAATATTATTGGGCGGCTGTGGTGGTT^ 

TTGCCTAAAAAGTACAGCGGGTACTGTTGTTGTAAT^GTTGAGGAGCATATCGGTTATAATACTGGTTATATTGATAATAAGG 
AAGTAGGTTG*rTGAArcTGGAAAGGTGCCTGTGGTGGAGGAAGAGGAAAGGGAGGAGGTTGGA 

TTGTTGTTGTTTGGGCACAAACACTCCAGTCGATAAATTAAATGCTTGACTTGTTTCCTCCACTGCACTCTTTATCAT^ 
GCTCTTGCTAAAACACCAGCTGCACATTTCTCCAGCATGGAGATAGAATGCTCT 
ACATTTTTACCCACACGTTATTATT^ATAGCTCTAJUUVTAACGAGGTA 
ATTCTTATCAAAGTTCTTGTTCGGTAATATACGTGTACCAAACAGGTTATTAC^ 

TGTATACGATCCTCCACGATACTGCTCATTAAATTGTTTATCTCTGTTGCATTTCCATCGTTAAGAGATTTCAATATW 

GAATCTCTCCTCACCGTATATCTTTCCTATGGGAAATTGCCCAACAACTATTAGCCAGAAGAGCTTTCACCM 

AGTGTTTGTTCT/TTTTCCTCCGAACTTTTTCTrCCCTTGTTCAGATGACAATAAAGA 

TTAGCTAAAAGAGACCTTATTCTGTACTGTATAGCCTTCTCCTTGACCA 

TCCTTACACCTAAAATAGAGCCCGAAGTTATAGCACCTCTCAATGGTCGAGACACGGTC^ 

GTCACTTCTCTTTCTTTTTTTAGAGGGAGGAAATATGTGCT^TT^ 

ACATTCTCCTCTTCTTCCTCCTCTCCATCTT/TTTC 

TCCCTTCCTCTTCTTCTTCATTACCTCCTCCTC^rTCC 

AACACAATCGCCACTCCTTGACAGAATTTTACTAGTACGAATCTTACGCCTTTGCCTATTCTTACTAGAAGTTGGCGCGTCC 
CCCATGTTTCGTTGGAATCTGGATTCGGCGTCATCCTCCTTTTTTCGTTGTTGGTTCCTAATTA 

GTGGCATATTTGTAAGGTGGGATAAACATTACAAGAAAACAAAAATATTCAAGACATTATTTTATTTTGTAGGCACCGCTCT 
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TTCAAAACCATCTTCCATTACATATATCTCAATCTCTTCTTCTTCTCCT 

CAAC AT AGAGTACAATT AT ACACCTTTTCTAAAGCAGC CT TTGT AGACT T ACAATCAGAATACAAT AGTTTATGGTC AATTAGT AT ATGGTCAGGGTT AG 

TTATTTCATCTTCACATATCCAACAAAAAACTGTAGGGTACCCATCTCrTCTACCGCT&CAATC 

GTTAGGTGTCTTACATTTAGGGCATTTTACAGTCTTAATTTCCTGTCCTTCTTTCTCAACCTTC 

GGATCCTTTATTGATTCAGATGTCATGTTTTGACGTACAATATTTACCTGTTTAACAATTTCCATTTTTGCACCTTGGTTA 

CACGCCT ATCTAC ACAATG AAGG AC AAAAGAATGGGCTCT AT GGAAC AC ACATTC TGGCCGCTC C AACACCATTTCCCCT ATAAGATTCTTGACAAACAT 
TTTCGCCCATAATCCTAATGCACTCTCATGGTTCAATTTTTCATAGAC^ 

TCTCTATCATCAAAACCCCCAAAAACACTGTTTAAGTAATGGACTTTAACCCTATTCTGAAACCTCTCCCTGOT 

GAAGCCCCTTCATATTCTCTGGC^TACGATCAGTCGACTGCCAGTTGGCTAGCGCTAGAAGAACCTTGCCCTCCGGA 

CTTCTTTTCGGGAGTTCTTTTTAGTATAGAAAJCAGCCTGCCTACAATTTGTC^CTCTTCAT 

AATCTCATGGGCCATTTATGGAGTCCGGGACCAGTATCATATCTTACTAGCTCATCGGTGGG 

AATAGTCTTCCAGTTGTTCGGGAGGAGGGTAC^CAATACTGGAGGAGGTACATCCACTGT^ 

GGGTAGTAAACACTGGGTACAGATCAGGGAACAOTTGCTAT^CCAGTTTCTCCCTCCACCATCTTAAAG^ 

ACCAC AGTAAATTTT GAATC CTTGTCTAGTGCAAAGTCC TTGTC CATCT T ACACATCTCC AT GAATTCGGCCTGAAG AAGTTCC AC AGT ATAGAAGCAYT 

TCTCTGCTCGCTTGATGAGTTCGGTGATGAGAACTTTGTTGGCTGCAATGTATTCTGGGTCGACATTT^ 

TGAAAATTCGTX^ATCTACCTATAAACTGTACCATATATTCTCCACAACATGGGCACCTAAAAACGTC 

TCGTACACATACATTAGGTGTTTTATAGTACAATCAGGGCATAGTTTTGCAGAACACGTTTCTTGTCGTGCTTCAAAA 

GGAAATAATCGTAAGAGGCTTTCATGTTTTCATGAGAACAGTTGAAGAATTTTGGTTTCATGTCCAAATCCGCCT 

GGTATTCCCACTCAATATCTTTGTAACGTGWCTTGTATTACCT^ 

TCTTGCAGTGTCTGGCTAACCATGCATTTGGCGATTGCTTCTTCTAGATGCATATTrTCAGCGTTTTCATGT^ 
CTGAAGGTATGCCGTGTCTAGATACCAGGGACAGTGGTGTATTTTCTAAGAGCGAAGCGTTCGGG 

CTTTGGCTCCTCTACTCTTTCAGTATCTGCTGCCAAAAGATAGGGAACATTCTAGATAGAGTATCTGTACACTATATCAATCATCT 
CCAAAAAGAATATGGTGTAATTCTTGGGGCAGATATTCCATAGCGTCATACACGGGCCTGGC^ 

TGGTGGATGGACC^CTTAAAAATGCTTCTGTTACGGAGAAGGCCATATTTTCCOWVTCCTGTCATTGTTTCTrCCTTCATG 

AACAC T AAACGTTCT AGCGTCTCTT ATTTTTGTCCCCAC TTGCAAAGC AG AGAAGGGCC AC ATGT ACCACTCCTT ATGAAAC ACATCCTTCAAAGTGTT G 

AGAGAAACTACCACATCTTCATAAATGACACT*FTTCATGTGGCGGTG<rrTCTTGTTCTGATCTCGTTTGATG 

AATCCATATCTGATATCATCTCGGCGTACGCCTCACACAACGCATTCTTCTCCTCCACTGCATCGCCTGCATTT 

GCGCGTATTTTTCATGACAAACATGCCACTATAAATACGATGAATTAGATCCTTGTATGCTGGCCCATCTTGTT^ 

AATTCTTGTTCGTTTTGGTCAAGAGrrGTAAATGGAACTGGGTCGTCGTTACTTTTC^^ 

TCTCTTTCAAAAGACCGGTTGGATTAATCACAACTGTTTCTCCCTCCAAAA 

AAGATCACCACTTTCTCCCACTATACCCACAATTCGTTCATAGGTGGACCTCTTATTATTTCTGCCTCCCATTAGCT 

GGGACAGCAGCTGCAACTATGGATGTTACACCTTCCATGTTGAACGAGGGAATGAATGCGATAAAGTCTGG 

GAAAAAAAACAACATGAGGAAAAAAGTAAAAAATTCAATATACATGTATGTCTTTATTGGCT 

GAACATAAAAGAGCAGCAGCAGGATCTGGAACAACAATAGAGGACTTCTTCTtX^ 

GAGCTTCCTTCTTGACTTCTTGGAGTACATTCTTGTCGACTGTGAACTGGACAGCTCTAGC 

CTTCTCCCATGCATACATAGTCATTGACCGX^CCTTGCTGAGTTCGGGTTCTTC^ACAAACAAGTTGAG 

TGAATAGCCATGTCCAAAGTAGTACGAGGATTAATTTCCCATACAGTTTTGAATAGT^ 

CACCACTTGCCATAATCCTCTGTTTAGTTACTGAATTCCATTCTCCCAGTTTTATGAGCTC^ 

AAGTACATTACGATTATACATGTTGGACGTTAAAGGCTCAAAGGATTCAGAGTTGCCCAG^jATCTGTG<lAGTGG 

AWGAATTGTGAACACCATACTTCATAATGTCCCTTCTCAATTGCTCCCAATCGTGAATTGGCAAAGAGTTGAAATATATGTCCCTATTCm 
CCTTCCCCATGTGAAATTGGAAAATCCCTTTGCTCAAAGGACTACCCTCAAACAG 
CAAGGCACCATACTATATAGTTTCAAAAATCCTCTTGTTAATTAGTGCCGCTTCTT^ 
CCCTGCACACCCAATCCCATTGGCCTAGTTTTCATATTGGAAATGCGGGTCTTCTCAACTGCATAGAAA 

TC ATG ATTTTT ACAACCCTC TTC ATCTCC CGGTAATC AAC AT AGGGCCTT AGGGAAGGGATGGGAGAAT ACTTCACAAACTT GTTGACTGCGATAGAAGC 
CAAATTGCACACTGCAGTTTCCTCCGAATCACTGTACTGGACAATT^<^GTGCACAAATTTGAAGACTTGATGATGCCG 
CTATTGATGGTATCCTTAAAGCACACATAAGGTGTTCCAGTTTCGATACGTGCAGAATTAATTTGG^rc 
CTTTGCCTTCAGCCTCATATTTCTCGTACAACGCCTTAAACTCTTCGCCATGGACGTCGGAAAGG 

CTTC C CAGC CTTCACTC T CT CC AT GAAGAGATCAGAT AC CC AG AT AGCT G GGAAAAGATC C CT C GT CC T CAAATCTTC ATT ACC GGC AT TC TTTCTGC AG 
TCAATAAAGTCCTTCACGTCCAGATGCCAATCAGAAATATAGATGGXZTGCAGC^ 
TAAAGATTTG<aiGGAACGCCATGAGACCAGGGTG^TACCACTCCATGACGAAA 
TCCAGCAGTCTTGGAGATGATTGCCGCCTCCTTAAGAGTATCATAAATACCCTCAATG^ 

GGTGT GACT GTTCCACAATT AAAC AGTGTGGGGGAAGCATGGGT AAAATAGTGCCTCGAC AT GAGATC AT ATGTTTC AATGACAGATTTGATGTCTG ATC 

CGTGAATGCCGACAGCCAC^CGCATAATCATOTCCTGAGGGCGCTCAACCAAGATTCTCTTCTTTCTATCAGTGGGGGAAC 

GTATTCTAGCGTCCrCAGTCCGAAACAGGTCAAGAGATAATCCATTTTGTAATO 

TAGGTATCATTAACTACTGATGCTGGTTTACCAGTTCCAGGGTGGATTGCGTGCCTCA^ 

TTTTGTGGATGTTCGAAC AG ATGAATCTTCCTGCCAG TTT TCC AAAATC AGGGTGGTCAAC AATTTCT 

TTCTTGGAAGGAGATGGTAGCGGGAAGACGGTCCATGATGTGAGATGCAAGTTCTTCAGGGTTA 

ACTGGAAGACAGGCATTTTCAATCCTCrTGATTATCTTTTCAAGACTAATTTCTO 

AACCCATTTTATGTTATTAAATTGGGATGrTGTTATTTCGAGTATTTTCCTATACGTCTTCTCGGCT 

GGACGGTAGCTGTATTTATACCGGCCCGGTGAAGGTCGCTGGATATTTTTGAAGGA^ 

AGAAATCAGAAGGTGAGGTGACAGCTGCTGCTGCTGCAGTATTAGTAGTGCAGGTGCAAAACTGTTCCTCAAAAACrTT^ 

TCGAGTTT AGGGGGTGGAC CGCTGT CT AGGCCTGATGTC AAGTTCCG AGGGTGGACC GCT GGGT C GACCC AATGTCAGATTGC AC CAGAACCGACTGTTG 

CTCCAGAAATGTGCATTAACCTTTGTGTGTTTCTGGAACAGACGTTTCTGGGCATAGCTGCGCCCAGAAAGGGAA 

GAATTTTACTCTTCCTCACTCrCACAAAATTTTGAAAAATTTTTCTGGGTQACTC 

AGGGTGGACCGCTGGGTCGGGCCAATGTCAGATTGCACCAGAAACGACTGTTGCT^ 

CTGGGCATAGCTGCGCCCAGAAAGGGAAGGCTGTATCTTACTGTTTTC^ 

GGGTCACTCCAGTTTAGGGGGTGGACCGCTGGCTCGACCCAATGTCAAGTTCGGAGGG 

GAATAGCTCCAGAAATGTGCATTAACTTTTCTGTGTTTCTGGACAAGTCGTTTCTGGTCTAGGAGG^ 

GTACACCAAGGATAAAGAAATATAACAAATACCATACTATGTTTTATTTGAGTAATATAGTATCCAAGTTAAAATAACAATCACGCACTTGTA 
AGTCTGTAAAAGATGGtK AAAC GT AC ACC TGAGAGTATTGTCCCGACAATCGT AC ATGAACCCTCTAGTGCACGTATCACGTTTTCGCCCT ATTACH 
GATGCACAAATTTCCTCCTTCATTCTAATAGAGGCAGATTGTTTGTCAAAATAACACTCCCTATAGTAACAACCAGGATTTCCA 
GTTGTTCGAACGGTTTTCCTGTATAACATTTCCCAGCTGCGTGGTAGTGACGCACA 

CTCCCACGCACTAGAGGCACAGAAAAAATCAACTGCTGCGTGGCAGTGACGCACGGCCACACACCCGCAT^ 
TTAGTCGTCACATCTCTCAGAAACCCCCATGTGCATTGACGTCATGGG^ 

GGTTAGTAGACAGAAACAAACCGTCAAGATGGTGTCGTCTATTACCCACCTCTCTCTGTTGTTCGTCGTGGCTGTACTAGCTTCCGTCGTTT^ 
GAAGGAGCTAGTGTCAGAGTGAAACGGTGTGCTCTTAGCCCGTGCCCCGACGTTATTGAC^ 

CTCGAGGAGGTGACGACGACGACGACGATGACGATGGAGGAACTTTCGATACAGTAGGGTCTGGTATACTTGGACGCAAAAAGCGTGCCGCACCTCCACC 
TGAGGATGAAGAAGAGGATGATTCCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAG^ 
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cctccacctgaggatgaagaagaggatgagttctaccgcaaa 
gtgccgcacctccacctgaggatgaagaagaggatgagttctaccgcaaaaagcgtgcc^ 

caaaaagcgtgccgcacctccacctgaggatgaagaagaggatgagttctaccgcaaaaagcgtgccgcacctccacctgag 

ttctaccgcaaaaagcgtgccgcacctccacctgaggatgaagaagaggatgatttctaccgcaaaaagcgtgccgcacctc^ 

aggatgatttctaccgcaaaaagcgtgccgcacctccacctcaggatgaagaagacgatgat^ 

tgaagaagaggatgatttctaccgcaaaaagcgttaaactacgcacgaaagtgacg^ 

ttttcatgaaatgtgtacacacctgctatatatacctgcatatttgaa 

gggataaataatcggtatttgtgtccaaagatagtggtgcc^tgatgccatat^ 

cgtcaatgcgtgggcagaggcggaggtgctgataaagcgtttctgagaaacatt^ 

TGCCTCTGCCAGrTTCACTATTTTATTTTTCTTCCTATACAAT^ 

AAAAAACCAATAACT AACT AATCATGGATAACTT GAAAGGGGAATTTGTTGCGCT T AAAACAGACCT CACCCATT ACAAAACACAGTTGGATAGATCT AT 

ATTGGTATTTGTTGATGTTGTTGGTAGATTATATGTTATAGTAAATAGTGAACAAACAGCTAAAAAGGAAGGTC 

ACAGAGATACAAO^TTCAAGGACGAAATAAACAACAAATATAATGCTCTAACAAAT^ 

TC AAAAG AG CAAAACAT AAGGC C AT AATTGAAGC GAGGGAAT AC TC T AAACCGC TGAGGG AATT AGAGTGCATGTT T AC G C GT A T AGCGGACATGTT AAC 
CTTGACTTTTATGACTGTGTACACCAATATCATTAC 

TTGTGTGACGACTTGTGCAATCAATTACCAAAAGAAGAGGAAGAAGAGGAAGATT^ 
ACAOVCGCCTAAAGAAAGATTTGATAATTTTCAAAGATGTCATACAACAACnrrcACGTGAT^ 

CAT T AGATG TGCGAAAC AGA TG AAC GAAAT AAGT CAATAC AGGG AC AAC CTCAAG GAT AATT ACAAT AC ATTTTC1AAACA TT TT G AATG AAATTGTC T AC 

ATTTTTGATCACGGGGGACATCTTGAAGAACTAAAACACAAAGCCATAACTCTGACT 

AACGCATATCCGAAATGTTGTCATTGACTTTTCTAACAGTOTACA 

TAATTATCTTCTCX^CTTGTAACATGTAACGAATTGTGCAACCAACTCCCCAAACCTAAACAATACCG 
TTTTCTCTTTCTGTCCAAAAACATCTGAGTGGGTTTCTTT^GTATGT 

TCTCTTCCTCTCTATAATCCCAATATGGATAGTTGTTGCCTGATATCGAGGATAACACCAGAACTG 
ACAACTTCMGATTGTCCAGAACTCACTCCCAGACGACCAAGTTATCTCC 

GATGGCAAACATTAAAATCCAAGACAGGAAACAAAACACCACAGCCATATGTGAATTGACAACT 
GTATTTTTGGGTTCAGAGGAAAAACGAGAAGAGTTATTGG<^AATCTCCCTGAAGGTGC^ 
CTCTCTTGGACAAGAAACTAGAAATTGACGACGGTATAGCTTCTGTAAAGGCTGCCCTCTGTGCTGCT 
AGTGAAAATGACCTTTTCTGCTATCACAAACATGAAGGATATA 

TTCTTCAAGCACC^ATATCTTTGCTCAAGAATTCCAGGAAGGAACATGTTGCCCATCTAGTAAGACGTCTAGAA 
CCAAGATGAGGTTCACAGAAATGGAAGAAAACTACTTCCCAATCTCCAGCGATAGT 

CAAGCTC CGT ATTTTT AGGG AAC GAAAATGCTGCGAT AGATGTTG CCGTCTTTATC ACCAAC AACCGCCTCXIGGAGGTGT ATAATTGGGATGGAAAAAT A 
ACCCAAC AATCCAAT AAAGGCT AC ATT AATGCAGGCG ATGAAATO ATCGGCATGCT AAACTC AAATGAT A ATTC CCTCCT AT ACCTAAGA 

TGGTTGTACGAAGAGTGGTGGACGGTGTCTACGGGCAAGGAACTATCCTCTC^^ 
GACATGCAATAAATGCAATAGGATTTCC^GGCTC^CTATCTTAGGGCCTACAAGA 

AC AC AAC AG AAAGGAGAAAATAAAC C T TCGTTTGTGC AAAAAGGAAC AAAACGTC T ACG AGTGG AT A C C GGT AGCAAC AAGAAC A CGT T AGAAAAATTC T 

GTTCCTGGGAAAGATTCAATACTGAAGTTTTGCTCCCTTGGCTTGGCTACACTATTGAGTCTAAGTGGCAGAA 

TACCAGATATAAGGAACTGTGGGCCTTTGTGAACAAACAGGAAATATCTTCCATGAAAGACTCXTACATAA 

AGTATCTTG<IAAGACX^GAAGGGTGTATTTGAGACCGTCT GC AAAAT AAAGAGCAGAGATGGTTT GTGAATTGGCCA.C ACTG ATTCC GAT ACCGACT AAA 

AAGGCTGATTGATGGCAACCCCCCCCCCTCCAGACTCAGCCGCATGAGTATAAATATGGCCACTTCTCACACCACAG 

TACCGTCAACTTCCATTATTACTCCAATAATACCAACAACCCCAGAAATGGAGTCAATCAAACT^ 

CCAAGTGGCTGAAGAAATCAAGTCAGAATATAAAACCGAGGAGGAAAAGAGGATTG^ 

GTAGATACGTCTAAACACTTACTTACAAGAGAAAACCCCAACC^ 

TTGCCTGTTTTTGGACATTGGAAGAGATTGATTTCGAAAGGGATCCTAAAGATT 

GGCGTTCTTTGCATCCTCTGACGGAATTGTAATTGAAAATCTTACAACACGTC^ 

CAAGTTGGAATGGAGAGTATTCATGC^AACGTCTACGGAGAACTGATTGATAGACTGGTGCCCGA 

AACACTTCCCCGCCATCAAGAAGAAGGAGCAGTGGGCTATTAATTGGATGCAAAGCAATAACGAT 

AGG AAT CTTCTT T AGTGGTGC ATTC GC AT CCATT TTCTGG AT CJVAGAACAGGGGTATT TTGCCTGGTCTC ACCTCCTCC A AT GAGTT C A T TTCT AGGGAC 

GAAGGTCTTCATCGCGACTTTGCATGCATGCTCTTGAAAAAGGGTTTTGTTGATACCCCATCAAGAGAAAGGA 

GAAT TGAACAAGAATTTCTCAC AGTTTCC CIXJCCTGTT AAATTAGTCGG AATGAACTGCA 

ATTGGTTGAAATGGGACTAGAAAAGCACTATAATGTTACCAACCCCTTCCCATTCATG^ 

AGAGTCGCCGAGTATCAACGTGCCCAGGTCATGGCTTCTATCAATAAGATCAAGAAG 

CTGCACCTCCTCCAGTCTCTTCCTCATCATCCGAAC^GAAGATGTTGAAGACGGCGrCGGGGACTACATCACTTA 
TGTCAATAGGTTGTGTATTGTATTATTATTGTTATAATATrTTTAAAAA^ 
AACCTTTTTTGTTGCGGTAGATGCATGCTCTTGCTCTACCATCTGCCTTTTA 
GTGGTATACCCTACAGATGTGTGTGGGCCAAAGGGAGCTG^GAATTATTCACTGCT 

GTGAATGGTCAGGAAAAGGTCCTTGTCCAAGGATCAATAACGCrrGTCGTTGAACGAGATTACTCCCTTGACGAGCAGGATTGTAAAGGGT 
GTTCCGAATTCCTGGCACTGACCATTTTCATACTGTCTTTTCCCTTTGTTGGCTAGACAGAGATAT^ 
GGTATAGTAACTGATGATGAAGATTTGGTAGATTCTGGTATTAGGACTAAATTTAAArACTCTTCTAA 
TTTACTCCCTCGACTATCAAGAGAGGATTAAGATATTAAAGTCTCATTTTAA 

agatttttt cctcg^^tc ag agagatgggcaactttt gctct agagaat gcagt acctc aaatac ag aaccataacaatggtgaatgg aaag at attgaa 

aatcgtgcaagaactacgccaggtgccgcgtgggctgagactggaccaatattttaccaacacaagaagaaggaa 

tccctatccctcatgccctctacaagattgtgtacgacaagaataacaaggaatt 

AAAGAATTTATATTGTTTTATTTTGTCATTTATTTGATACATTTGTTGTATC 
TATGTCTGGAGCTCTGATGAATGTCTTGCCAGATGCCATCCTAGT^ 

GCAGCCAAGAAAATATCCCTTTGTCCAGTGACGGTAAATTGTGTACCGTTATTTAGTGTGATGACATT^ 

CTTCCTCCTCATCTTCATAGTCATCTGCAACATTCTCAGTGTCTAATAGAAC^TAACAGAGTTGGGGGAAAAAT^ 

AGTATTTTTACTGCTCCATGGTCTAGGATCATGCTTTCTAGGCTTGT 

ACTAGTGTATTACCAATGCGTTTTCCACTCTTATTGTTATCACAATAATCCTCATCTTCTCCAACAAGAGTTGACT 

TATGAGAAACTAATGCCTCTTTAAAAATATCGTCACTCTCTTCTAACCAGAA^ 

ATTGAGTAAAGX3GTGTAAAGGTTCCACGCCAAATAAAGGTATAGGCACMACTTTGCCTCATCCGT 

TT^TGAAAAGGT^GTTGCAAGAACACAATTTCrrTGCATTTTTTAAT 

TATTAAATCCTCCACACAACGAGTACATGGCATTATCGATATTCTCCCT^ 

GTCTTT ACCGAG AGGAAGAAT ATC TGC ATTGAATTTT GAAATTATAT C AT^GACAAC ACGTTC AC ACTGC AC AGATCTCTTT ATG AATT T AGAGGCTAGA 

GATATGAAATGTCTATTTAAATTGGAACCTAGCCTATTCCAAGGAAGTGAATTTTGAT 

GTAATCTGGTCCAATTAACCGCAGCTGCAGTTTCTCTCTGGACGAATGTTTG^ 

AGCTGGAAGAGGGTGTTGAAAGCC^CACACGCCAAATCTGAACGTCATCTT^^ 

CTCCTTTCATTGGGKIACAAACTCCTTTGTCTTTAATTCTTC^ 

TtZATCACGTCGTTGTTCTTAAACAGGCGCGCAGAAAGCn'GCATC AGGGAC ATTTCTTCCAACTTTTT 
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CGAAGAGATAAAAGACCCTATAACATTCTCAATGCAATGAAC 

ACCATGCCGCACTTGTGGTTGTATGTGAGCCCACAAGCAACTGTCAATTTCTTCAGAAGGATCT 
TCTGTTTATACTTGTGAGCAACCCAGTTCACTATAAATAACAAGGCATAAGAAGCAGACACCTTTCGTCCCAAA^ 
GTCTGCCATGTAGCCATCTGATTCGTGTTCGGCGATAGGTGTCCrCATCATTGTGACGGATGCTTTCTCCACTCTATC 
ACAATACTTTTTCTGCACGCATTTTTGTCATTAGAGTTTATTCTTGCATATTGGAGTTCACAG^ 

atgaatttttagttttgttgtcgttctttatatactgtttcaattttgaga^ 
gaaaccgagcaggtaaagttgtacacatgaaaaggcacctccaaa 

ctaaagttgtgctccagactataattcgtttcaacgttaaagtacctttccj^tctacgaggcactttaatg 

cactaaacatacccgacaacttgtttgattttcccaatacaatatct 

gggcttttttgcatttttaagagtatccgaaagatgcaccaaaccagttctgtagttgtag 

ttgtatgcatcgacaactttttctggagcaattttttcagcatataaacgttgccatattttct^ 

ttgtcttcacatcgacaagaaagtatccttcacgtatactggtgc^ 

tattatggaagcaactccatccaacatgtggtacagggctgattgggtacaccactcgtcactatt^ 
aaactgtatatagagatgaacgaaaacagtcttctggatgatgttttcagtctc^ 

CGAATTCCCATGTCAGGTTTTCATTTACGTATTCTTTTAGGAGTGC^ 

GAGGTACTTTGGCACGCACGTG^GCACTGACCGAGACACAACGGTGTACTGAACAAGGGTTAAAGACGTTGCCT 
CTACCAGAATAAGTCCTTTCATrTTTTTAATTAACACACGCAAAACT^ 

CCTGTGOTCGTAGGGATACTTTCGGTGGTGTTAACCGCCATAATTATCTCTACTATATTTGCCATCTAC^ 
AGAAATCGCGAGTCAGTAAAGACCCCTTGGCTATACGCGTAGATGCTAACCATGTCCTCTCCAATAATTCTCCCTACATGA^ 
GGCTAGAAATACACTAACAGAATTTGATCCCAAAAATATGAATGAAGAGGAACGAGCAGAA 
GTGGAAAAGGAACAG<n'GCAGAAGAAGAAGGAG^3AAGAAGAAGAAGACGAAGAGGAACAGACG^ 

TAATGCTCGAATCTCTGAGAGGGGATAAGGCTGGATTTTTGATAGATGAGAATAAGACCATGTCTATACAAGGTTCTCCTGGTACAAAAACTGTCTGTAA 
GAAAACGAC C CT AC C AAAGT TTGT C CC AAC CC CT AAAAC GGAAAAGT ATGT T CCTTC AGATG AAAGATTTTT CAGGATGG AAG AAGGAAC AGG AG AAAAG 
GCAGAACATTATTCCTTCACGTCTGTTAAGACAGGAAGAACCT 

AACAGTACATGTACACATTTGAACCAATAAAGGAAAAAAAGTAATGTCTTTGTTTATTTCCCCCTTTTTTAG 

GC TGTGACAGAAGAT TACGGGC A C AATGAAAAGT TGATC AAACGGTT AC AAAC C TC TGT AT AT C AGACCC CTCT AT T AGGTGCAG ACGAT GT AAT GAAAT 
CCATATCAGACTACATAATTTCTCGTCGATTCATGAACTACACAAATTTA1TAAAACAAGTTGAATATCTT 

GT AACGCGCCTATTT ACTATT ATAGGAGGT AGGC C C AAT ATGACGGTGC GGGTTAATTTT AAAC ATGGGCAGACAAATAATC CTGCCTATGGTT ATC TC A 

CAGATGAT AATGAT ACT ACT AC TGT T ACTC CT CC TGTT AC T CCT CCTC C ATC TC CAGCT GCAAG AAGATC CC CTTTT TT CACACGCACT C TCATATC C G A 

GTCGTCTTCAGTTCACCATTATGTATTGATGCATGATAACCCAAAAAGATCTTCATT^ 

CCTTCTGTTCCTACCTTCCCCCCTAAAACCTCGTTTGAAATTTCTGACGTG 

ACAATGTTCACGACTATATTGCTAACGACCCCGTACCATTTTTAGTGGATGT^ 

CGGTCTCATTAGACCTGAAAAATACCCTTCAriTGTTAGAAAAGTGTGCTTAGATCT^ 

AGAAGATGAAAATCT CATCCCACAACCCAAAAAG AAAAAATC AAAGAAAGT ACT ACC ATTTCCTGTTGACAAGT AT AGAGCTGT GGATAAAAAGGTGGT A 

AATCTCATAC AC AAGAT ATT AG ATC AAGAAAAGG AC CACC TTTCT AG T AC C GAAC TG CAAAT G AT AACTGAATG T AAT GGTGC GC GAGAAGATC TGC TT A 

AACATCTTCTAGACGAAGGAGAATTTAACCCTACTATAATTGAAGTAGTATCATCC^ 

CGACAAGAAGTTT^ACAGATATCATTATCAATGTTGATCCACATACTTTTCTTCGCTGATAAGGGTACT 

GTTTTC^GGGAACGACTATAAAGTGGAATTTGAAAATATACGTAAAAAGTATCTGATATTGGAA^ 

ATGGTCCTCTTTCTCACATGCrCCATTCTTCAATCCCTATTGTACAAGA 

TCAATTCGATATATCATTCATAATTAATAGTGTCTTGTGGGGAATTGATAAAAGTC 

GTGTCGTACGTACCGATGCGTGTACGAACACCTTCAAAGGACAGTAATCGGX2CACAAAATACTCCTT 

ATACATTTTCATCCAGAATCTCAGTGTACAGAAACAATACCTTTAAAAAACTAACCGAGTTAGTGCATAA 

ATCATCTCCTCCTCCTCCTTCATTATCGGACAGCGTCAACACTTTTGTG^ 

ATGCCTTATGGGTTCTTTAAGAAAACTTTTGACGTCCTTTATTCTAAGGG^ 

CGTTTTTGCGCGCCTTGAAGGAAAGGGAAATTTTAATCTATAAA^ 

CAGGTGT^AAAACAATTACTCCCTCTCTTCAAACACTTTTTAAAAATTGAAGAG 

TGCAAAACTCAAAAAGATTTGCAGAGTTTCCCTGCCATACAGTCTGCTT^ 

ATGATGATGAAGGAGGAAAGGGACATACCCTATTAACATTTGCTATAGTGCACAGATATCCCTTAATAAGCC 

GTTAGTGAATACTACATGTAGAGACAAGCACTTTACTCCCCTCATGCACCTCGCCAACA 

ATAAATGGAGCTAAACCAGAGTTCATAAACAAGTTCAACGAGAATGTTT7GCATATAGCGATTGAA 

GAACATTATCCAGCGAACAAATTGAAAAAATGGTCAATGTAAGAAGAATGATGGATAATACAAC^CCTTTAA 

ACTCGCTCAGCTTTTTGACGGTCTTTACAAGCCCAAAATAAAGGTCCGTTTCGGTTCTTC 

CTAAAGGAATCAGTTGCATATTTGGAAACGAGGAATATATCCTACGATATTAACATCATAAAGCATGC^CT 

ACGAAATAGCAGCAGCAGGACTGCGAGGCAATAACTGCGACCCTGAAGCAGACGAGAAGACTATGAACACGTGG 

ATGGGCAAGCTCTATTTTCCAAAAGAATAGGCAGAAATTTGTAAAGATTGTGGATGGTATGAATAGG 

GATAGTCTGGACGGGGATCTTCCTTCAGGGAGAACAACGTGCGGTCATTG^TTCCACAACGT 

AT GGCAG^CGCGCAAGAGG AGGAGG AAT AAAATGCCCGTCCTGC AG ACAAGTC ACCTGCC TCGGAAAAAGACTAGGGGTTGC CG ACTATGAT ATT G AAAC 

AG AGGAAGAACGTGACACG AAAAATGT CGTGCCTTCGGT AGAAGAAG GAAGAAGGGAATGG AGG AAG ATTGGTGTTG ACAGAT ATGAATTTCTT GT AGGT 

GGAGTGTGG^CAAATGAAATAAAACTATAAAATGAACTAGAATATTGGTATTATTTTTACACCTTAC^ 

TTTTGAGATTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGGGTCGACCCAGCGGTCCACCC^ 

CCCTCTAAACTCGAGCGACCCAGAAAAATTTTTAAAAAATTTTTGAGATGGAGAA 

CGGTTCTAGGTGCCCGCCCTCTCCAGAAACGTCTCTTCCAGAAATGTGTTTT^ 

CC^GCGGTCCACCCTCGGAACTTGACATC^GGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCC^ 

GAATGAGTAAAATCTCTTCCCTGAAAGGGAGATCTGAAGAGGCTATACCGGTTCTAGGTGCCC^ 

AGTTAAAGTAC^TTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGG^ 

TCCACCCTCTAAACTCGAGCGACACAGAAAAATTTTTAAAAAATTTTTGAGATGTAGAACGA 

CCTTTCTGG^CGCAGCTAGACCCAGAAACATCTGTTCCAGAAACTTAATAAAGTTAAAG^ 

GGTCGACCt^GCGGTCCACCCTCGGAACTTGAC^TCAGGCCGACCCAGAGGTCCACCCTCTAAACTCGAGCGACA 
AGATGTAGAACGAGTAAAACTCTCTAGTGG&TTAGGGTGTTAACAGCCTACCCTT 

AAAGTTAAAGTACATTTCTGGAGATGTGCGCACGTATGCATGCATGCATACGTACCTATCTGTATTTTTTTAAA^ 

ACAACCACCATTCTCCGCTCACTATACAGCTGACCrTCGTTGAGTAACCATC 

AACAAGTACCTTCTAGATGGAGGTGCAGCAATCTGTCTGCAGTCTAATTGTIGTTGC^ 

TGCTAGGATTTGGCGACCCCTATAAACAGAGACGGGGAAAAACAAACAGCAAAAATCTGGCCATAATTC 

TGTCCAACACCCGACAGAACCAGAAAGGTTTTGCTCCATGACATTCTTATT^ 

AAAGAGTATGTTGAGAGCTGCAAGAAGCAC^AAAGGGTCCACAAATCTTCCACAGAAATGAAAAGA^ 
CGATCGCCAAATCACCTGCAATGAAGAACTACAACAAGATAATCTTCCCTGCCAGAATTGGGTGCGCGGCAGCT 

TTCTATTCGAGATTTCTCCACAATCATTGATAAGGAAGTGATAATAGTGTCTCAAAGGATGTAATTAAAAAATAAAACCGTCGCTGGCAACCCCCGCCAC 
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CACCACCATCGAGGGGGGTATAAATAAGGGGCGCTGGCACATGGGTGGCACACTCGCATCATGTCTT^ 

AGATGGTTGGGGACTATTCAAGATGGACAACTGTCAAGAAC^GGAGGAACAGACAGCAACAGTATTCCCAT^ 

TCAAAAAAGAACATCAACCAATTCTCCTCeTGCTCCACCTCCTC 

GACCAGTGCAGAAATTGCGATGAAACTGGCTATTACAATTTCCACTCTTATGATAGAA^ 

GCATGTGGCGGCGCACAAGTAGATCTTCCCCCTTCCTTAATAAGAAGAAGGACGTTGACGA^ 

CAACAAGTACAGTTTCCGTGAATATACTCCTTCATCAAAGCTTGTGAATTGGCGAGACCCTTCACAA 

GCTCGCGCCCCTACACCCACTCCCCAAGAAAAGGAACCAGAAGTAGAAACTAAAGATGATGTTGTCATC 

AACCAGCCCCAGTTCCAGACCCAGATATTCCCGCAATAACTGCAACTACTACTACTAC^ 

CAGAAATGTTATT-CTGAGTATCGTGTTTTGGTTTCTGGGTGTTTATTCTGCATTATTT^ 

ATGAAATTTAAATCTTTATTGTTTCTTTCCAATTACTCCATTGAAATTGTCCT^ 

AAGGTTGTATAGTTCCATCCCTCTTTGGTGGGAATGGTCCTCTATCTGATTGACA^ 

AGTTTTTTCTCTTGTATAATAATATCTGCTCATTAATCCTTCATATCC 

AAGTAG AGGGTTGAAAGAT C ATCAAATGGTTCTGGGC CTTT AACTCTT ACCCCC TCC AAT GTTGT TTTTGTTCC ACTAAGATCTGAAGGTCTCATGAAAT 

CTGCATAGGTTATTTCCACACCTCTTGTACAAGCACTCTGAGCAGCTGTAGGTGCGCAAGATTCGAATCCTCCATGGTAGGGTG 

ATGGACTCTTCTCATGACTCTAGGGACTCTTTTACTCGCTTCATAGA 

CTAACGCGGJ^TTTTCTGTAGCCTTTCCTTGCTTATTGATTGTGAAMCTTCCGCA 

TCTTTGTACTACCGGCAGTGCTTCCTCCTAACATTTCCATATTGCTGTTAA 

TTCATTCATTGTCGCAGAATCGGCCGATATTGTTGGCGAATCTATGTCTGTAAA 

TAGGGGGAAAATGTTCCCACGTTAAATGTAGAGGCAGTAACTCCATGAGAAGACTCGAA^ 

TTTTTGCTGCATCCCrTATAATGTGGAAAATTTCAGCCTCCGCAGGAGGCACGCCCGGCAAGCGT^ 

ATCTGTC ATACC AAC ATCCCCTAACAATTCTGCGTTC CT ATCGAACGTACCCTTC CTTTC GTCC AAAACAGCAGC AGGT ACCTTTTC T TGTCTT GTGG AA 

TAAGGAGTGTAAGCCGTACTCCTACGCATGGCACAAAATTTATTAGGGGCCAACATAACATTGGA^ 

GGTTTATCTTTCTTCCATTAGTTTTAGTTTCT^GATAGTATTTAGT^ 

GTTTTTAGGGAGGGTAGTCATGAGGGGAAAACGTGAATGCCTATTTCCTCTGCTAGGTGTTGCCGCTATAG 

ACTTTGGGACAG AATAAACCGTTGGTG CAATT AC AGT TCTCCTTTTCTCCCAATT CCCCG ATTT T ATCGAAT ATGTTTTTACTATT AGGATCTTGC ATCT 
CAAAAATACTCTTTTCTCCAATATTGGCAGGAAGTAGCCGTGTTTGTTC^ 

GCCCAGGGGTTTGGGGACCATCATTGTAGTCTACTGAGAGAGGGCACCCGAGCACGGGTATCCGAAAG^ 
GAGTACACGCCGTTCAAGTAGTGGCCGGCCGTGTGGGGGCTCAGACCCCAATCTGCT 
AGGAAAGAGTCCTAACCCTGAGTAGCGACTCAAAGAGAGCCATAGGAGGAG^ 
TAATTCATCCCCTTCATGGAATTTTAGAGGGACTArrCCATGCATGGACCCGACCGTGCCGTTC^ 

CTGGAAGGTATGGGTAAAGGAAATCCCCCTAAAACTCCCAAAAATGATGTAGCTTGAGCAGCCGCAGACACTCCCGCAGGAGGAGCGAGAGGCTTAGAAA 
AGGGTCGTGCTCGAGCGTACGAATTGAAGGAGGTTTTAATGTCTGTTCTCGAATl^AGAATAGCTC 

GTTGATGGCTGTCAAGTTGTCGGCCGAACCAGGGTCTCCTCTCGC GTTG ACCGGC GATCTGCGTTCG AACCAGGTGTGTTCGGGGAGAGAAGATGCAGG A 

ATTTC CAT AGGTCT T ACAAGAGGTGAAGATGCAC C ATTC ATT AAAAC TCC AT CTC C AGTG AGG ATTG CTCCTGCAAAAAT ATC GGGC TTTTTTG AT AACG 

TTTTTGCGACCCTTTCGGCCTCGCGCATACCCGTACCTTCTATTC^ 

CGCAGTTGGAGGGTGAGCAAACATTTCATATGCTTCTTTTTCCCCCACTTTC 

GAATTTGTAGACAATATTTTCTCACTGAATXrPTGCAAAGGTGGAACTTTTCTTAGCAC 

TGAATTTGGCATTGTTTGGTTCATTGTATATTGATCTTGTATTGATGGGCAGGCCAGAC^ 

GTCAAAAGAGAGTGCTTGAAAATTGTCTCGATGGGAGTGTGGGCCATTCGCTTAGCGGTAGGTCTATA^ 
TATCATTTTTAGCAGAAGGTTTTTCCTTTATTACTCTATGGAGTGAAGGAACGACGTATGCTGTTGCGCATATC 
CTGTAAGGGGTTATCTTTGTACGAATCGTACCCTGCCTTGTCAGAATCTTTA 
GTCTTTACTCCTtSAAGGATCGCCTGCTCTCAIGTGAACCGTC 

TTCCTAAACCGGGAATATTG^^ATGGGGTAAAGGCACGACCCTGAGCGATGCGTTTGGATGTTCAGTAAAATTTC^ 
ATCGTTTAGTGCATCTCCGTTCCAGATGTGAGCGGCCACACTTCCTGTTCCACTGTAGGG 
TCTTCGTCAGAATCTTCAATTCTTGAGTGCCAATTGCCGGCGGCGTCCATGGC^ 
CTG7AATTTCAGGTGCTCTTCCAAAAAGAGAAGCGGCATAAGAATAGCCGAGAG£CGAC^ 

AGT ACTACCACCTAAAGTTTCGTACCT TTCTGTTATTTGCAGCAAT AATT TTGC ACCCTCTACAAAGGGTTGAGCATAATCTGTTTTTGCCGCATCGGG A 

AGATCTATACCGGCCACTTCTTCCAGTCTGATTTGTGTACTTTCATCACAAAGACAGCCCTTTGTTCCAGTATCTCCTA 

CAGGGCAGCCTAGACCTTGTTCCTTTTCCTGTATACCACCTT^ 

TTCGCACAAACAATCCATAGTAGAAGTCATACTGCTTATGTCGAAATCGATATCGTC^ 

CC CAC TCCTTTTCCTCCACACAGT ACAATATCTGTGC ACCCAAGAAT GGGCTCT AATCTT GG ATCTTTATTGAGAAGGG AAGGATCCTTTCGCACCACAG 

ACGAATCTACACACCCTCTTTTTTTGGTGATATAAGTCGTTCCCTTTTTTCCTCCTCCAGTTATGACTCCTCCAGATT 

GACAGTAGTAAAAGGGTTGACGTTTCTGTAGGCTGCATCTGTAGCAACATCATCGTCGTGCTTGTCGAACGC^ 

AGAGGGACACAAACTCCGTCATATTCATCATCATCTTCTATACCGTATTCAGTTTTC 

GACAGCTAATGCATTCAAAATCGTTGCCTCCTACGCGACATTTTTTAGGAGTTT^ 

TAGAGATTCTACAAAACCTTTCCCTATGATGTCTGACTTGACTCCAGCCATACCGAAGGGGGCGTC^ 

ATAATTCCCCCATTTTCTTTTAATCCTTTGACGCGTAAAGAGT^C^ 

CCGGGTACAATAAGACAAAAATTATAAATTGGGAAGACCGATACAGTCTTTCATGACAA 

ACATTAAAAATGGTTGCCCGAAGCTCCAAGACCAAATCCCGCCGTGGAAGCAAGAAGAGGTCCACCACTGCTGG^ 
CAATGAAGAAGCGTGCAGGAAAGAAGAGCTCCACTGTCCGTCGCCGTTCCTCAAAGAGCGG 

CCCTGT ACAACAACT ATGTTATTT AATTG ATT TTT TT TCTTCTG AAT AATTGGAAAT AAT AAAAC ATCC ATTGAAACTT ATGCAGTATTTTT ATTCAATT 

TTTTAACCAACTATAAATCCACATGTGGTATAAAAGTTTAAGGGTACAGATATATTATAAGATGATGT 

AAGAAGTGTCTATGAATTTCTTGTAGAATTCTTCCCTGTCTGTACTTGACGCT^ 

GGGTTCTATTTTAGACATGACATTTTGAGGGATTTTAACACATAATACAATATCATCAT^ 

GAATTTCTATCCACTAAAACTAGACCATCGTCATTTTTCAAATGCTTCTTTCCTTCATTTCT 

ACGACACCACCTTGTCTTGATTCCCAACTATAGCAAATAGTAACGAGGAAGTAAATTCATTGTTTGCACCATTATTATTGTAGAAC 

AATTC CTCC ACATCT AGGGTCT AG ACGT ACCAGAAATTGTCC AT CGGC T AACCGAGTAAC AGGGTCGCCGCT AGGCCATGTGGCT AGCCG AGTCTGT AGA 

GGGGCAACTGGrTGTCTCAATTGAGAATAAGAACCCACTGCGCGTAGGCGGCCAGTGTTTCTCCAGTTC 

TATTTGCTCCAAAAACAACAATACCCCATTTTCCTAAAATAGCCGCAAGTTTCTTGTCTTCTTTATTTACCAGAAGATTT^ 

GTCTACCACGTATGTAGTATTTGAAGTGTGCACAATTTGGTCTAATGTTTCTTCAGAAACGCCGTCTCCCATGTAC 

TGTGTATACGCTTCACTAGGTGCTGGCAATAACTGTCGATGACGTTCTTGTTCCT^ 

GACTATCTGTATTGAWVATGTTTGTAGTATCATTCGTTCTTATGGAAGAAATGAAAATG 

CCAT^CAGGTATGZTFTGTAGGTAGGGGGACAAGACTAGTCTGTTTCATGAAAGGGTCCC^ 

TAGACTATGCCCTTGTTGGTTATGTCGGTGGCGGGTGTGTCCCTTACAGGTGCCGTAAAGGTTG 

CAGACATGTCTGAAAACTGTCCTTCAAAATGCCCGTCCATCGTAAAGGGCATCTGTATG^ 

AACCCTATCGTCTCGTGCACCGTCAATAAATATAGGGCGGCTGGTTAT^ 

GAGACTGTGATGTGGTAGCTTTTATCTTCAAGGACAACATCATCATCATCATCATTGACGATAAC 
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ACGTGATACTGTGAATGGGTTGGAATGGGTTATCACTCGCTT^ 
TGCGGGTGCCTTAAATGTAGGCTGTTCACCTCCCAGTAArGCAGTTTGA^ 

ACGTTTAAAGCACGAGGCAGTAGAGATGTTGTGCGTTCACCGGCATATTCTCTGGCGGGGGAAGAA 
TATCAGTTTTCAAGATGATACATGGTCTTAGTTGGTAGGAGAAGAAATCTGG^ 

AAAGT TT ACGCT AG AAC CGAAGC ACCCGACTCTCGAAGGTGTATT AT GTACAAC AAC TTTGTTCGTGTT AGCATTCACATC AGT ACTTATAACCGAG AAC 

AATTT ATC AGATTG AGT AAC TTTTGATGCTGGAAAT ACAAT ATTGT ACCCAGAAGATGTGGT T AAAACGTCAGACGACGTTTTGAAGCCCACAAT CC AGC 

CCAACATTTTCCATTGCGCTTCTATTTCACTTTCAGGTACAGACACACATATAAAGGTCCGAGCTGGGTTAG^ 

GGCGTTTAGAATCTTGTAACCCTCGGATTCATTTCCCTCCATATTCTTTAACGTTTCATTGTATACT 

GTTTTCGAGTACACGAACGACACTGAAGGTraH^AATATCTGAAAGTTGA 

GGGAAAGACGGC^GAAAATTAGCAGGGTAATCTGAAACATATTTTTCATT 

ATCAAGCCTCATGAGGGCTACATTTCTACAAGCAAAAACTTXCGTTATTTTT^ 

TTTTTATCCTCrTGTTACAATGGAAATTTTCAGTCTATTCCC^GTGTTGAAAATACrTC^ 

CCACGTGTTTGAATACGGCTTGTTGTTTATTGeCATTGTTGCGAGAGAAC^ 

GAAAGTTTCG^TAGAGACGGACCACTATCGCTGCTGCCACCCCCGTCTTCTCCTTCATATTTTTCATCCACTAAATC^ 

GGAAGAGGGACATTATCTGGTAATATTCCGTCAAGAATATGTATCCTCACGATGOTATCT^ 

TCATTCGAATTGGCAGCAATGATGATGATGATGAACCATCTCTATCAGG^ 

AGTGCGTTTCATTATACTGTGTAACAATTTTGGATCCTGCACATCAGTGACACGT^^ 

TTTGACGCCC^GTTCTAACCC^TTCTTCAAGTTCCriOT^ 

CAGCTTTTTCTATTTTCGCAGGTTTTCTTATGTOTATCACACCAGATCTTTCATCG^GCGTAAAATTGTTCTG 

AATAGAAATAAACACAACGAGAAGCGTGAATT<ZAAAGGCACCCCACCACGAGCCAAAAATACCCG 

CTTAACGGCAGGGTTG^TGTTCGGCAAATCAAGGAAACAAAA 

ATGTATATCAGACGTGGAACAAGGCATGGAGTGTTGTGGCAGACAAGCACAGGATGCATTAATGACTCGCCTAGCCAACTTA^ 
AAAGAAACTGATGTTAATTTGGAATACTTGAGATACGCGTCTACGCCCCTCCTTGGGGAATTA 

TCAACCT AATGGCTCMTTC TCCT ACGCTGCTTTGGGTATAGAAAGT ATACTGAATTCT ATACGGAGAGT TGTAGTGGC T AATCATCAACGTAGAAAT AA 
TGKSAAAAAAACCTTCTGAACCAATCTCACGCCCTCAC^ 

ATCAGCATGGGGGCGTTGGACAGATTGAATTCAGO^TAGTGACAGCGGCCTTGGGGGCTATTGCCAGT^ 

TAAACTACATGTACGATGTAGAATTTGCAGAAAGAGATGCTGCTACTACAGATACAGGGAATGTAGTCTATCTTTCCACCAAAA 
TGACATAATAAAGCGTTCAGAAATATTAGATAAGGTATOIAAACGACCCGC^^ 

CAAT TGATTTGGGGCG ATGATTCTGTAGATGATACTtSTTCTT AT AGATCTCATC ACCAATGC GATCGTG 
TCATATGTAACCATTTAAGGGCAGTTATTAGGAGTATGAGGGAAATTTTATAC^ 
ATGGTGCTTCTGGTTG AACC TGT AC AATAGACTGGAATGGTTCATGT T AGTAGTT AG^ 
GCTGACAATGTTAACGTGAAAAGACTTCTGGTGGTAGTTGTGGAGAGTTTTCCT 

GGCCTGTTATTAATAACAGCAATAATAATAGTACACTCCCTGTGACAGAAGACACCTTAATGAGACTAGCGATAAGGACGAGTAG 

T ATTTTC GACGAAAT T AACTCCTTGACAAC AGCAGTGACC AACCGTATTACCTT C C AGT CTGCAG AATTCTGCACAAAGATTTTGCTCGGGCGAGCTCTG 
GACGAAGAAGAAGCTGGAACAAAAATGCTAGTAAAATCAGTCAAAGAGACGGGA6AAGAAAAGGAT 

TG AAGAAC AC AAAAAAT GAA G AAT TC^G AAAT AAACAT AGGC G AT AAC G AT GATG AG ACT AC AG AT GT G G CTT GTTGG GCACG T ACTT CCTCGAC Arc CTT 
TATCCGTAATAGGACATATGCGTTTAAAAAAATATi^GGCCTTGAGGATGCAAGTGATOTAGTCGAGCTGAAGCG 

GTCACCG AT AAGAGCAGTCCTC TCCT ATTTCCGTATGTGTCCGACTGGAG TTGCTTACT ATT AC ATCCCTGTTGTAAAG C ACCGGCC AT AATTAAAAGTG 
TGTGGTTACAAATCCTGAAAGATTTTTCCCAGGAAAATATAAAAACTATAAATCAAAAG^ 
CCGTTTTAAAAATAAAAAAATTGCTGCCGAACACGTTCGCAGTGTAAAAAAGTT ATT AAAT 
GAACACTGTATTTGGTTAACGATTTTGTG^AAACAAGTCGTTCAGAACACTCTC 
TGTTCACTCACTTK^CCCGAGCCTTCCGCAAGATGAAC^CCTAGTC 

AT T C G AAG AGGATGT AAAGAAT C C G GAAT CATCCAT AT AT AG AACTC C GAT ATC CCT CTT CC AAAAC AAGGAT ATTGTT ACAAT AGTTGGTGATT AC ATC 

CTCTCTCCGAAGACGGACTCATTCCAAGTTCTATACCCAATCAAGAAGGTCATCGAACACTTCCCAGTAATCTTCCA 

TCTGGGTACACCTTCTGGACGAACGCCATCATCGCCTGCTCCAGAGCCTGCTGACGTACGAGATTCT 

ATAC TAC AGGCGCCCCATC AAC TATCAAACTGGGAAGAGTCT ACTGATGAGCAAACTGGC GTCC GT AAAAGTTCTGGACATTTTAATGAGATGTGGATC A 

TACAAATTCATCTCATTAATGTGCATGATCAACAAGAAGAACAACACCAACTTTC 

TGCTCCACATTGCTGAAATGTTCTTTXTCCAACCCTACTACTAGCCJAC^ 

GAAAACACCTGCCCATCTAGCAATCCAAGJUIGATAATGCTGATGCACTCCTGTTCCTGATCTCCCTCTACGGC 
TACATGAAATCTCXCCTTGAACTCAAGTCTAACAAGTGTGTCAAGGTACTATCCTT^ 

AACTAGAACCAGATACCATGTGTGGAGTGTGTGCAACATCTGTGGAAGAAGATGAAAATGAAGGG AAAAC AA 
CAAGCATTACATCCATTGCGAATGCCTGATGGGAATGTGTGCTGCTGCTGGCA 

GAAAG AT GCC CTCCTAC AAT ATTT AGATGGTT AAAACTGGCT GAG AGATCTGAACAC AATCGTGT ACTTTTTGAAGC AAAAAAGCAAG AATTCT ATAAGC 
AGATGGAAGCAATGAAACCTCCCAGAG TTG TTGTTCCTCCTCGCAGGACATTTCTCACCCCAGCCAG 

AATTGCCACCAACGCCATCGCTGAAGCCACAGCTCAAGGAGATGTCAACTCCTACTTCCCTGTTCTCATTGACGGGAGCGGAGAAG 
GGAGAAGAATTCTTCAATrCTGAAGAGGAGGCGCTTGCTTTTGGAAGACCATTTCTGGAAGATGAGGAAGAAG 

CTGAACTGTCTAGACGAGGCGTTTCTGTCAATATTATTAACAATGATAATCCTCATCGACACATCTCTACAGTAAATATTGTGCAACC^ 

TGAAAAGTCACCTGCT^TTCCTTCATCTACAACATXKTTCAAGAATGACGTCTTTGACT 

CCCGTCATGAACCTGTCCAATGACAAGAGGGCATTATTCCAC^^ 

TGGACTTTCAAGCAGTGTATGATCCCCATCACATATCCAACTATATCGAGACGTTT^ 

CGGGGCCCAGGACTATTATGCAGAGAGTATCAGATACGACAATGATATTGTCTeATTCT 

GTCTTTGAGGGTAGTTTATTATCCCCACTGTTCAAGAAAATCAGGACTGGAAAATCTTACTCTAACTGGAACGAC 

GAGATATTGCTGAGGAATTTGTTAGGGTATGTGAAAACTCTCTAGCTTCACGCGAACACCCCCCTGTTCATGTACATCCC 

CATTCTCATTGAATATATAGTAGATTTCATCCACCACTGCATCACCTGGTCTATGCAAGT^ 

AATACAAATCTGCACCTGTTAAACTTGCGTCCTACTGATGAAAGGGTGGAAGTTTTAAGGG 

ACAACACTAGAATCTCCCTCAGCACCAAAAGATTGAGCCTCATGAAGATCTTCAACCATGATTTGG<3TGT 

TATTATTGAAATGTACTGTTTTACTTTAATCTAAACAATAAAAATAATGTAAAGAATTC 

GGCGGGTAGCCTCCTCXACAGGCACACTTGTCC^GTAGCCGTTGTCGACT 

AG TCT GAATT GK^ AAGAAATT GC CG AT ACT G AATTCCTGAATT AT CTT CCC CAT 

TCCTACACTAGAAATGAGGAACGAAGTTGACCATTTCTGGTCCCAAGATAACAG^AAGCTGAAACTCTTGGGGCATTTTTGTG^ 
GCATTTATAGCTGCTTCTATAGATGCTGAAACGTGCGTGGGCTTTTTGAGGTCGC^ 

TTGCCCGTGAGGATAAATCAAATACAACTAATTACAACTTGTACATTGATAGAAATTCAATGATGAAACAAG 
TTCATCAATACAGAACACTTCTCACACAAAATCTTCCOICCT 

TCCAAGAGGACGGCCAAGGAAATTGGAGCAGCCAGAAAGGTTTGTTTCCTCCAGAACCTGATTGTTGCCATAriAA 

TTCTTGATACAGGCAACGAACTAGAACAGCAAGTATTGGATGATGCATATTTTAATGCGGAGA 

ATTGAGTACTTTC^AC^TTTATTTGTTAGGAAAAGTCTTC^ 

AACATTCAAAATTCCTCATTCTTTAATGAAGATACGACJVCCTATTCTAGCGTCAATTTCTATACCATCA^ 
ATAGTAGATGGAGATGTCCCAATAACTTGATGACT^SCAGCAGAACGGTCTATATTTTTACGTGG^ 
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TGTAATTGTAGGATCTACTATAATGCCTAGTGTACTTTTCTATGGCGACCGAAAACATC 

TGTTCTTATTGGAAC^GTATATGGATTGCAGATCTTATGGGTTTGAGATTATAGACACACCTGAAAATAACT 

ATTGCTCGAACACAGATTTCCArTCACCGGTAACGCGAGTCJACAAGAA 

AGAACCTAAGTGGTACAACAAAAATGCCATGTGTGGTGAAGTATTG^ 

GACAGATTTGGTAGTCTATTACrTGGACGAGAAAAGAAATGGACGTGTAATTArCTAGATAGAATAAAGTCTCTAGAAACTATTT 

GTAAAATTGACACCATGTGTAAAATTCTAGAAACCAAGTATAACTACAAATCTTCTTCC^ 

AAAGATGAAGATTATCGCCTCTATAAACAAGAGGAG&TATTTCTGTAA 

GATCACACTAAAACCGGCAATGGTGGTTGTGCGTTTTCAAAGTATAAGAAAAAACAACTCGAACCTAAACAAC^ 

TTGAGGCTTTTTOSTTAATTAAGATGCTGAGAAATGATTGCGAACGCAACAAGTGTAGGT 

ACGAGAACTGTATAGGGCTTCGGCCAGAAGTTATGTCC^CGATCTGGTATTGAAGCG 

GCTAACACTATCATGTCTTTAATACCTAAATGTAAGCTACACA^CT 

GGGATGGAAATTATAACCCAATAAGACACTCTATGCTGGAGTTAGTATACGGAGAGGACT 

AAAATGGTGCTCGAAAAAAGGTGTGATTAAATATGAAGACTTTTTGGATCGTTACGAGAAA^ 

CTAAAAAAAT GTAGT AGAGATC AC ACT AAGGATATAAAAAAAA? AGAAAATGTACT AAATTCTG AT ACAC TTTATTC7TATTCTCTC GAT AAAAATGTGC 
AAACCCACGCATCTTCTAGTACAGTTGTAAAAAATGACACTGACGGA 

AAC^ACCAAAAAACGAAAACTGGAAACGATAGATATATCGAGTAGTGACGACGACGATGAAGAAGAAG 

AATAACTGCAGCAGCAGCATCAAGAACAAGAGCAAGAACAAGAATGGGAGAATCTGTTGCACAGATAT7C 

CTTTATCGT^CAATTGTGTAAAAAGTATGGATGTGTTGAATTTGTTAT 

ACACCTGAGCGGTTTGGGCCTGTTGACGTGCCTAACTTTTTGTCACCCTCATGAATACAATTTGTAAAGGT^ 

TTCCT GT ACTGAAGAATATTGAAAG AAGACTTCTTGAAGAGGACCGAT AAAAAAATGGCC ACCT T C C AG ACTGACGCCGATTTCTTGCTGGTGGGGGATG 

ATACTAGTAGATATGAAGAAGTGATGAAGACTTTTGATACTGTTGAGGCAGTCAGGAAGAGTGATCTAGATGACCGTGTTTACATGCT 

GGGATCTACTTTTGTCCTCAATGGAGGCATCGAAGAATTGCGTCTTTTGACTGGAGATTCAACGCTGGAGATTCAA 

TAAAATAAAGACGGTGACGGGAGACTAATATCTTTCTTAGTTTCCCGTCACGGTGAAAATGTTGGTTATTT 

TAAAAAAATAAAACGAAAACTGTCAATATATTGTimATTGATATACAATATCCCTTTTTACACAGAAATGGCATGT 

AGTGTTGTTTCCATCTTCTTCTTCTTCTACGTCGCTATCAAAT^ 

TGrrCTCCATTTTCTCGCTTCTTTTGTTCCTCCCTCTTCTT^ 

GTCTTTCCTTGTTGTACCTAGTGACAATTTTTGATGTGTTCTCCATCATG 

ATCAAGTTCTTGAGGCGAAGAGATGTCTATCCTCTGCCCATCTTGATAGACGGTGCCATCATTCATAAA 
ACAATACCCTTCAAATCTTCATCACTTGGAGGGGTCTGGAA 

CGTTGCAAGCGATAGGTGCGTCACAGCCAGTCACGGCCATGATGAGTCTGGGCCTC^TACCTGCAGTATCCCTTTTCTTGrTTAT 

CCTCAAACCATTTTCTGTTGCAGTTTCAAAGGAACCACTTTTGTCTGTCCTGCACAACAAGTA 

CCGCTGTCGTATCTGAAGGTCGCATTGGAAGATTTCTTGTGTACATTTCT 

TATACGAGTTTTTOTACTCCrcTTTCTTTACTACTTGTGTTTCTAGTTCCTTT^ 

TTTTGTGGTCTTTTCGACCTGGTTTTCTAGTTCCTCTTTTATTTGTAAATTTTTCCTAGCC^ 

AAGATAACAGCGTAGGCTAGTGTCGAAAGGTACATTTCAATGCTTGC^ 

CAATCAAGCAATGTGAAGAATGGCTCTTGAAAATAGGATTGGGTATATATTCTCCT^ 

ATCCTTGAAACCATCTTTCTGCATAGTTCTCTCCATTGACTTTAAAGCGTGCGTGA^ 

ATTATCATACAATCGTCTTTTTTTTCTACTGCTTCAGAAAGACTATTGCT^ 

ATTTTTCCTTGACTATTGTATTGATTTTACGCACGTCTTCTTCTTGCGTTGCAAGCCATTCCTCG 

TCCAGGTTCAGGAGGGGCACAACTTATCCGTTCAATTCTTAAAGAGT^ 

TTATCTTTCTTCACATGTACAACAAATGTGTCTTCGGACAATTCGCTTTGTCTTATAGATTCGATAGCTTCATAAACACCCATGACT^ 
TTGTTTTCTTCCCGATCATCAGGAATGACGATCTTAGTTTTAACTCCTTTTCTGATACACTTTCATTACTAACAGG 
TTCCATATTTATCTTCTTCAAAACAACAACGGAAAATACAGTTTTCTTTTAAGAAATATAGGTT 
GTTCAAAACTTAACTACGTGGTTCGGCCCATTTTGCACCAGATCTCT 

GATTTATATGTATTTGTCGATTTTTATTAATATTAATAAAGTATATTTTTAAAAATTAGGATGGGGACA 

GT T ATTT T AC C T T AATT TTTTT TC C CT AT AAT ACAAATAATTGGT AT C AGTTTAGG AAAACAT ATTTTT AAAATGAAGGT T ATC AT AT C A ACTAAACAC T 
TTTATTTACCCCCTTTTCACCTCAGAAATATCCTCAC^TAATCGCATGCAAGACTTGCGCATTTCTTCCCTGCCCT 
CCAACAGATACGTCTCCACTTCTAGAGATGGGGGT/TGAACATTACGGGAAAACMCTACCCTAGATAC^ 
ACTCTTGAAAATGGGGTTTACAACAATATCACCCTTGT'ATrrTTTGAGATGGCAC^ 

AT ATTTCCCT CT ATC GCT ATT AATTTTTCCATG ACATCATCAGCACTAAAACCGAGACC AAT CAAAAAATCT GTGCT AAC AACAAAACGTTCCTTTTCCC 

CGAAAGGTATTTGAGGAAGACTTTTTGTTATATACCTGTAAGTTGGACTGAAATAAAACCCTCT 

TACTTCCTGTACGTCTTTGTCTTGAGAAGCAAGCC^CTCGTCATCAGTAGTTGTAGACGTGTT^ 

GATAATAATGATTCTATCTTGAGGGAATTGTCCCCTGTGAGCTCTCTTA^ 

CAATGAAAACGCCGTCTCTAAATTCACTCTTTCGGATAGTTTCAACTGCTTO 

CATTAGAAAAGAAGAGCGTAGCTGTAACTCGTCTTCTTCAGGTGTCTCATCCAGGAGATTTATCAGT^ 

GT C AGTTG ATTT ATTTT TCTCT TTG AC ACAGTTTTTATGGTACTCTGGG ATT ACTTTAC TCAAAAATTT ACTCACZATGTT GTT AAAAGTTTG AGTGTATT 

CTTTTGTGGTAAACTAACAATGAAGTACATTTCTTAGTAAATGAGGTAGGTACrrGATTCGTGATGGATAACGATGGCGTCCCATGTCTAGTGAT^ 

CAC^TAATAAAAAGGGAAATAAAAATAAGAACTAGTAACATATTTTTTATTTTATCCTTCTCA^ 

AAAAATGGACGCTATCCTTGCGGATATTAATGGGAATGATACAGATTTATCCAAGCTAATCACJVGACGTGATTCAAAAGA 

AGAAATAGGGCTAAAATGGACATGAATAGAAGAGTAGATGAGGCTATTCAGGAAGCCGTAGCGGCCAAGAAACAAA^ 

TCCTGGAAGAAACTGACAGCGGACAAAGTGTCCCTCCAACATTATC 

TGTAGAGAGTGTTGAGGGAGATTCTTTGTATGATAAACTCCCTCCTTTTAACGTACMGAC^ 

TCTTACCTTGCCATGGTAGTGGTAAAAGTCGACTGTGAAACTGGGGATATCGAAGAAGAGTACAATCTTGCTCCTACCTTTGGTGTGA 
AAATATACAGAGATGAAAGAGACCAGATTTTTACAAAGGCTGATAAATCTGTGCGTATTTTTA 

ACAACTGACGTATGCGGTAAAAAATAACAATGAATATACAGAGTTTGTCTGTAGCG7TCTTTGCAGAGTTTGAATCTGA 
ATAGGTATTCGTGAATATGACAAACCTAAAAATGAATTCGAATATGAGGAGCGAGAGATCTTTAC^ 
TATTGTTATACTTTTTGGTGGACGTCAGGTCCCGTATTATATAGGACTATAAGGCACA^ 
TACGTCATTGGTGTTATTCTTCTTCCTTCTTTTCGTCTCAATAACAA 

AGGAAGAGATATCGTACCAGTGAGAGTGGTGACGGCATAGACGGAGGAACTGGAACAACAAACGGAGGAGGAGGAGGTGGAGGAGA^ 

GAACAAATGGAAATGGAACTGGAACAACAAACGGAGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAACAAATGGA 

AGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAAC^AATGGAGGAGGAAATGGAAATGGAGGAGGAA 

GACACAGACGATTTTGAGCCTACGCCAGCCCTTCTAAAAGAACGTCTACTTAATTCCATATCT^ 

CAGAGGTAGAAACTGCCTTACAATTATCTCGAGATGATTCTACCCAAAC^ 

AGGAAAACCAAGGGATTATTTGTT C AAGC TCGC AGGTGTTTC ATCGGCCT TTTTAGAAGGT ACC ACT AT CCGCAAAGCGGAAGAC CGTGCCC GT AATATA 
AATGAGGAAGAAJOTGCACAAACAATATTAAGTCAACTAAGAGAAAAACACATCAACGATG 

ATTTTTCCAATAGTCTTAATCTTGTTACTAAATACACAAACCATGAAGTGGGTCTTTTAGTAGGAGAAACTATTGA^ 

GTTTGAGAGATGCATAATTCTAGTAGAGGATTTTAATAGTGGAACTATTACTTCAAACACTATGCAGTACAG 

GAAGGATCAACAACAGATCCAGGGGAAGTTGTCCCTGATGATTGTTTGGTTT 
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CTAACAGATG^CAAGACATATGCTTTGTAATTATTCCTCGTTTATCCGCA^^ 

AGAAACCTATCTGTTTGTGGCCAATAAGAATGACACTACTCATTTTTCAATCAT^^ 

TTCTCAGAAAGAATTCTCCCCACTTTGAGTGACCCTGCAACCGTTCCA^ 

TT G/GTGCCTTT AC AG ACAAAAATTT ATTGTCC AGCC AAT AAAAAAAATAT AAAAAAAGATTGATTGT TTT ATTTTCAAAAAAAAT AAAAGACCATGGGAG 
ATAAGCAAAAGGTGGAACAGCTTTTAAGGGAATTAAAGGCAGAGGCTAATGATGATTGGCTATCGGCTAATGTTGATCCAATTGTTGA 
AACTAAATCTGATG AGACAGCC GAAGTTGTT AAAC AGGCCGTTGATGAAAAATAC GATG AATT ATT AGAAGATAAGGTTGAAGAAATGAGACCGGAT AT A 
1 ATCAATGAAGCATCCGAAACATATGATAAACTTGCTGCTGATATGATAAGAGAGGTAG 

CAAGAACTATGAATAATTTAAGAGATAAAAGGAAAGAATATGAAAAGAGGCTArGGLftCAT^ 

AGTGATGGAATTTCGTTTATCATATAAAGACCTGACTGTCCATGCCMTTCCGATACATATCT 

ATTAATAACGACCGCGCCAGCCCCGTGAACTGCTCCTTGTCAGTGT^ 

ACATACACATCAGAAGAAATGATTTACAGGAAAAGGATCTTTACCTGTCTGTTATTTGTATGTrAGATACTGATTTTTCAGGATATGATAA 
AGTTGATGCCCATAAATTTCATTTTGAAGCAGGGAACAGGACTATGTTTTTACCCA^ 

AT ATGC ACTAT AGTGTTCCCTC CCGCTTCTGCCAGTTCTGCTTCT AC AACAGAACTGGATAATGTCT ATT AC AGGAT C ACATGCAC ATG CTCCT AG AGGG 

GGGGGGTGGCTGTGTCCTTGTATAAAAAGCGAGGTTGGCACAGACTATCAAACATTCAGCCGTAAACCATGGAGGAAGAAAGT 

GATCGGAGTCCTCCCAGAAGAGGCTGCCTCCCAGATCCTGAAAGAC^CCAAGTTAAGAGTATCAT 

GTTATAAAAAGTGCTCTCCAGAAAGGATGCAGGAGGAACGATGAGGATATAACTGCCTGGTCAATCAGGGA^ 

ATTACATTGAAAACGTAAAACCTGCTGCAAAATCATTAAATACAAATATGGTAAACAG^ 

GGTGGCTTCTAATGAGTGTGTGAGAACTCTAGAAAAATATGAAAAGGGGAATTT^ 

GCCTCCAGTTCTAGAGTGTGCTCCCACATGAGAGCTTTGTGTTGCAAGGAAGAGGACAGTGATAAATT 

TTGAAACTCAGTGTGTTAGTGCAGTTAATTTTTCTCCCATAGAAAGAATTAAACACGTCTTCAGG 

ACAGTTATTAAATTTAAGAAGTGTAGCAGCTTACCATGTGTTAAGATATTATGGGGATAAAGTGAAAGAT^ 

CGAAAAGAGTTTGAGCAGTTTTGGGGGTTATGCTTTAAATTTGTTACTCAGCACGTAAAAACAGACCCTGAATTACGTTGTTATT 

ATGCCATAAATTGGAGAAGGGATTTTTTCrrGCTCAAAAGGGTTCTTTAGAGAAGAAAGTCTATTCCTTACATCTAT 

CATAGGTGACCGTAAACAGTTTGCCAAGATTCAGAAAAGGGATTTAAAACGTTTCAATAAAGGAGAAGAAGGG^ 

TGGATAGAAGGACATGTTAAACGGATGCCnXIAAATGCCTGTATGGGTTCTGGACAAACATACGAACAAAAACACAC 

GTAGTATGGTTTCTGGAGGAGACAAGCGCTGGTCCCCAGGGGTGTGGCTTCATTC^ 

CC AATTTCTGGATCAGGCTTTTAATAC^TTGAAGCGAGAAGCTGCTAGTCATTGCGTAACGAGGAATATTTGTACTA 
AGTTTTACTGCTAATATTAATTCTGAA<XGATGGAAATTAAGGAAGAAATAAAGAAA^ 

CTGTTAGTGCTACTACTAGTAGTTCTATTACTTCTACTCCACCTCCTACAAAGAAACAAAAAACMCTCCAA 

ATTGAAT AATCT ACC AACTTT AAAT ATGGAGGATTTAGAT AG AGT AC TAGAAGT AC ACAACC AAAAC TCTAAAAAGGGT GTAGCTGCT AC AGTTTT AATG 
AAGGATGGAAAT AAAGTTGTGT TTAAGGAGATGAGAAAAAGC TTT GGTTGGGGTTCTCATCAGAATTTTGTTCAAGTACT AAAGGATGAAGATGTGTGT A 
AATTGGACTATTTGTTGCCATGCCCTGATAGCGGACCATACAGAGGATTATACAGATGCTATTTCAAAATA 
TAGGATAGAAAAAGTGAAATGGGGAGAAAATGCCATGTGTTATTTCATCTCTGGGTGTGTGACTAG 

CGCCTrrCACATATGGGACCAAATAAACAATATGTGTATGATAACTATAGGCAATTAATACACATTTTAATTTTTAGATTG 
CCAACACATCAAACATTCTCGTAGGTGATGGAGGAAACTTGTTCTCTGTGGACGAGAATTATGTGGGCGCAAA 
AAAGATAAAGGAACTACAATTGCTACTGAAGACTTCCTTTAAGGTAAACUAGGTAACAAAGGAAGATATA G^ 
ACJVTCAAAAAGTGATAAAATAATGAATGGTCTATGTAATATTGGTAAAAATATGGGAATTGGTCC(»C 

GT ATT TTAGGGGTCGTGAATGACCTGCTTT ATGAC AACAAAT AAAGG AAT AAT ACTTTAT ACAATTTTTC^TTTCCTTCCTTTTT AAAGTCTAT AAAATT 

TTTCTAAGACAAAAGTGAGTTTGAGGGTTGGTCATAATCTAGCACATTTCATCCCTTTTTCCCCTTCTTCTTCTTCTT 

AGACCCTGGTTAGGTCCCTGCTAAGGGTAGGCTGTAACCTACC^TATACACTAGAGAATTTTACTCT^ 

TGGGTCACTCGAGTTTAGAGGGTGGACCGCTGGGTCGGCCTGATGTCAAGTTC 

GTCTGTTGCTCCAGAAACGTACATTAAATTTCTACTGTTTCTGGAACAGGCGTTTCTGGAC^ 

TACACTAGAGAATTTTTACTCTTCCTCCATCTCAAAAACTTTTAAAAAATTT^ 

TCJAG^CGAAGGGTGGACCGCTGGCTCGGGCCAATGTCAGATTGCACCAGAAACGTCTGTTGCT 

ACAGGCGTTTCTGGACACACCAGCACCCAGAAAGGGTAGGCTGTAACCTACCATATACACTAGAG 

AAMTTTTTCTGGGTC^CTCGAGTTTAGAGGGTGGACCGCTGGCTAGGCCTGATGTCAAG^ 

CACCAGAAACATTGCGTACTACTCCCATG^CCAGAAATATCTAGATCCTTTCT 

TTTCTTGCACCATOTATTTCTAGGAATTTCGCTGACGTCAATAAACTGGTTTGTCATCCTGTTT 

CATCTCGTCCACACAGAAACACGATATGATATCATAGATTTCTGGGAAGAGGGTC 

AGAAACATCAGACAGACTTGATCTGTACAGCTAGCAGCAGCAGTAGCAGCAGCCyA 

GC AGTCTCAGGAGAGGGGAGAATCTCT GCAG ATCTACTCCTGTT GGAAC AACTCACCCCGGACGG AGACGTGAT AAGAT ACGACTCT GAGCAGTACACC A 
AACCTAGGAAGATCTTTGGTGACAAAAGTGTGATAGAGACTATTGGACATTTTCTCATCCACAACCACA 

TGTGTTGGAAAAATTCCCCGCTCTACT CAATT GC AT ATGG AATGG AG AGTTCGGAGGAATGGCTCT ATGGAAGGCATTGT ACAGGGCT AAAAAGT AT AG A 

CTTCTCAACTCACTTTTGGTCCACAAGATAAAAAATTGGCCTTCGG^TGCCGTGATCCCT 

TCATGAGTGAGATTATTGACAAGGAAACTCTTCAGAC(^TATGTAAGAGTGATATACGTTCTCTTCTTGGAATC 

AGGT AATTTTTTAC ACTTCTAT GCC CGCTCAACT AAACC GTTTGAAAATTTCCAAT^ AGCTGAAGCT 
ATTTATGATGGATTCAGAGACCATGGCTTAAACCCATCAGAATATACTTTTCCTGGTTTGGAATCTGCTGATGTGT 
CAATATCAGGAGATGATGACAATATGTTGTTGAACCTCATCTGCAACTATGGTGTATCTTATGAAAAAACTCG 
TTTTTTAAAAATG^CACAGCTTCAAAGTGTCTCAGTGTTTTAAAGTTTGTO 

GAAAAGGCCGAAACATGTGTTAATTGTCTTGATAGAAATAATGTGTTGACTAAAGGGTCTGAACAAGAATCT 

ATGTAAAATGTTTGAGGAATATTTGTATAGTATCACAAC^CCTAAGATGTGAAAAATGTCTAAAAA 

TAACTTGAATTGGTGGTTGACTATGCCGGCAGGTGCTGGAAATGAAGAAGAAATAT^^ 

TTGTCCCCTGTCTCAATTCCTCATTTCTTCAAAAATAGTAGACAGCGAAATCTTGATATGTTGTGCCCCTACACT 
AAGATCCAAAGAAAAACGWVGATGGAAACAG^GTGAGGGTCAACCAC^ 

TGTAGCCGTCAGGACATTTACAGCCATCAGAGAAAAGCAGAACAAGGAGGAAGAAGACGC^ 
AACGAAJ\AGAACAAGGAGGAAGAAGACGCGAGGATCAAGCCTGCACTCGACATGG 

* GGATCAAGCGTGCAGTCGACATGGCGGTTGCAGCCATCAACGAAAAGAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTG 
AGCCATCAACGAAAATAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTGCAGTCGACATGGCGGTTG^^ 
GACGCGAGGATCAAGCGTGCAGTCGACATGGCGGTTGCAGCCACCAACGAAAAGAACAAGAAGGA^ 
CTGTTGATATGAGGATTCAACGTATAGTCGACATGGCAATTGCAGCTGCCACTAAAAAGGACAAGAAAGA 

* GTTAAGGGCTGATCTGAGAAGGGCAATGGATATGGTGAACGAAGTACAGAAGAAACTTGAAGACATGGAACTAGAAAAGGGG 
AAGMTACTAGTAATGTTGTTAGTAGCAGTAGTGTTGTTGCCTATTCTAAAGAAATTGTACCTTGTT^AGGAAATAATAA 
CTAGCACCAACTATTCTGCCAACAATACTAAGAATAATGTATTTGGTTCACCTC^ 
AACTCCCAAAATCTCTTTCAATTTCTCGTTCAAGACATAAA 
CACACATCTGTGAACATTTCTTGCACACGTAACACTACGACTTTTCTTCAA 
CTACAGAGAGATGGTATATAAAGGCTTTCTATCTCCATCATTTATCACTAT^^ 
GAAGGCTTCTCGCTTTCAGGGGATAAGTATGATACGTATGAGGATATCTTACTTGAACA^^ 
AAAGTGAAATAGAGGATAAAACTTTAATCTTTCAACTTAAAGAAGGAGAAAAATTCCATCTTGCAAAGG 
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TAATTCTGCAACAATTGAACCTATTATTTCTCCAACGACATTCAATGACAGAAACGAATTACTAAACCACGAGGGAGATATATCCTCAAGTCCCCTATAT 
ACTCAGATAATGAAGCATATTTCACCAGAGCATGATATTTATGAATTGGACCTT 

ACGTTTCTAAACTGATGAAGAAAATATCGTATGGTACTTTAAATGTAGTTGATGTGTGCCACAGAAA^ 
TTCTTCGTCATTCTCAAAAAATGTGTGTATTATTCCTCTATTTTCTGCAG 

TGTGATGACGTAGAGAGATATATC^LACTCT^ATTTTTTCTACCCTGAAAATACTACTACTACTACTACTACT 

CAGATGAGGAAGAACAATCCCCAAAAACTATAAAGAGAAATGACAACGCAAGTAGAAACTGGTCTGGTGTCTGTTTGATTTTTGAAGTATTTA 

GTACTACATTATTAATAGAGGAGATAGAGGAGGTTCTTTTGAAAAGGCTGTGAAGAGTGCAATTTCTTCTATCAAGGAAA^ 

ATTAATGGTAATAAACCTCGATTGGTTATGGTGATAACTGGGTGTTATACAGAATTGTACTTCAAAGATGCACTTAAACAGArr 

AATTTTTGAAAATGAATGGGAATTACTTTTCTCTGATTGATGAACAAGCAGATCTAATCGAATTCGCGATGAGTGTITCTGGTGCCG 

TGTT AACGGT TTGGGGATGTTCC AGAACCGTAAAATGATACC TG T AATTGATCC TCTCAC ATATGAAAATGTTGTATGTGGTG AG C ATG AT ATACAAAAA 

GAAG ATGCTATTCT T TCTGTAAGGAGAGC TATTGC AG ACT AT AAT GACTTTGT AAGT AAG AACAAGAGAGGGJ^GAAACGCAGCGCAGAAGAAGAAAATG 

AAGATGAAGATGCAGACGCTAGCAGCAGCAGCAGCAGCAGTCCTCCTCCTTCTTCTCCTCCTGCACATAA 

AT GT ACACT C TGT T AATTT T TT CAAAC AAT AAACT AACC AC C TT G T AT ATTTTTG T T T GT TT GC AAC AC CCTTTTT ATG AT ACC ATT AAGACACGAT GGC 
GGTAAACTTGGATAATGTTCT^GTGAATATCAACAACAAGGATGAAGATCTTACAAAACTCGTATCCGAG^ 

GACAC TAAAAATCAAGC AGGGT TTGAC ATGAGACGTCAAG TTG AAGCTGC ATTAT AT GAAGCAAT ATCCAAAAAGAAAGAAAAGGCCAT AAAGGC ATTCG 

ATGAGCTCATACAAGAAAGAGGTGATGAAATTACACCTTTGACTACAATGCACTATGAAGAGTGGCTAAAC 

AAATTTATTAGGTGATGTTGAGCACGCCGATTTTTTACTGGACCGAATGACACCCGTAAGCGAGGAAG^ 

GAGGTATCAGATTCAAAAACTGCAACAGTCATAGTTAAGGCAGATTGTGAAACGGGGGATATCGATGAAGTGTATAATC 

CTCAAGAAATTAAAATATATAGGTCAAACAATTCCTCGGAATTGGATAATGTCGCAGATTCTTTC 

TGGCAATACTAAAAAATTGTTGTATGGGTTAAGGAATAAAAAAGCAGGTTATACGTGTTTGTGTAGAATTTTTGCAGAAATTGA^ 

GCCAATACAAATATCGGTGTCGCTGAAAACAACAGAGATGAAATTGATGAAAACGAAGAAGGTAA^ 

CAAAATTGATCATCTACTTCTTTTTAAATTGTTGGACAT/UUUUVATGATATTTTAT^ 

TGTAATTGTAATGACTGTATTAGCTGT AT ACACT GCACCTCAAATAAAGAAATCGAAGAAGAGAAAAATTGAAGATGAAA^ 

ACTTTGGAAGATTTTGTAAAGGGTCGGCTCCTTAACGCTGTCAAGGAAAAACCTC 

TAAAAACTGCCGAAGAAAC^GCCCTTCGAGATTTTGTTAT^GAGAACGACT(rreTC 

TGTCTTCAAATT/GGCAGGCGCAACAAGCGAAACGCTAACAAACACAATCATCGCAGAGGTACAA 

ATTAAAATGTTAAAACAATTCAGGGCTGCGAACAAAGATAATAAAGACG<^GAAGCAACTCCTGA^ 

TGGGGT^GTACTTCAACGAAGTACTAGAAAAAACAACAAATATTGTCATTAACAAAATATTCCC 

TGAAGATTTTGATACTGGTGTTGTGACTGATCAAGCCAT^C^GATACCCTCCAACAAATAC^ 

CCTGGTGACTGCTTGGATCTTGCAGTTTCAGTTGATAAAATAAACC^CGTCTTGAAAATTTCTGCA 

TTCCACGG*TTCTCTCCTGTAGGAAGTGTTTCTTCCATGATATTGGG^ 

C AGT ACACA T TTT C AATTCACAAT GG AT AAGCAACAT T C TGT A GGGTGT G AGTTGGAC AT GT T AATT TTT TCAG AAAGGAAC TTG AGGAAT TTACCC GAT 
TC AAAACCT AGACCTCT AAGTG ATGCAG ACATATTGGC CTC ATATGGGAAGCGTC T AGGAACTGGTGTTTTCACAAJCAGAAAATTTGGTAGACGATTAAA 
TAAAATTTATAACCACTTTATCTGTAGACTTTTATGTCCTTTGTGCAACAAAAATAAACAT^ 

TACTGGGGAAGAAAGGATTAAAGACATTGTAAATGCTCTAGATAATAATGGCGAGTGCTTGTCTTCCTATATTGATCCGATTA 
CGAAAAACGGC^GAAACTGTCCAAAAAATCAACCAAGAAGTTGATGAACGGTACGATAGAAAAAT^ 

TTACAAGTGCTCAGACTATGTATGACCAATATGCAATAGACACATTTCAAGAAGGAAAAGGAGCCAACGGGACTGGACCAGTC^ 

GGTTATCGATACAACTTTAAATAAAATGAGGGGAAATATGCTCGAATACGCTCAAGATATGTGGGACGGAGATC 

ACAACGCTTGAATTTGATCTAAGTTACTCTGATTTAACTATGATGCGTC 

TGGACGGTTCAAGAAAGAAAGAAGACCCAATTAATTGTATCATTTCAGTAACATATCCAAA^ 

AGTGAATTTTAACCTAGAAAGAGTAGACGACTATGAAAGAGATATCCATGTTTCAATTTTGTGCATGTTACA 

TTAGGAGAAAATGCAAACTCTTTTTATTTTAAAAAGGGGCAAAGAGTCATGTTCTTACCCAAGAAATCTAAACTGT 

CTGATATGTTTTCTATAATATTCCCACCTGCATCTGACCAAGATTTTGCAGATGATATTTATTATCGAATAATTGTAACATGTTCA 

GAAATAAAATACAGTAATAGATTCAAAATCTTTTATTATTTTTCCTTTCTAATC 

TTATAAAAATCAGGAATATACATAAACATGCTATTGTAAGTTGTATTCATATCCCTGGTGATCATCTCGTGCCAGATGCCCCTCAGCGAG 

CTGTAATCTTOKCCTCTGGAGTGCTTGTTTCTATGCCTCTTGAGCCCTCTTO 

ATTCTTTCTGGCCGAATTATACTTGTAATCAATGAGGTATTGCTTGAAAGGCCCAGTCAAG^ 

GCAACATTCTTAAAGGCGGCAGCAGCAGTCTTGGCAGACATTGAATTCGCTTCAATGCTTCCAAAAA 

TGCCACTATTTTCTCCCACGCCTCCGTCAACAATAAGGTTGCGA^ 

AACCATCTT AGTCT ACATGTCAGATCTCCAAT^GTTGTTG ATGTCGTT ATGTTGC C C AGAAAAGT TCCCTGGCAAACTAATTCTG AAGGTC TTGTAGCTT 
CCATAAGGAGTACTGTTATTAATGTGGTTAATGGGCATAGATTCACAGAATGGAATAGCCAGTGTAT 

CACCATTGAAGATTCGTATGTAGTGGAACCCATAGAAAAGATAATAAGATTGTCCACATCGACAAGTA 
CCCTGATCCTCCTCATCTTCATCCTGATCCTTC^TTCT^ 

ACTTGAGGATAGTGGACATGTTGAATGAGTTGX^ATGCATCCAGAGGAGTGTCTGCATAGAGAT^ 

AGAAACATCCATCTCTTCCTCAGATTGAGGATATTTGGTCACTGAAGCAGATGGTCTGAAGCTGTTGTCTGAAG^ 

TCGTCGAATTCGC7VCTCTGTTACATCCCCAAACACAACTTCTGAACACTGGGAAGTTGCACCATCATCATCCTCATCTTCTT 

TGCTG AT AAC ACTTT CCTTGTC ATCTTCATCCTCT ACT ACTGTT GCTGT ATCCTT ATCC TT ATC C TCCTC ATCCTCTTCCTCTTCTTCTTCAGCAGCAGC 
TACAACAGTGACAGATGAAGACGAAGAGGAAGAAGAACACACTGATGAGGCGTCGTCATCATCATCATCGGAGTCTTCT/TCTTCTTCCTCTGA 
CCCTCTTCCTTTTCCTCCTCTACATCACTGAGACCGATAGTTTTTCTAATAAGATC 
AATCGTAATC^CTTCCTCCTCGTTCTCATCTTCGTCTTCATCCTCACTGTCGCTCT 

ATTGTTCAGTGACTTGAGGTTACTCTCCCTCTTGGCCAACCACTTGTCGTTCTTGAAGCAACAACATCCATTTCTCATAGAT^ 

ACAATTTGGTTAGT^GTCTTGAAATGTGTTCCTGCCTTCTTGGTAGACGAJIGAA^ 

TATAGTTTCCTCCAAGTTCCTTAAACAGTGACT^ATCTCCATTACTTCCAAATATC^ 

AGAGAGCTTCAATGTGAAGAGAGGGCCTCCCTTATCAAGGGCCTTCTTGTTGAGTGTTTCAATACT 

AGGAATAGATTGGCCAGTTTTGTAGTATCAGTGGCGATTGCACT 

CAAATGCCGACTTCACCTTTGAGACACACCCATCACAAAAGGTGGAAGGGGAC^ 

CTGCGCCTTCTTATTCTTCTCCAATTCTTCGTCCTCATCTTCGTCATTCTCCAGA^ 

GGCACCTTGCC(y^AAGAGCGTCCAG?TTX^TTGAAGAGATTGCTGCTTCTTCTC^ 

AGGAGTTGGTTTCTTGGCAGAAGTAGTAGTAGTATCCTCAAATTCATAAACTTGATCTGGTTCCTCAGAG 

TCCTTGATTGACTCCATTTTTAGAATATTGGGTTATTAGGATTCA 

GTGATGTAGAGCCCTTGCCCGTGCCTCAATATATACCCCTCAGTAGACCTTGAT^ 

ACCAATGGAACAAGTC^GGATAATAATATACAGTATAAGCTTTTGGGGAACAACATTCATTTATTGTCATAGCGCT 
GATTCTGCATCCAGCTCATTTTGTTCAACCCACrTCATTTGC^GGGGAC 

TAGGCCCTGGGGCGTTGCTGGAGGAG^ACGCAGAAGAAGAAGAAGAACAGCTGCCGTTGAACGAGTC 

AATTGGACGAGGAGACTTCTTGTAATGAGAGAACCACAAAATTCTTCTTCCCGTCTC 

ACCACATTATCTTCGTTTAAGGGAACATATCTGACCGACTTTCTCATCAAGGGCTG^ 
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CGTTCACGTTATAGTTGCATATAGAACAGTGGGATGA^ 
AGGGACGTCCTGGTTTTTGTTTATTGTTGAACTAGGT 

T AT ATGCAGATTGTGAGTGC TGGAGAAAATGGTATC TCAAAATCGGT AGCTGGTT C AAT AAT TT CTGGTTC AACTTCCTGTTGTTCTTCTTCCTCATCTT 
TCTCTTCCTTATTTTTGACACCTAGAACTCTAAGGTTGGCAGTACTAATTATTCTGGCTGCAGCCGTGCTGGTAGC^GAAGGGG 
AACAGGAGACATGTTGACCGTGCCGTGTCGTGAGGATAAATGGCTAAAGGrATATACGGCATGCCCTTTT^ 
GGACCGTAGATGACCATCTGCCTTCCTCTCTTTTTCTGGAC 

CCTCTAAAGGGAACAAGTGTTTCCCCATAGGGGACAATTCAAAAATATAATTCCAACATATTA^ 
TATTGTTTTACTAAAATTTGGAAGGTGATTTTTTGATAGACTATCG<3ACATTATATCCGCT^ 
TCAAAGTTGTCGTGTGTTGACACCATACCATTCCATTTCCATTTGGCTAGTT^^ 
GCCATATTATTTTAGAAAGTACAGGCAAAACAACATCCCTCACGTCAATACTGTCCAT^ 

T AT ATCTTCGTT TGATGGGGC CAATCG AACTAGAGGAACTAAATCCCCAGTT ATGAGATC GGTG AAAATAAAGCTCTCGTCATC AACAATTTTTTTX5GAG 
CCTATGTACATCTCACACTTACTATAAAAGTCATTTTTATACCCTGGTTTCTTTTCCC^ 

TGGC ACCTACTCTTCCTTC C ATATTACTGAT AGCTCT AT ACAT ATTAACC ACGT ATTGT ATAGGT GCTTCTACGT AC AAATCTG AAAATCCACT AGC GGC 

AGTTCCTTCACTAAAGG^TAGAACTTGTTCCATAGTCTTCTC 

GTGCCAGGCTACTGGGACCCGAATTCGTTGAAATGTTGTTGTTGTTTTT^ 

CCTCCAGGAACAAATCACAGATGGCTCGTACTCCATCACATGTGCATTTTACAGTTTCAGGGAAATTTAATC 
TATTATTTTTCCCATTCGTAGGCGATTATTGACATTAAACCCACAACACTCGCCTCTGATTCCCATTATAAAG^ 
GCACTTGAACCCACGAGTCTAAGAGTTGTATATTCTCCATCGCAGTTATAGAGACATCTTGCCA^ 
CCATGAGTCTTGCTCCTCCATATCCAATTTCAGCGTTTATTGAAGTCCT^ 

TACATTCATAAAAGTGTAAAATTTAACGTCTCrrTACAGATTCCATAATTTTTTCCTCAAGTGTTAGC 

TCAAAGATAAAAGAGTGTTATTCAACTTACACTGTTTATTGGCCCAGCAATTCTTTTCTTTCATCAACTGCATAGTAGAA 

CAATTTGGAATCCAAATCCCTAGGTTTATCTAGTCCAGCTTCTATGTCCATACT^ 

GACATGTTAATC AT TTC AATT ACAGTCTGGAGGC ATGTTTCATGT CT ATTGACCAG ATCTTC AT C ATCTTCATCCCATGTC ACACCCACCAGAATATTTT 
CTCCTTTCTTGAGTTTG^TACATTTTCTCTAATCTTTTTCATTATCCGTTGTTTCTTGGACGCCACACT^ 

GC AATCC ACAACCAAGT ATTTTTTC ATGATTAATTTGTTGAGGAT TGTGTTCTTTGAATTGAAG AAAAAAATCAT TC GTTCTGT ACCATGGAGAC AACT A 
TGGATAATCTCGTTCAGAATAACGACGTAACMAACCAACACCAGATGTTGCTACTGT^ 

GGATCAACTTAAGGCCGAATGTCCTCAAGTACTGAGAGCACTAAAATTGTCCAATACTTTAAAGGCAAATTT^ 
CAACATTTAGTGGACATGACAAACGCTAAACACTTTAAGGACCCAAAGACAAAGAAGATTTTAGAACT 
AGGAAGAAACTAGTTXrTTCATCGAAACGGAAAAGAGGTAGTGGTGCTAGAAGTG^ 
CAATGATGCTCAAGGTGTATTCCGCCAGTTTGCAGATATCATCATTAATCTTCCCTC 

CT AAAGACCATATATGACT TGT AT AGACAGGACATGGAAAAGGTGGT GGAAX3 AAGTTTT AGGGCGCC AAGACCTGTTTGATCACAAGTCAGAAATAGCC A 
AAGGTTTGGCCCGTTTCGATACCCACGTCTCCTTGCTCCCTTCGGATAGGTCGGCTGTTCm^ 

GGGC CCG AACAGT AAT ATTT TTGACACACTAAAC ACACTC AAGGAAGAAATC AAAG AACTTTT AT GTCATCATX3TCAAAT ATTT ATTGCAAAATCTT AC A 

CCGGAGGATGCAAATTTTGTGTTCAATAGTTCTGTAAAGTATGTAAAGAAATCATATCAATACTACATACAAACATCA 

AGTCCCTTCTCACTGGACTCAACATTAAAATATTGGAGAAGATAATCTC 

CATTATTTCG^TTTACAAAAACTACGTXSAAACTAAACCTGTTTCAAAGGATTATCCGTTCAGAGTGGATA 

ACGGGTGGCATTTCTGATCTCCCTATCAAGCCGGTTACATTATTGCAGTTGGTGTCCT^ 

CCGACGGCTTTTTTAATGCAGCGTGCGTCCTAATTTCCCAGTGCCTAACGAATGCGA^ 

TAATGTTACXCGCCATAATCTTCTGAGTATGAAAATGCTTCAAGAAGGTTC^TCCAGTGAAAAGAAG^ 

AAGGGCGGTGGTGGrGGTGACGATTCTGATTCAGAAACAGATTCTTCCTCATC^TCATCATCTT 

AAGAAGAAGAAAAAGGAGAAGCAGTAGAAAAAGGCAAGAAAACTAAACGCAAAACAAAAA^ 

ACTG ATTCT AAAAACAGGAGGTT ACTTCC ACGACACGAGTG AACT CGGCAATAAAATTAGAAAT TT AAT AGACAAGGATGATTTTGCGGGCGT AGCCC AA 

TATGCAGTAACAATCACTGAGATGCAATCTACGCCAATGAATCAAAGATTAGTATCTAGTCTTTTAGATTTGATAATGAGA 

ATAGTGTTGATACTGAAAGTACTTCCTCCACCGCCAAATCTAATAATGCTTTAGATAGTGCTAAATTGACATCTCA^ 

TTCTGG AGCTGAATT GGCAAGGCTAGCTGCCTTCTTTTTCGTGGTGGTGGAT AAT ACTGTAGTT AATCGCCATG AAGCATTCATTCT AACATCAAAACTT 

TTACCCTCGAATGAAAATAGAGGACTTAAAACTGTTGTAGAGTCATTCTTTAAAAATTTAACAATTAGT 

TGTCGGTGATGCCGTTCGAAGACGAACAACAACAACAACAATGCCCTCAACATGAACAACAA^ 

AATGGGAAAATCAATAGTGAACTCATTCCCTTCCAATAAGAGTGTACAATTAACGGCTGACGCGTTCAAGCA 

AATTTGGCGG^AAGATAAAAACGGCTATATCTATCGGATCTAATATCTCG^CCAATATACTATTTTCTA^ 

TAACTGGTCTAAGGCTAACCAACCTATTGAAAAACATATCCCAAAGTGCCCAAGCTAATAATATAATCAAAAATGCCAAC^ 

GG AC C AACAAAA CTCAGCAGCC ATGTCCAT ACTACTCTTTCCTC CAAGATC AAAAGAAACTTCT ATC TTTCCCGGAAACGAT CCTTC ATCTATAAAATT A 
CAGGATATGACTACAATGTCAAATCTGC^ACGAGGATrTTATTCCATCGCAGAAGG^ 
TTAAAGCGTACACTTTACTAGTGGACTCAAATACTATGGACATGGCAGTTAATTTT^ 
TAATGCGAATATG^CCCTTCTAATGTATTAGAAGGAGGATCTTTTG^ 

ACCCCTCTAGCAAAATATACAATGAAAGATGTATCAAATCGATACTTGAAAAAATTCAACAATGATAAAAATACAC^ 

AAAGAGC ACTCGTC GAAC AGGTT ACAAAT AAGCCT AC AAGTGTTGTGC ATTCCC AGCT AGCAAACGCC ATGGGSGTAGCTGTTAT AG<^ 

CAAGCTCATGGAAGCAGAAGCAGCGGAATCT<y\AATGAGAGCAGCAAATTATCAAGCAACATCAAAATCTACAAATGCTATTAATATAACA 

GGAATGATACGTAATACTACTCACCTGTGTACGACCATCGCCGTGAGTGCAGCCGCCGACATGTCAAAGCTCGCCAACAACCATTTTATGAGTG 

ACACTGCAAATAACAGTCATAGCAGCAGACGAGGAGACAGATCGTCCCTGATGT^ 

AGGAAGAGG AGGAGG AGT AT TAGGATC AGGAACTGAACAGACAAAGG ATCATGTGGAACGT ATG AAGAGAGATTGGAT ATT AAACAT GATATCTCCAGAA 
GACAAAAATACTACTACTACTACACCCAGTAATGCCGGCCGGACATTAGGATATGGATCTAATAT^ 
GTATGATGGATAAACTTTCTGAGATGTCTAGCTTCAGAACTTAAAGTGCCAATTTTCATATAAAAATGTA.^ 
GTTTGTGGTTTTGGGTCGTTCACC^TCCMGATGTAAGAGTCCCCTTGTACCCAAATTATAAC 

CCACCCGTCTATCATTCAATATCCCAACTGAACCAACCAACCCCAGAACCAACAACATGTCTCTTGTTGAGAATAATA 
AACTACTGTTGAAGGTGTTGTGGAGGGAGCAGAAGTTGCCCCTAGAG^GTC 

GAAGATGATGAAGGAGGAGAGCAACCACAAACGAAACCCCCAAAGAAGAAACGTAATATCAATAGCG^AAAGTACT^GAA 
CATCTCCAGAAATGCTCAGTGCTCTAAATGATATTGATAACGTGTCCAAAACTATTCCTCTAATCG 
ATCTGAAGAACAGATCAAAACGCTACTCACTGAAACTATTGCCGTGGAATATGGAACAATCACAAATGTTAA 
ACCGGAGAGCCTCTTAAGAAGAAGCGCTCAAACAATGGAAATAACAACTAC^GATACTGGCAAATTCGAATTGA 

AAGC T GTTTTGG ACG CAATC GTTG AAGGAAAT GACACCGT GAT C AAGGC AATCC TC C T C C C TGAAGGGG A GG GAATC GG GCTTCAATTT AAC AAG AG TGT 
TAGCT CTC AACAAGCT AAAAAT ATTGTCC AGGCTGCCGATATTGAAT TT GGACAAGTTGC GC AC ATG AAGTGT AATTTGTTCCAC AAGATGG AGAAGGCC 
GATGAATCTTCTAATTCTTCTGGT GAATC ACCAAAGGTT AAG AAAGTAAGGAGG AAC AAGTCCC AGC CT AC AAATO 
GGGATTCTCTTCAAGAGCGTATTGATAACGCAATCAAAGTTATCGAAATGTCTC 

CACCACTACTACGACCACTACTTCCACTGGTGTTGTTAATCCACGAGGGATTAAGGATATTCACTTTTTCGATTCATCCATTTCTAA 
GTGAG^AATATTGTTGCAGCAAATGGTGAAGTTCCACAAGAAGAATTCGTTTCT'^^ 
CTACTTTCAAGAAGCTTATTCATCACCGTACAATGAATCGACACATTAAGGCATGGT^^ 
AGCTGATAAGGTTTCCGCTAAGG^AATTGCTACCTACAGAATATACGAGGACAGGACTG 

GCACAGGCAGCCGAGGCAACTGGTGCTATTCACAAGTCTATGCTCTTCCAGAGTCCTGGAACTGATATTCAAJVAGTTCCTCGATGCTAA 




WO 01/38351 PCT/US00/28888 

22/201 



GTTTGGAGCCTATTTCATCG<KCGAAATTGTGTACC^ 

CGAGAAGATGAACATTGCTGATGTTTTGTCTATTGTTCATACTGATGGACTTTTCAGTAGTGTACACT^ 
AAGAGCAAGAGTAAAATTATCCCCAAAACTATO^AGATTAAGAAGGGAGGAGATACTTTCCATAGCGA 
CAATCACTTCGGAGGAACTTAATAGGGAATGTAACACC^GGGAATGAACAGCCTCCGAGCAC^ 
TTCCACCACTTCCACTTCAACAACAGCAAATACTCCAAAGAAGACGAAGM^ 

GTTGATACTACATCATTTGTATTTTACAATATCACTAAGGAAATGGTGCAGAGAATTTTGGCTCAGGAGAG^ 
AAGAGAAGATGGAAATTGTAGAGGGAGAAGAAGCACAAGAAACTTATAGAGGAATCGTAAAGATC 

AACAT CTGG A GT ACTTTTC C CT GC T G AT AAGGTGTGT C T AAAGC AC AC ATT GGAAGATT T GGGGGAT GT AC TTGATTTT G ATGT T GT AAG GGAGGAT AAT 
GTTAATAAGACGGTCGCTTCTACTACTACTACATCATCTGAAAATAAGGCAAGCGGAGGAGATGAT^ 
AGAAGCTGTTGCACGAATTGTTGAATGAATAATGTTGTTGTGTTGTATTTTGTATATAGAGAAAAGAATAA 
ACATTTTCCTAAACTTGATATAATAAACTATATAGCAAATATTAATGCTATTGJICACATGT^ 

TGCCTTTTCTT AGGCATTTGAGGCAAATAGGAAAAGAGT ATT TACGATGGGT GACCGTCGTG AT AACGCCTAC ACCAGGTGACAGT AT AG AGGGTTTCAG 
CAGACTGCAAATACTCACCTAACAGGCCTTGGTOTAGAGGAATCAATAAAAAATCAACAAAAAC^ 

AC AGAAATGGGGTTAAAAAC AAAT ATCCGAGCTGCAATCTGACATTGGCTCCCTACCCAGAGGT CCACCCTCCGAACTTG AC AT TC AGTC GACCC AGCGG 
TCCACCCTCT AAACTCGATCGAGC TGAAAAATTTTTT GAAAAGTTTTTG AGATGGAGAATGAGTAAAAT T CTCTAGCGGAAACAGTAGGT T ACACACTAC 
CCTTTCTGGGTACGTTCATATCCAGAAACGCCTGTTCCAGAAACAGTAAAGAGT^^ 
TCGAGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTCGGTCGACCCAGCGGTCC^^ 

TTGAGGAGGAGAGAGGAGGTAAATTGATACAGATAGAG&TATCTTTAGTAGAGCTGCTGGTCCTTCTAGGAGCTG 

AAACAGTC^iAAAGTTAATGCACATTTCTGGAGCTATCCCATTT^ 

GTCGACCCAGCGCTCCACCCTCTAAACTCGAGTGACCCRGAAAAAATTTT^ 

GGTTACACACTACCCTTTCGGGGCGCAGCAAGTCTCC^GAAACGC^ 

GGTGTAATCTGACATCGAGTCGACCCAGAGGTCC^CCATCCGAACTTGACATTCGGTC 

TTTTTATAAAATTTTCGTCAAGGGAAAACAACACTAAGTTGCTTGTCAATTCATACACTAGTGCAGGTTGGTTG 
GATATCTGGATX^CCAGAATCCAGAAACGTTTCTATCCATTTCTGGAACAGCC^ 

GGTACCACTTCTGGCACATTCGTGCAATCTGACATTGGCCCTACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCG 

ACTCGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGAACGAGTAAAAO^CTCTAGCGAAAACA 

GGTCATGTCCAGAAAC^TCTGTTCCAGAAAGAGTCAAAAGTTAATGTA^ 

GCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACrrc^ 

CGAGTAAAATTCTCTAGCGAAAACAGAAGGTTACACCTACCCTT^CTGGGCGCAGCTATATCCAGAAACG 

ACGTTTCTGGAGCAACAGCGTGACAGTAGTTTCTACTAAGCCTTCACTTGTACACATTTCACT^AGTATAATTT 

ATAAAAATTTTTCTGGGTCGCTCGAGTT^AGAGGGTGGACCGCTCGCTCGGTCT^ 

TTGCACCAG AAGTTGTT GT TGC TCCAG AAATGGTGX3GGGAGT ATTC AGAGTGTGTl'TCT 

TGTATACATTTTCTGATCCGAGAAAGTAAACCAACTGAACGGTGTAAAACGTACTGAAACAT^ 

AACAT AT C C C GT AC C C AAC T ACCTC AAAC AACAACAACAAC AAC AACAAC CATGACATC C ATT AT GGA C G AT C TCT ACAT GAGC TGTT C T TC TT CTTCAT 
CGTGCTCAAGTTCATCTGAAGAGGAGAATGAGGTAGGAOTAGAAGGAGGAGGAGGCAGAATCGGAC 
AAAGAGATCTTCTGTTAAAAGTACATCATCTTCTTCTTCCTCATCATCATCATCAGACGATTCTGACTC 
TACGTGGATATTGXIAGATACAAGGAAACCGCCC^AAGTAAGAAAACTGGATA 

TGGTTGTAAGAAGGAAAAATCTCCCCAAGACATAGAAGCTGAAAAGGAAAAGGAGGAAGAGTATGA^ 

ATAAACACATCTGTAGATGATTGTGTTATACCTGATCGGATTTTAACCCTCTTTTtn'ACACTTT 

CTTTCCTCCGGTTGGTGATGAAGCAAGTAAJICGAGGCCATGAATTCAGC^^ 

GGGTGACACATCT AAAATT T CCTCCTTTAT AACACCTCJUU^CGGAT ACT AGTAATTCATCTTCAT CATC GACGTTCGTC AAT AATTGTAC AGATGAAGAT 

ATCAAGAAGCGTAACATTGCAATGGGAAGAGTAGCAGAATTX5CTGTCAAATAT/TGCA 

TAATGCGAGGACCAACTTGTGGAGGTTCTrAATGCATCCAAGAAGAAACTCAATA^ 

TAGAGAAATACAC^GTGTTATCGCGCTATATTTATCGTCCGTCTGTGTGCAGCGAGCAATGAATAATGACAACACAAATT 

ATGGTTACTAAAATCTTGAATATTATTGGTAAAATTCCTTATAATGAAATGACTAG&GAAA^ 

AGAATGTGATCACGGATATGACTGGCCCCJ^AACATAACJUiG 

TGATGTTCCCATTACCTCAGAT^ACTTT^AACTGGAAAGGCGACAAATTTAGTACAA^ 

CATAAAATGGCGTCTGTTTTCAATAGTAGTTATTCCGTATATAAAGTCCTAGATCT^GACCATAAAATGTTATTAAGGGCT 

TATCAGCTAGAAATAAGTGTCTCAGTGJLAAGAAAACCTAGAACATTAACTCAGAG«nCTATTTGTT 

TAGTGGTTTGACCAGTGAAGAGAGTTCTCTAGGAACAGCCGTAAAATTGGTGTCAC^ 

GGATGTAGCATGATTAGCGGAAACTTTGAGGATGAAGACGGTGTAACACTGAAATGTTTGGGAGCC^^ 

CATTGCTATCTGATAGGCTTAGAAAAAACATTAGAAGAAACGTCCCTTTCTACTGAAAAAATATGGAAACTACTAC 

GCTTTTCGTGGTGTTGATGGTGTACATCTTTATACAGTACACGTCATCAAATAAAAACAAGACAGAGGAAAGA 

ACTTCTTCTACTTCTGGTCCAACATTTAAGAGJVGAAGCCTCT 

TAACTGACGCTGAATTTATAGACCCTGAACTCTTTTCTGAAGCCJW^ 

CCTTTCGAGTATAAGTAGAGGCGTTGCACCGAGCC^GCATCATTCGGGCTCATTCAACTGTGCTGACAACAACT 

ATAATCAACTTTTCAAGATGTTTGGAAGTCCTGCTAATAACTTTAATGGTGAC 

GCAGTTAGGTCCCCTTGGACTGTCTACTGCTGATTTCAAGAAAGTTGCTGCCATTCTTGCC^ 

AACTTTAAGAATGTGATTAACAACCCTAACCAGATATCTATTGTGCCGTTTCTGGGTTCATCGAAGGCAGCTGAAAGTGGTAGT 

ACCAGGCTGAAAATTCTTC T AAAGGAGGAAGTGATGG AAAGAAATCATCG CAGCAGAACAAGTTT AACCT ATTGAACAAGGT AGAGGCTGAAGAAATGGC 

CTTTAAACGTGTC^GCTGAACTCATTGCGGATACTCCTCCCTCTAAAGATAATCCATTGAGGGATGACCCTGA 

AAATTGACTCAGAAGAATCTGGAATATCTTTTCTG^AAGCAGTAACAATTGAAGTCTCCAATGATAGGAGTA 

GTGAAGTGGGGGAGAATCCATTCCTAATGACCATCAGTGTTGACATTAGAATCCTGCAAAGAATGGCACTTAATG^ 

CCGCATGffTTTCrGGACTTGGAGTAGAAAACAGGGCCAATTCCACCTATGTAGCAA 

AACTTTATTTCTGGAGAAAATGTGCCCCAGGCACTGAAGTATTTGAAGGAACATTATG 

CCTATTTTGTTGTTGAATTTGAAAGAGTGGACAACACGATTGGATTTCTCAATTCTG 

TGACACAATCAGG^UUSAATGCATCATCTGGGCCAAGTGGAGGCGGAAAGAGTGGTGTCCTGTCTTCTGGAA^ 
AACAATAGTAGTGCAGCTGCAGCI^CTGCCCCTGCTGTTTCTGCTGGAGTTTCTCCATCA 

ATTGTAGTGGTGATGACGTGTGGGGGAAGAAGATGATATTCAACACATCAGGAGACGGATCAGGAGAATCTTCTGGGCAGAACGGTGGTGGTG^ 
TTACAAGAGATTTAGGTGTGGAGAAAATACTGCTTCTCTTTCTCAGAAGGAAAATGTTCGTCTCATGGCGATGCCAAAGGGAAATG 
CTCAAGAACATTATC^UICTTTCTGAATAGTGCACTTAACTCTGTTGJUUUVCCATGTAATGTGTACAGAT 
ATTATACGTCAAACAAGGAACTATACAAGGXrTATTGTTTGCTCCAACCCAGCCAAT^ 

CCGTCTTAGGAACCCAATTGTGAGTGACAT/TGAAACTGTACJUULATCTTCCTTCAAACAATGGATCGGTGAGAACA 

ACAGATATGCGCTATGACATCCCTCCATATGCAAAGGGAAAGATTCGTCTTAGTGCCAAGAGAGCTTGTGAGTGCAGAAAACTGTGCAAGGACGTGAGGT 
GC TTTG ACAAGTCT AGAGAGGCAAATCTC ACCCCT AGCC AGAAGGCAGGAAGGGAGGTGGAGGAACC TTTTCCCCGC AAC C AC AATTCTC ACAGG AGT AA 
CGCTCACGrACTTTACTTTCTATGACAAGTACAGGGCAAGGATGAACAAGCTGAAGAAGGAlTCAAAGAAGAAGGTAA 
ACAGACGATTTTCTCCTGCAAGATAGGAACGCTTTTGATCTACTTAGAAAGTGCTTCCTTTCTGCCTCTCTTCATCACATTT^ 
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TGG7GCATAGAGGAGACAGCTTCAATATTAACTTTGCA^ 
GGTCAACGCGAAGGAAGCACTTGAGGATATTACAAAAATTAAAATGAAGAGAG^ 

GAATTCTCTAAGAAGGTTAGTAAGATTGTGAGAAGGTTTAATGAGATCACAAACCAACTCTGCAACAACTGCAACCTTAACTCTT 
ATTTCCACGTCTTTACTTCTGTGTGTGTCTACATCCACAACATTATTCCTGTGCTC^ 

GCTTGTTAAGGAGTGTAGAGACGTGGCTGGAGAGGACAAGACATATGATGATArTATCCGCAATTACGAAATTACTGTAJUlG 

GCACTCGTTAAATTCTGTCACAGGAATTATAATGTGGCAGTAACCTCTGCCATTAACAGGAGAGGGTACATGTC 

ATTGTAAGCTGTCTGATAACGCTATCCACTATCACGAA 

TTCTGAAGATGGAGGAGGAAACTCTTCTTCAGAAAAGAGCAATGCAGATGTAGCCAAGACTATGGCCTCTTTCTATC 
AGCAAGAAAAATAAGAACAAAACTTCAAATGAGATCCTTATAAAAATGTTCCA 
GTGATAGTAGTAGCAGTGAGAATGAAGAGGAGGAGGAAGAGGAGGAAATTGTAAAGAAACCAGC^ 
GAAGACACTGCCAAAGGAACCTGCCGTTAAGAAGGTGAAGCAGGAAGAAGATGTGGAGATGGAGGAAGTGA^ 

GAGGAAC AGG AGGCG AAGG AGG AAG AC GCT ACTG A GT AT G AC GACGAT ACAG AAG AG G ACGAGAAAGCAGT AGC ATCTG ATG AAG ATG AAGATGAT G AAG 

ATTCTAAAGCTATTTTCTAAATATCATGTGTATAAAATGTAGTATTTTAAA 

ATGTACATCATCAATTTGACATGGTTATATTTCTTCTAG^TTGCT^ 

CATGATACCCCTCGATATATTATTAATAAGGGGAGTTGCTGCTATATTAACAGC^ 

ATTTTTATATAAAAATCAACGTCTACTATTGACCTGGCAACAGCCACTATACCTAG 

rTGATTTTTCTGAATTCTCTCGTATAGCJ^GCAATACGGCTAGCTTCATTATTTAA^ 

AGTTACTTGAGATACAATATTAIvrCCATTGAGATAAAAGTTCCTCCAA 

GAATCTCTTGTTGCAGTTTCGATGATGGAAATACAGTTGAGCCATTGCATGTCAGTAAGGATATG^ 

GAGAAAAGTTCAAGTCTTCTGGAATCTTGTTGAGGTTGATATTGTCCTTGGTGTGTT^ 

TGGGATTCTACTGTATTTCTTTAGTCTTTCTGAAATGTTTGTCTTTATACGGTC^ 

AGAGAAGAGTCTCTAATTTTTTGCATTTGT^CATTGTGAACCTTTA^^ 

GTTGTTGTTCTTGAGTAGTACTTCCAATGCTATTTAATTTGTCCA^ 

GCCGATGC<:CGCTGCAGCTTCTTCACTAGCTTCATCArrTATAAAAAGTCTC 

ATAGTAATCCC^TTTTGTTCAGCTGTTATCCTTTCCAGTArrTCTTCAATGGC^ 

ACACGTGCTCGCACACTTCTTTAATAATTTTCATCTCTTCGTTGCTGACAAGCATCATCATTCTAGATATAATA 
CTGAAGAAATAGTTCGATCTCTGTGAACGTAGGCTCGAAACGGGCAGC^ACATTA^ 
TGTTGGATCGCTCTTCTTGTTCCACCACCTACTATATTATTACTGCCTCTG^ 
CTGC AATTACTTCCAT*TTTTGAGTCT?CGCCTGAAATTGGTAC^ 

AAGAACTTTCCAAATAAATTCGAATCCAGTTCCTGCCACTGTGCCCACTTTACTGGGGAAAATATCGATC 

TT ATAGATCCTCATTGGGCTTAAATGTTGTGTCGATTTAGTT TTTGAGG CTTCAT AAAGC AC AGCAGACGAGAGAAGGGGTTCAAAG AAT ACAGTTTCC A 

rGCCGAATTCCTTTACCGAATATTTGACTTGTTTAGATGCATGAAC^ 

AATTTTCCCATCAACCGAAAAACTTCTCCGTACACCATCAACGAATGAAGAAGAAI^ 

CCAATTTCCATATCCTTTATACCATCATCGCGTTCAGTATTCTTTACAACTTTAAAGTACGACACCAA 

CTTCTGCGATATTCTGACCATATGTCACATTTCTTGTTAGAGATTTAGGTGGTACATAATTTAAGAACGAT^ 

AGTGGCAGTTGTTTGCTGCTGTTGTTGTTTTTGTTCCTTAGAGGATATTAGAGGTACAT^ 

CCTGTTGGTACAGGAAGATTAACTACCCTAATAGGGGTWCTGATGAATATCTGGAGGAGGATGATGGAGAAGA 
TTTCGGCTACATTATCTCCATTCAAAATACTCTGAGfiTGTATCGGTC^^ 

AGGGAAAGCAAGGCACACGCCTCCATTATCGGCTGCCGAATGAATAACTGCATAATGTGTAACAATCT^ 
AATAAAAATTTGTGTTGATCAGGATCGGCTGTATATTTTAGTAGCACTT<^ 
CCGATTTCAAAACGCAATCAAAAAATGGAAGAACMCAACTTGAAGTACC^ 
TCTCATAATATTTCTTGTCATTGTTTCTAAACACGACCGATAAAGACCCACC^ 

CTTCCGGCCAGTATAGGCTCCCCITGGGAATAGTATGGAATCGCAGTTAGATTCrrcTTCCAATATTGTATATG 

CC GT AAT AGC AGATCCGCC C GAGAAAAGTCCCACAAC AGCAT C AGAGAC ATTTGAGAATTCCCTCCCTTCT AGJVTT^ 

TGCTATTATAGCAGCAGAAGATTCCCCAGAATCG^TCTCTCAGAATTAATTTTAACG^ 

CCGAATGTAAGAAACCTTCCAGAATCGTGTATTTCCGACACGTTACTTTTTCTG^ 

AAAAAAGGTGCATTTCTATCCAGTCTGTTTTCTCCGATATACAGTC^ 

AATGGAAGTAGGTGCCCCTGAAGGGACAATGTTGGTATCGGTAGAAGTTTTTTGCCT 

GAAGAT ACAT CTCCT AAAG ATTTTCCTTT AGACCATAAAATCGATTGTGCGATTTTTTC CTTTTGTTCG^TCGTCAATG AGGCGGTTGAGGAGGAGGAGG 

AGGCTGCTGTTGCTGTTGCTGTTGGTTCATAAAGTTTGGGTTGAGGTTTATTTTTAAATGATAGTTGATAATAC^ 

GGTATTTTTTCCAAAA\AAG<IACTGGGCCGTAAAGTAGTGCATACCAAATTCA 

TTGGAGTTTGAAGCCATGTCATTTAAATCACCTAGCCrrGTAAGAGAGGGATGTTG^ 

ATGCTATCTGGAGGGGGGTGTTTATCCACACCTOCACATCCATTAGACGAAATATCAC^ 

AACAATATCAACTATGCCGTGATGGGCATAAAACACACCATAG^CGTCTTTCAAT^ 

AG ATGGGAT AT ATTGAC AAGTTCTTCC ATGCCGTCGTTTCCTTT AAGATC T AGCCGT ATTTC TT T CTCATCATGCATTGTCCAGTCC TT AAT AT T AATG A 
CAACAGGCAAAAGCCCTGCCAATCTCTCTAAAGACGAAGCAGTTTCTTCCATGTTCAGCTTG^^ 

acgagaataa<^tcctcctcctcctcgaggaaatttttgttct^ 

gattttg t agat atttcttc aatagtcatctcttt attcgtc tct ataat aaacacagacgggtgt att ttt acccttatttttgcattttctgtttcac 
ttcctaaacgcccxrtcggtgaggtttgcaaggctagaatcataacacgcaaa^ 

ttgttgttgttgttgctgtagcccttcagccaatajggcaagctaaacggctattgtcttcgttgttcttaccaggx^gg 

tgcgttgttgacataatttcctcctcgacgtttctcgtcagcctgtaattcgagtagagtttgt^ 

accjatccaccggttacgatgatggatatcaacaatatatagaatagggccttgtcactagccttgcagcat 

tttgtagtctgggcctaattcgctctgtacgtgci^cgtattctctovgacacgtccatgttgttgggtggataaagtataaatac 

aaaacctctcacatgatggcaggaatcggtagaagagataatagacccgtattacatctggacattgatccaaataaggaaataccctataacgttc 

caacccctattatctgtgaaaaaaaccccttcgtgtttaacatgcaga^ 

ccctccataccctgxjtactgcagtagaagaggaggagaagcaaaaggaaattgagga 

aagctgagagatatccccagaacctaccctctcgaatrrcccgagaagaaggagaaggatttcccttgcgttgacactaccggtc 

tcatcgatctggagaaaaccccaccccgtagtgacgttaggcacggttaccactac^ 

taagctcactgaaaagcaagaaaacctgaacaaattggtgttggatgttgatgacgttgtgat^ 

cgagctggcctgtgcaagttctcaaaaaactagacc^taaaggtatgaccaaaccgot 

ctaacaaacgcgaaaactaacaataggaaccatggcctcatcatcatcatccc^^ 

gaagaaaatacactgtccctcaggaacaggaatgtgaacaaaccaacacctgttagcg 

aaatgagaagacttgaagatttttcttcagatgaagaagacgatgataataaatcatgtcattc 

rccttcatgctttaac^gtttttcagctggcctgtgctcttttgtgaggggtttctttg 

ctracaagcgcagccx^tgctgctatttttaagacta^^gatgtggctaaaact 

ctggaggagatggagatagtggtattgctgctgatgttgtctctctcgcta 

ac tc atc aag act acaatt g aaaatcttgt ag atggtgg agaag aaaccac agaattgt aaaatgttttttggt atac^^ acat aattt a 

tataatattttattgtttcctcttaaatgacmtttccttatgatacttttaggtgaaggaataaaggtagg 
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TTTCCATACACATGATTCTCCCACATGTTArTATTGTTAGGTGGG;^ 

TGCTCCTGGTTAGGGTGTT7TTCTTGTGTTTCATCTCCTTCAATTCCTGGATACACTTCTGGATAGATGGAGA 

CTAGACATGTTCTTCCACATTATCATCCTCTCCCTCCTCATTCTTGATATTGAAGATGGGAGAAGGTGGGTTTGA^ 

ACATATGACACGTAATCACCGCCATTGCTTCTAGGTGGAGATGGCGGCGCTGTTGGCAGAGGAAGACTATTGTCAGAAGAA 

ACATTGGCATATAGACACTTTCTTCCTCTTTATTTGTACTTGTTGGTTGTTGT^GAGCT 

TGGACCAGAAAATGTTACCAGTTTTGTGACTTGAGGGGTAACATTGGCCAGATCGTAAGTGGT^ 

TCTGGAGTGTTGAACCGGACGTGGCTGATGAAACTGGATTCGGTGTTGCTCCCGTTCCTGTTCTTCTTATTCTTATTCTTC 

TGCAAGTAGTAGTTCTGTCGATGCACTTCACGAGCATGTTCATGGCCAGTGCGAGCCCAATAGTC^ 

GACGAGGATAATGTTGTCGTTGGTGATAAGATTGTCCATCTCGAGAGATGTTTCTTCCTCGACTACCAG^ 

TCc J I-rl-rATACCGTCACTTGGTTAGAAAAAACGATAGTGTAGTTCACAGTATTTGTCC 

CCAGATATAGGTGGATAGACCGAGATTCTTCCCTTGATGCAGTTTCTATTCCATCCAAGATATCTTCACC^ 

ATTGAACTCGTGAACGCTTGACTTGTTGATGATGCATAATTCACCTTGTTCGACTGTTATACATACT^ 

GCACGGTCCTCTTCAGGTAAGAAAACTGCAAATATTGTATTAGGCACAACATCTGGCACGTTTACAGAATGACA 

AAAAGTGAAAATCTATCATCCGACACTGGAGAAAGGTTGAAGGAAAATCTTTCCGCATATTCTTGAGTTCCCGCAA 

CTTTTTAGGATTATTTGTTGCCAGACTTACAGCATCGAAAATACTTTCTCCCAT^ 

CATTAATTGTGTTTCAGGGAGGCTATAAAACAAAACTCTTACGCATCTGTTCATTCAGTACT 

TCCAAAAGGCCGTTCTAGGCATGTT ATCCT AGGAAAT GTCG ACT ACACTTTCTGT ACTACTGAC AATAATTGTGTCAGT CTAGAC AT CGATTTC AAGGAC 

AATATCACAGACCAAAACATTCAGTTATTGAACAAGAAATTGGGTAAGAAAACAGCAAAGAAAAT 

GTGACGAAGACATATACGCCACCAAGGAATTCGAACAGACAATAAAAGGTCTACAGAGAAAAAAAG 

TGCCGCTGCCACTGCTGCTGCGGTAGAAAAGGCTATGCTATCAGAAAGTGAAGGAAAAT^ 

GACACGTCCCAGAAACAGTTCACTGCATTGAAGAACAGGGAATCTTTCrrCAGTG^ 

ATCCT^GCT^ACAAAATTGTGTGTGATTAAAGGCACGACTGATATTGCTGATATTCTACAGAAAAACA 

AGTGTCCACTTTTAATAGATTCCACTTGAACTTTATTCGACTCGGGAAATTCrTCGAAAGAAATTGCATCTCT^ 

GGTATGTTTTTCAAGCTGCGAGTGCCCGCAAAGCCTCTCTTGCCTGGAGAGACTATAGGGGAATACTAC 

ATCCCAATTTTAAAATGTCTGACTGGTTGAGGATAAAA 

TGATGTTTCTGTCGTATCCAAATTACTATTTGGTTTATTCCATTACTTTATGGACAA 

AAATACGGCATCCCGCATCTAGAATGGTACTTGTACATTGACATGTTGCTCCACTCCTACCCGTACGTCA 

TGGTGWCCAACAACATATTTCTGAAGTGGATAGGACATACTATGGAACAAAGAACAGTATGGACGCTGCCATGTCT 

AGAGAGTATCTCCTTCATTAAAAAAATAAGATCACAACAACTTTTTGGACArTTGTGTAAACCTTCAAAAGAAAC 

TCTTTCGATCCTATAAACCAAGACAGGTGGTGGAATAAAAATGACCAATATTACGGTAAA 

TTTCTGAAAATACAAACAGTATGATGAAT AGTCGC ATT7 C TTGTC AAGGGAAATGGTGGCTAGATGAAAACGAATACAAGGATAAAC TTGATCATAT TGT 

GGA TTTGT GTACAGAAGAAATAGTGGAGGAATGTGAATCAAAGGGTTTTATTGCCTCCCCAT^ 

TATGTTTTATTAGCGAGAGCCTGTAATCAAAAAAATGGTAACAAAATGAGTATTAACAATAATAGTA 

ATGCAAAACTACAGGAAAAACACCGTGTAACTTTAGCCAGGT TG AAC ACG ATGATGGCGTCGT AC CGATTTTTGAAT AATTAC ATCTCAACAG AC AT TGC 

ACCTGATTTTGCCAAGTTATTTGGTAATGATGTATATAGTTTATTACATTTAATGACAA^ 

AGGGCCCTTTCAAGTAATGAAAGTACATATAAAACACCTGGAAATGCATACrTTACT 

ATAAAGGTAACAATAGAAAACGTAAATTTTCTCGAATCGGACAAGAAAAGAGC 

TGAAATCATAAAGGGTATTTGTACAAGTTGCGATCAAAATAGTACCAGTTACATAGAGA^ 

TTTAAAGCAGCrTGCAACCCATCCGCGAGTGAAGCAAGAATTGGTAGATTCTTTATCCTC^ 

AGAACAATAGATGCACCCCTAGTGATTTTATAGATTATGTGTACAAATTCACTGACGAAACAACAGGTCCT 

TGATATTCTTGCATCCTTAGCAAGCAGGAGAGGGATGGAAGATCGTCCCACAGCCAACTATAGAACCTCCTTACA 

TTG^TAAACTATTAGTTTCTGCTATCAAGGAAACAGCAGCCACTG 

TTCTCTG TCAA CTTGTGGAAAGGAGGAACAAAGACAATAATGTCTTCGACT 

GTGAGGTCTTTGATGTACACTGTGAGACGGACAGAGATTGTGGCGCTGCTTGCGAGAACACGTACTCTGTTGACGGAAAGGA 

TAACCAACAGTCGGGAAGATGTGCCAGGAGTGTTTATAGTGCGTCTTCTCTAGAAAGAGCAGCCAAT^ 

AATCCJIAAATTGGAGGAAGAACTCCCTGAATCATTTTTGTGGTTTATCAATCACAATGGAGGAG^ 

CGATG<^TCTAAGCATAGGGAAACTGGATAATGTGGACACTCrTGCCCAGGGTTTAGATAAACGGATGGCCTCA 

GTTGGACTCTATACTTTTACAAATTGATAAAGTTAAATATGAAAAGGCAAAGAAATGGATATT^ 

AAAGAAGAAGAAGATGCGAAAAAAGAGGATCAATCTCTTAGCGTTAGTGAAATTGTGGATGTT^ 

GAGGGTTTATCCAGAAAAAAATATATCCTTTGTCAAGAAACGAATTGAGAGAATTAGCCCT^ 

TTTGAGTAGGCAACATGATGTATCTACGCGTGAAGATTTATGCAATGAAAGTATGAATGCAGGGAGGGCAGAATCCAT^ 

GAGTACGTGGCTACTTGTGCATGTCTTTACTCGGAATATTTAACAGGGCCT 

AAAGGACTGGAAGACCTGAATTTCTAACreATCCTGTACTTCATTTTAAAAAGGCAGAAGCT 

TAGTCCAGATAATAAAGGTTTCTTGTGTGCGCCTGTAGCTGAACTTGTAAAGACGGCATTAACTTTTAGGGG1TCACACG 

AGAGAT ATAAATC AAGCTGAAAATCTACC ATCCAATTC AT T TGG TGTAAACTGGC C CTATGTGAATCTCCT AAATCGCATTCAAGACCAGTACAC GT AAT 
TTGATAAAAAAAGATCGGGGAAAATGTCTGCATCTTTAATATTGGACGAATACCTCAAGAAGACTG 

GAAAATCAAGOTAGAAATCCAATCACCTGAGGAGGCTGCGGCTCTTTCTGTTGCTCTCTATGGAGCACCTCCAAAACCTT 

ATC ATCACTGGAGAAAG AACATCTT T AAACGAC AAAT ATCT ATCGGATAATGTCC TATTGAAAATGTCTGTT GCT CGCGTTGGAC AAGAAAATAATCGC A 

AGAGAGCCGACCAGGCAGCTGATGAAATTAGAACCATCATGGAAGATATTACAGGGAGTTTGTCCGGTGCGTACAG^ 

AAATAAGGTGCATATAGGCATCATGAATAACAAAACGCCTAGCATTGTTTGTGGATATT^ 

ACAGATTTTGAAAACCCCACTGTCATTGCCAATGTGACTAAGCGGATGGAGAGCATTTTTTCA 

TTGTTAATGOTGTTGCGAATAATATGGATATAAAGTCATCAATAGATTGGGC 

TTGTTCAGTTGACACTATTGTGTCCAGAGACGCAAGTGTAGTTAAAACAGCAGTT^ 

AC ACAGCTAACCTTT ATG AG CGAG ATTGAAAAAC TGCGAAAG GCTGCAGTTGT ATGTTTTGAGG CACTCATGTCC GATACTAGGG AGAGGGCATTCGTAG 
AGTTCCTATTTTACGTTAGCTTTAAGGAAGATGCATCAAATACCAATTCAAAATTGTTTGTTC^ 
GCCC^TAAAATTGGTACGCCGTTCTGCTGAGGAAACACTATTCGGGCTC^GTTTCATGTTTAAGGTAATGCC 
TTCCCTACCATTCCCCATTCAACACAATACCATGGTCTATATGCTACA^ 

CTCATTTTGAGGAAATTTTAAGCGAAAGAGCCAAT GC AGTG AAAAAATTTGGTGT AAACGAT AC GAGGATAG ATT GTCTAGATGC AGTAGCAAATCTCAC 
CGGACCTGTGTATGTTCTCATTTTAGATCTTGTACGTACTCTAAGTGCGCAGAGATCGTGTTCAACTAAATTTCTCCCT 
TTGTGGAATAGGTTTGTGTCATAATAAAAATGGCTCAGACATCAAAGATGGGAACTAACAAGAGGTGTTTTGAGGAGGAAGT 
ACCTTTCACAAAGAAATCTAAATCGGAACCACCCAGTTTTGAAGACAAGAGTTCATCCA 

ACCAAGGAAGAACAACTTCTAGAATTCGTCAAGGATCTGGAGCGGAGCGACCCCACTGTTCCTGATGAGAAGGTCAAGCAAGAAGTTGAAGA 

CTGAAGCTATTGCTGAAATTTTTTCAATGTTTGGGATCCCTCAAGAOIGCAA 

AATTGTTATCGATGCAATTAATCAGCCTGTGCGCAAGATGTTGGTTGACC^ 

GATAGCGACGAAAAACTGAGCGTCATTCTTAAGAGTAAGAAATCCCCCAAAGAATTTGACCTCTCCTTTTCCGAT 
TGGTTGGTGTAATTAAGAGGGTGGCCGGAGCTATTGAAAGTAAGGAATTGTTGCAGAGTAACAGCATGATCATGA 
CAAC^TTCCTTACAAC^TGAAGATTAATATTTGTGTGTTTTTGACTAACTTTATTTC 

GAGAAATTTGTAATGAGTCAGGTAACAAGATACATTTCAAAGGATTAGAAAAATAAAAGATGGTATAAAATTACTTGTATTTTTATTC 
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TATATTTGTACAATTCCCACTTTACTTCTTCTTGATTTCGTCCTTG^ 
GCGCCGTCTTCAAAI^rTGCTGCACACGTCAATGAGGGTAATT 

CCTTAATCTTTCCTGCAATAACAGTTGGCCGAACAGTAATGTCGTTGATCAAGAGATCGACGCCAGTGTTGTTGAGGATC^ 
CTCGGCCCCACTGCCAGCGGGATCAGTCCTTGGGGCTGTGACGGTAGAGATGACAAGAT^ 

TTCATGGCCACCTTTCCAAGAGGAGTATTGTAGGACCTCTCTCCACGAATTGACATTACCTTTGCCCTTCTTTGAA 
AATTAGCGACGACGCTTCTTCCAACACCTGTGrTGAATACAATCATTATAACCATGATAACGATTAGAGCAAT 
TGCAACGTCCAGGTTTGTTJiGGTTGCCAAATTCCATTTTT 
CTGTAGCACCATATACCCAGAAAGGGTTTGTTTT/TAAAATTTA 

CCTCCC^CTACCCCrACTTTCATCTCCATCTCTACCCCCTCCCTACrrCCTCCCrTCACCTCCTCCTCTCCCCCTTTACTCCT 
CGTCTaCTGAATTGGATGAAGAATrTGTAATATTTTGTGTGCrrCATTCTTTTCACTTGCTTTGCTAGAGAATTGACGCTAGA 
CATAAAAGTCCTCCGTCTAGAAGGTGTGGAAGATTCTTGGTCAATATACACACT^ 

GT T CTTTTCCT ACC AAC ATG AG AAGAT ACACT GCT AGGC ACGT C C ATTGCTGTT G CT GC T GC TGGGGTGGAGGG AGAAAC ATCTT GATT TTT AGAAGGAC 
GGTACATTAAGTTTCGCATAGGCACAACAGATGTAGATTTTCCGTGCOTACCCTTTTTATTGTATTTCTCAG^ 
GTACACTTCAATGGACAATTTAGCTTTCTTTAAAAGTTTGATCACTACTGCGCTTAGCAAGTGTTTCCTCTTGTCCAT^ 
GCATTATCAAAAATAGGGGATCTCTGTTCGATGATTGATGAGCCTGC^ 

TATTCTTGCTGTTATTGTTACTCACTCCAGAATACGCGTCACCGAAAAAGTGGTTTAATGATGTACGCTCA 

TACTCGACGAATTGCTTCCTTCTTGTCCTTCTCTGAAAACATGTTGCCGTCTTCACTTCTTTTCTCTGCCTCACTTCCATT 

TCGACTGCCTTATCAAATTGTATACTGTTTAAGACGTAAATGG^AGGGACAGCCATTTTAGAAGAACAA 

AATCTTTTGACAAAGTGAACTCTTTTGGCCAATTTAATAGTACAATTGGTATGTTCAAGAAAATTGTTAACCTTCCCA 

GTGTTTCTCTATGATTCTAGAAATTGCCCCGAAACAAATT7TCTGCCCCTAT 

TGTTTTAGTTTAGATGAGGTAGACAAATCTGATACATCATTTGTTTCAOT 

CCACTGATGCCTTGGTCACTATGTCCGAAAGCATATTGACTAATCTAGACCAAGAAACAGCACTTGATGTTTTAAAGGATATGA 
CAGTCTTTGACCAAGCTTATACGTTrTGACATGGTTGATTATCGTCGTCTTCTTTCAGTATATTTATCAGrrrc 

CT AT AAAATGAAGAATGCTCCACC ACCI CTCC TCCTCCTCCT CCTCCTCCTCCTCCT CCTC C AGC AAT AGAC TCGTCGCCTTCAGCACAT ACTCCACCAG 
CAGATGAAGACATGGAAGTTTCATCCTCGTrTAArrCAGATATACriTCAA 

TTCCCCCATTATATAAGTTGACGCGTATGCTGCTGAAAAAAGAGGGTTG^ACATGATTTGCTTCCTCTC 

AACAGCTCGG^AGAT^TTAATGTCTGTCACCGCAGCAGAATCTAAA 

TCTCGGATGGTTTTACTGGGTTTTCTGGATCTTGGAACACAACTTTCACCAGTT^ 

TGTTTTTAAGATTATAGACTCCATAGTXrraGTGTAATCTTTCTTCGGCCTCr 

GATGGATTGTTGTTGTTGTTTGAAAATAAGGACTGATTTGTAGGTGGA<5GTGGAGGGG<S 

GAGGTG<TTG<5TGGAGGTGGAGGAGGAGGAGGAGGC ATCAGCATC GAATCTTX^GT CCATG ATGTCTCTACTTTCTTCT CC C ATTOCATTGTCTGGAGGAAA 

AGTCACGATAGATGGAACAGGCGGTG^GCTATATCGTTTAGTTGGTCATATTGrTCCTCACCTCCACTACCAAGAATAGAAAA 

GGTGAAGGTTCCrTTTATCTGGGGGAGAATATCTGTCTCTTTCATGATT^GAGGGAA 

TTCGAACCCTGGGCGCATTGTTAAATTGATGCATAGTACTTATTTC 

TTCTTCAATATATTCCGTGTGAGATTGAACAGACTCTTTACTTTCTACA^ 

TCTT^TCCTCTTCTTCATCCTCTTCATTATCATTGCCGCCACCTGCTAT^ 

GGCTGGATTCJUVCTCTATGAGAGTGCACGCTTTCATTTTCTATTGCATCT 

CTCTTCCTCCTCCTGTTCTOCTTCCTCTTCCTCTTCCT^ 

TCTTCTTTGTTCTTCTCTTCT^CACCTGCTGATCCGCC^ 

CTGGCGAGGGGGrTGTTCACTATACTTTTCCACTGGTTTTCAGGTATAATTGTCGCATCT 
ATCTTCTTCCXCTTCACCTTCATTGTCATCTTCATATTCATCTCC 
TCATCTTCCTCTTCCTCCTCTTCCTCTTCTTTCTTCATCATC^ 
GAGGATCTTTATTTCTTCCATTTCTAACTCCCGTGTCCT 

ATCCTCTTCGTCTTCATACAAGTTACTATAATCAACATTATTATTGATATCGrrCCTCGTTGTCGATATCCAACACTCCCG^ 
TTTGATGCCGCTTCCArGATGAAATTTTAGTATACAATATTGTGTTAACCGTGGA 

T^ACCTCTATAGX^TAAT^CTCAG^TGTATATTCTATAGTTTGGTCT^GAACGTGCTTTTGGTTrTTGT^ 
AGAGCATTGACAGTTGAGT^GGAAGGGGTGGTTCAGAGTAAGACCAAAAATAAAATATTTTTTGAT^ 
AGGGTACTAGAAAACGTTGTCAGAACCAATCATGAGGTTTATCCCGATGTTGATAATTGCGCTTATCGCTGCATT 
AATTCAAATTACCTGGATCACAATATCAATAAGGAACTAAAT^PTGACCCG^TTCCCT^CAA 

AC AG AATTCTCCCTTCCAAGCTG AGTGTTTTAGAAAG AG GGTCAATC ATTTTAGC AG AAATGGAC AAGT ACAAAAATGC ACAACCTACAGTAAAT AATT C 

TCAAGAGAGX3AG*2AACATTTCTACTCCTCAACAACAACAACAGACCACCCC^ 

GCTGATTTATGTTCCTGTATGTGACGAGTCTCGCACCGACGAGAAAGAAGAT^ 

GGAGGAGAAAAGGTTGTAAATATCGTAGTAGGGAGTATTAAGATGATGAAACCTGGTTCAAAAATCAAM 

GTGCGATAGGGAGTAGAAAGAAAGCTGAAGAATATATTGAACGTCTATACAATTCCTTT^ 

GATTTATTGTCCATCCTACAACAATAAAAGGATAAAAAACAATCGTT^ 

GTGAGGGAGXyVAATACTACGGAAATCTCCCGCTTCITCTTCTTCTTCTTCAACGTCATCATCC^ 

CTAATTGTACTTCTAAAACTGTATGCGATTTTGTAAAACACATACAGTATGAAGAAGATA 

AAACTATGCTTCAAGATTTTCTAAAATTAATAGATTAACTACATGTATACAAGG^ 

ATCATTTATAAAATTCCATATT?TTGTATC(2ATTTCGTAGTCGACGGGTAGAGATG^ 

ATGGCTATCACATACATTTTCTTATTGTTTTCCCTATTGATG^ 

C^TTACTGTTGnTCTTAJ^CCACCTGAGCAAACATGGCTGTAGGGGATTATC^^^ 

AAArrTTATAAACTTTGrrCACAATATTATCTTTAATAATATATGCCGTGACAGACGrn'TATAGAAGATGCAAGCG 

TGTACAACCAATCTGGTTTCTTCGACTTTACAAGAAGCTAATCTCGTTACCACTGAAAAAGATAAACCTGTT 

AAATGATGGAAAAATATAGATCGGACTTGTCTCCTAAGAACGTGGGGGAATATATTTTACCTTCAGAAAAAGAAACAGACAAA 

AAAGGGTAAGAAGGTTGGTCTTTTAACTGCCCTGAGTAATGGTCATGACAGTAA 

AAGGACAAAAGTTATCTCTTTAAGAGATTGAGCAAAGATCCGCTCGTCTACT^ 

CAAAAAAATTCATGACATC^CACAGTTGGGGAGTAAGCTCGTGTATCCTCACCCTATACGGTATGGTACTG 

CAACAAAGCATACGGAATGXJATAATGAGGATCTACACACTTGGAACCCACCCTCTTCTTCTGTGCTCGTCCCAGACTC 

GAATGTGCTAAAACGGACCCAACACATAGAATCGGCATCTACGGCTTTGGAGGGTCT 

TGTTATCrTAATTGTGACAACCACAAGGACTTGCTTAAGGCTCCTCTAATTACAGAGTATTCTACAAATCCAACTG 
ACGTGTTTTATTCCCTGCCCCTGGrTCCGAGCCTGTAAAAT^ 

TCCCATGAGCCCGTGAAGCTATCAGATACGGGTCACTATGCAGTTGGATCACCCATTCT 

AAACAG<3ATCTCCTCTGGCATTGAACTGCCCCTX^ACCGACAAGGCTGATGGAATATATCAAGTCAATCAAA^ 
GGGGTATCTTAACGCCAACCCTGTGGAAGCTGCATCACTTTCTTCCTCGGA 

GAAGGAGAAAAAATAAAGAAAATTGTGTAATCATATGTGTAGrTTTATTACCACTAAATTTAT^GGTCTTCTTCGTACTCT 
TCTTCCTCCATCTCAAAAACTTTTCAAAAATTT/TTCTGGGTCGCTC^^ 

CGCTGGGTCGAGXX^UITGTCAGATTGCACCAGAAATGACAGAGCTCCAGAAACCTACATTAACTTT^ 
ACACGTCCAGAAAGGGTAGGGCATAGTCTACTGTTTTCACTAGGGAATTTTACTCTGCCTC^ 
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AGTTTAGAGGGTGGACCGCTGGGTCGGTCTAATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGAGCCAATGTCAGATTGCACCAGAAATGACAGAGCTCC 

AGAAACGTAC^TTAACTCTATACTGTTTCTGGAACArGTATTTCTG^GC^ 

ATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAftTTTTTCTGGGTC^ 

GGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACACCAGAGCTCCAGAAACGTACATTAJiCT^ 

GAGACTGACGCACCCAGAAAGGGAAGGGTATAACCTACTATTTACACTAGCGAATTTTACTCT^ 

TCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGTCTTGATGTCAAGTTCGGAGGGTGGACCTCT 

G AGCT AC AG AJ^CGT ACATTAACTTTTACGTGTTTCTGGAAT AGACGTTTCTGGTCATGG ACGC ACC CAGAAAGGGAAGGGTGT AACCT ACTGTTTTCGC 
TAGAGAATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTG 

TCGGAGGGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACGTTTGTTGCTCCAGAAAC 
TTTCTGGAGCACGTTAAAATAAGTAGGGAGAATAATACATAACATTATATTTTCAAWCCATT^ 

ACTTTCATTGTAGTATAAGAATCCTTACTGAAATTTTTCAAATCATCOiAAGAGAATACAATCTTTCCAGTATAACTTA 

TTTTCGTGTCCCTGGCGAGACGGTACTTATTATTGCTTCCCATCTTTAAATCTTCGCTTCTACTCCTCTTCATAGACCTGAATAAGGCT^ 

AGGGiGTCTTTCCTACAGGTTCGACAATAAGTCTTCCTATAGATGCGTTCCGTATCAACT 

CTCCCACTAGTTTTCCATATCTCTGATTCCGTTTGCATCTCTCTAAATCC^ 

TCTTCAACATGTTATCCACTAACJVTCTTGGATGAAAATGCAGAT 

TCTTCGTGCTCTAAC^GCATTCCTTAAAGCCaUUVAGCTTGCAAAGGATAATAATGAT 
GTAGGAAACTTCAAATCTCCCCATCTTGCCAATAGTAAATATACTGCTCTGTC^AAATTGT^ 
ACAGCTAGAAAGAAAGGGCAGCAAACCCTl'TAGATACATGGACATATTTTGCAGCTTCAAACACATACCCTTTC 
TTCTTCGTCCTCrrCTTTTTTGTTTCATTTTTCCTTCAAAACCTATATCACCACCTAACCCTCCCCCTC^ 

ACCATCAACTCCCCTAACCCATTGACATTTTCTGCTCCrTTTATTATTACCACCACCACCACCTAAAGCATCGAGTACTGCTTTCC 

CCTGTTCTCCCATAGACGGTTTACTTTGTTCCITTAAACTGATGGTAAAAGTGTCTAGAGATTTTGTTATGA^ 

TAAAGAATGCATATCTTTCAAAAACACTTGAAGAATTAACCTGCTTTTGACTGCAGAGCTATTAGTTTTC^^ 

GCATTCCATAGCACATACATTGAAAATCGGGCAGCTGAGTCGGGATCAAGAACGCGGGTAACAGAGTAGT^ 

GAAAATACTCCAAAGTTTCTGGTGAAACAATATCGTTAAACTTGTTACATCTCTTGTATTCACGTTGAAGAGCT 

CCCTAAAATTCCGTCTATCGCCTTTGGAAGATATGGGTGGTCCCTAAAGAGATTCTCAGCAGTCATATCCGTAGTTTrAATTC 

ACCCATTCTTTTTCCCTTTCTGTGACACTTTCGGATGGCATCTGCAGATGGSGATTX^ 

CGTTTCTGGTCCTGGACAGCACTGCCCTGTACTCTGCGTCGATGGCTGCTGATAATAGTACCATGATTTCCCTCCCXAAAAG 
TACCTTGGG^GGCTAGMTCCTTCCTGTATTACAGAAACCATGTTGGTTAAAACACTCGATATAGTTGG 
ACTGTAACTCCATCACTAAAGGACCCTTCAATAAAGTCTTTTATCCTTTTTTCAATCATGTCAGCAGCGCGTCCGTACTCTA 
CAAAGGGATCTAAACCCCTCTCTTTCATTCTAGAGGCGATAGATTGCTTCCATCT 

TTTTGCTCGTCCCATTGCCAGATCGGCCACCGTTGAATTGTTTCTGTTG<5CCAACATTTCTGAATCAATTACCCATGA^ 

GTGGAATAAAGAACTAGATTAGCTGCTGATCTAAACACACCCTCCATAATATTCTCAAAGGCATTCAAAAGGTTCAATAATG 

TAAAGTCACATAGTTCTTCCGCTGCAGTTTCGGTGAATCTCCAAGAGGGGTCAAT^ 

GACGGCATCTTT'AGATAGGTGTCCTTTGGCTAATCTT'AGTCCCGAATTAAAAAATTCTGTCGT^ 

ATAACAACCTTCCTTTGTTTTTCAGCTGTATTTTGT 

TTTCTGCTACTAATGCCAGAACATAGTCCTTTGACTCTGTCATGATTTTTGTGGTGCTGGTAAAGTAGTTTT^ 

AATGTCT TC GGTT AAAGGG AT ATT T AT TT GTTGGT ACTGTTT TCGTCAAGGCTGAAAAAT AATT T CAAAT AT ACAAT GG AGT ACATG G AAGAAGG AGACA 
TCGCTGAAAGGCGCTCAGAAGGTGTCGACTATATTCTGGACGAAAACTCTGCTTGTGTAGTTAATGTG 

CGCCGAGGAGGCACACTACGCACAGGACATTTCCGCCCAACTTGTCACCCATATTATCCGTCTGGCCCACTGCTCCGAATCCAACAAG^^ 
ATTGCCAGTATTGCGGGTCTTTTCATCAACAACATCTTTGACAACAATTCAACAAAGAAC 
ACAAGTCTAGCGTTCTCAATATCTTTGGCTCTCTAGATCCTCTGAGTATGCTTTC 
GGACAAATCTTTGGGTGTGCTCTTTGAGGTGCTTC^AAATTACAA 

GCCTGCACC GGTCTC AJVGGAAGCCATCCGCCAGGAAC AACCC AT GGAAGCAATGTTGTTGTTTT T CAAATGTATC AACCATAATAGGTTCAATT T TGGAA 
GCGACATAAAGTCAGCATACG^CTCTGAAACATGCATGAGATACTCTCAGGACGAACGCGCAGTCGTTGTGCCT<rr 

CGAC AGGGAT GATCC AGCTCAT ACT CTTTC CTCTTTCGGGGATACTATCGAGTATGCTGAT^CAG AT AAC GCTTGGGTTTCTTCCCTGTTTG 

TCTAGAATGCCTATGGTAGACAGAGCCGTCATrGCTCACrTTTACGTGTACACAATGCTC^ 

TTGTCTATACCGTATTTGTTCGTATGATTTACTCTGCGATTGAAATTTTGTTCTGTGATACTGAAAATTCGTCT 

GAGCTATGTTAATGCCATGGTTAACGTATCCGTGCTGGGTTCTACGTTTAACCTACTAAAAGCCTACCGTTG 

CCGGTTCTAGACATTATTTCCGGAGGATGGAAGAAGAACTACCCCTCACCTGACCACATCAA 

TTGCATCACCTTCTAGAATGGTTAAAGGTAACAACAAGGCTAGCAACGTTAC^TCTGGCCTGGATAGTATCAGGTCT 
CCCGTTTGGAATACTTGAAAATAAAGCAGGGTATGGTGTAATAAACATTGCCAAG 

TTTAATTGCAACGCTTTACACATTCTACCTTCAATTAAGG^CTGTGAGGCACTTGGGGCACAAAAGGGG 
ATTTTGTCGCATCTCATATGGATATTGCCATGAAAAAGCAGGGGTCGGGGJ^GATTCTTGGACT 

TTCATTCCCTAGTAGTGAAGCGGAATACAAGAAGAGAATCC^TGATTTCACAAGATACGTCATCTTCTCTTCAACACCCATCAACGACGAACTAGTCAA 

TCTCGCTGTATTCTTCCCCATTCTAATGTTCTGAACTCCCCTATCAGCTTGAGAAATATTGACCCAGAATCAGTCCCCGACACTCGAT^ 

TCATGATGTGGCAGCGGCCAAATATCGATGAACCTAATCTTTCTGCTCTAACTACAAGTCAGCTAGAA 

ACTCACCACCAGAGCGTTCTTCAATATCGACAGGATCAATTTCCAGATGGCAGACGCTATCATTAA 

AAAACTGCCTCTTCTTCTTCCTCAGCGCCTAACTTTTTCCAAATCTT 

GAGAATCTATGCAGCATGTACAAAAGGAATGGAGTAGTGCAGTAAACAATGGGAACAGAGGAGTAGAAAATTAT^ 

AGAACTGTTCGAGCTGCTCTACAAATTGATCATCGAGGAGGATATGCGGCCATCCAGCCTGATCGCCTCATCTGAATTCTTGAGCAA 

ATGGATGAACTTCTTATCAGAGCTAATGCTTCTTAGATAGGGTTTT^ATGAAATAAAACAA^ 

TTTGT ATGT GAT AC AAT ATTTT T AGTGTT ATG ACAGTGCAACCATT AT ATTATTATC TGT AATAGGT CT TGGTAAATTTTT AATG AAAGGAAGATT AGGT 

GCACAATCTATTGTCATAAAAATGTCCAGGTCAATTTCTTCAACACCTTCCTCZAAAGTTATAGAA 

AGTACGTTT^TTCCTTAGCTGTGCTTGCAGAGCTTATCCAATAGTCTGATG^ 

TGTTTTGTTTGTAGCGTATACCCGTACACGGTTACCTTCTACTACAAATGAAAACTCTATCCGGGCTTTTTCAATA^ 
TGTTTTTGTAGCACGGGTTCTAATCCAACTTGTAGAGAATCCAGATTCGTTGCTGCACTAAT 
ACATTCGTCCTTGCCAAGTGCGCATATCACGATCACTGATAAAACCTCCTCGTATATTATTTATC^ 
TGTAGGTTGAG^AGGTTGTTGTTGTCTAGCTAGTTGTGAAG^GGTGTTGTTGTCAACATC^ 

TCATCCCCATTTTCATCAGAAGGATCAACTGTTACTGTACCAAACAATAAAATATTGGTCAAGTCTATTACGAGTT^ 

TTTGTGCAACAACTTCATCCATCGTTTCCTTGACTTCGTCATGGTTCGTCCCTATGAGATCGAACAAGTTT 

GACGTTTATGTCACCTTC^GGGATTGAAACTTTTCCATCTGTTATTTTATCCTT^ 

GCTAATTTGGTGAAGAATTCATCCTCAGCTTTTTCAAGAGTTCTTGCAAAATGCTCTC 

TTCCTCCCCCTCCTCCTCCATTATTATTATTATCGTCGTCAGTTTTTTCATCATC 

TTTGGCATTTGAATGTAGACTTCCCTTCCCTACGCCCGAGATAGCAAAGAAGAGGGCA 

T AT ATTCCT CCATTGTTTG AAAGTGCC ATT TTTAAACCCTTC AATTCTT ATCT AAAAAAC AAAC AGATTG AAATCT ATGT AT AAAT CATC ACTCGCGTTT 

TCGATTTCAGGCATGTGAATTGTAATGAGAGAGGCCGAATCCTCGGATGGGTTGATTATGCCATTTGTATCATGACTAATATATATAGG 

TTAGATAATCACCTATGTTATCAGAAGAATCACTATTTCTTAGATTTGATAATA^ 

GCGGTTAAATTCTAACGTGAGAGTTTTGGCTGAAGJATCTATTTTGAACGATAAACTTAAATCAATTAAATC 
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GAT AACATGTGTTT ATCGTTT ATTGGT AC ACT AAATG AAGTTG AAGAGTCGGTAAAGGC AAT AGT AT TGTACTTG AAAG AAAATG AT GTATGAT AAAAAC 

OVGGTATATArorTCTCCACCGCTCCTCTTTTAAAATTTCAGTTATGAAATCTTCCCTGGGTTCCATGAT 

AACCACTGGTATTCCl^CTCAAAAATTGCTTCACAGATAACAGGATTGAA 

TTGATTTCTTtXCTrACTTGATCCTCTATATAT^ 

TTCTTGTGATCCAAGAATTGTCATGGTGAGCAGCTCCCAATTGAGCCTCCACTGAAGCGATTTCAGCT^ 
CATTATTTACTCCTCCTTAAAAGCAGTGATTGCAAA 

TTTAGTATATGGCTGTGTTTATATCTAGATTTACATAGAATATCAATATCTTT^ 
CTGTTCGATTAAATGCTTCAAAGCTTAGCACGTCATTAACTACGTCACTACGAGACACA^ 

CAATTTTGCCGTTAT TTTAAAATCTCCCAAGGGAATGGAGAAATCGTACCTTGTAAATTCTCC AT CGTCACCACC AATTC TC AAATTGT ACGTAGC AAC A 
CTAC^CCGCCCTACTTGACCAGTTTCTTGTTGAATTTTTTCGTCTATATCTGTGCAAGATATTGGGGGCCGATA 
TTCGTTTAGCCAAGTAATCAGTTATTATTTGTCTAACATCTATTCCTTGT J IT^ 
TCTCACATCCAGTTCTGCATCCTTCATGTAAACTCTGTCTCCATGTCT 

AATTCAGATAGATTAGAAACACCACGTCrTTTCTACAATTTTATCAAGGATGAGGATCAACCT 

TACTTGC TT TTGCTAT ACAAC ATC AAC AGGTGTAAAAAACATAC AJVAAGAAATAAT AA C 
TTTTATTGTTCTGGGGCGAAGATTATATACATTCAATCTTTTAACGTAGACCTTTCT^ 

TCCTGGAACTGGAACTGGGGCTCGGTAACTGGAGAAGTTGTGTTGGTGGTAGTGACAGATATTACCTCCTCTAT^TT^ 

GTTTTGGTTTTAATTGTACTGTAGAGGGAGTTTTCTCTGTTACC^ 

TCAAGTGATGCTCCACCCCGTGCTTTTCCCCCTTATATACAATTTCTCACGT 

CAAAGAACATGTTTTACACAGTTTGGGCACAGATAATCAGGGTTAAGTCTCAAACCCCATTGGTTCATTACAAT 

ATTTTTGAGAAGCATTAGGAAGAGCTAAATTGGTAGAGTGTCTAAGAGCATGCAACGGTTTATTCACAACC^^ 

AGAAACTTGATTCTCATCCGCAATATCCTCTGTATTTGCCCTCTTGTGGGATGAAGAGGACAAAAACACA 

TACACCATACAGGCTAACATTACAGCAACATAATCAACACCAGTTT^ 

GGAGAGCCATTTCAAACAACATGTGTAAATG^TGGCAAGAATACATCAAATCCC^ 

AGAACGAACTCCTATTAATTCCCCTTCCTTATCATATCCTACAACATCCATATCAGGATTTG<rrA^ 

GGCCTATTCTTTGTGGCAGGAGAGTGCATCTTGATGTTAAAATCTCTCCCAAAGAATCTTGTGGTCTTAACAi^GG 

TGAGCATGCT ATTTAGTTGAGCCC ATGCGC C TTGTCTAGAACGAGWWTATTCAT AGCC ATTCT ATTCAAAGATTTAGGATT 

AGGTGATGGTAATTTATTGCACATGGACTCGTAATTTGCTGGTTTCTTACGGCGCAATTGTTGTTCCTCAGCA 

GTGATAGTAGCAGAAGAAGAAGAAGATTCTCCGTAATTTGTTGCAGCCTTAGCACAAAGACTCATTGAAGT^ 

CCTCTAAACGGGAAACTGATTTTACGTCTCTCAATTCTAGGACCTTGTTTCTGATCAAGTTTTTCAT 

AATTCTATCCTTCTTTTTGTTGTCACTGGAATGCCCACTAAACTCATTCCATGCTTTCT 

TTTCCAAATTTGATACATCTTTCCATCATTAAAGATGAATGGATATACTTCM 

TCTCAAAGACAAATGTTGTATCACAAGGAAAAACAATAAGTCTTTTAGACATAGCTTCGTCGACA^ 

AGGAGACACAGTAATGCTGCTTTTTGCGCCCTGTTGGAAAATGCCTCTAATCATGACGCTGTTTTGGGGTGAAGTCATATTCTTTAAT 
AArTGGCCCGGAAGCATATGTTTGGTTATAGAACTGGACACATCATTCTTTTGGGGGATGGCCT^ 
TGTTATGTTTTTGGAGTGCAGACACCAATTGCCTTCTTTTAGGCGTATCGACATTGAA^ 
AGCAAAG^AAATTGTGAACTTTGTTGCCTATTTTCTCCAAAAACA 

GTATTTTTTGATGAATTAGAGGAAGAGTCGTCCTTTTCGTCAGAAAGCAAATTTTTATCCATACACATGCGJ^ 
TCTTATCCTTCTCAATACGCTTCTTTAAGCAAGAAAGGACATTTTCGCAGTC^AAGTG^ 

AAAACT AT ATTTPTGACGGTTTCTT AAC CAC ATCGAAT AAAGATGC GT C ATAACGTTCGT ACATGTTC ATAAAGTCC AGATCTCT ATCCACTAAT AGTTT T 
GCAGCTTCTGGGTC(TTTTATTATGGACGATAACTTGCGAGAAATAGTCTTGATC 

TCCAGTGTAAACGCAATGGTACGCAATGAATAGACCAGGGGCGGATGTATTACC^ 

TCCC CCAGGTATTTTTT AATTTCGAGC ATGTC CTGGCCAGAC AAAGATG ACGTG ATC ATC GGAAAAC CTTTC AGTGGC AGGGTTGCAGACTAAAACAG 

GCGAACCCTCGCGATCAAATGATGTATTCCATTTCAAAAGGCCATTGTTT 

CTGCTTATTTTTCTTCTTCTTGCTCATAAATTTCTCATTCCATTCATCT^ 

TCATCAATAATGGATTCAAATGATTCGTTATCGCATATTTTGGATCCAAATT^ 

TGCCTTCATATTCGATAGGTTTCTTACAGTGCCTGTAC^CAGCATCGGTTTCCT 

GCCTCTATGGAGCATAl^ATACACAATATTTTTCTCTTTAAAGGTAGTTGAAACAACACCA 

GAAATATTGATAGGATCCTTTATGCGGACACCATTTTCGTCATTTTTACCGTCGTTTCTCTCAA 

CACCTCTTTTACGTTTTTTTGTATCATTACAAGCATTTTTCCTATTCTTTTCGTGTTTGAGTTCAGAAATAAAATCGTACT 

TTTTGATTCGGTCATTAAACGGTTAATATTACTAAAAACGATCTTAGAGTTTCCTCCGTC^ 

ATTTCTCCCCCACTGCCGCCAAAATCGACATTACCACTTCTGGAAGCAGACTCCTCCTT^ 

GGTGGCCAACTTC^TCAArAATCACGATACCCTTTTTGCCAGCAAAGGAAGCCAAAT^ 

CGTAACGTTAATATTTGTAAACCTTTCGAACCAAGTCCTCTTACACAAATC^ 

GCAGTTTTTGTGGCAATATTCAAGGCAACGGAACAAAGAGT^TGAACCATGTGGTGGGCT^ 

CAAAGTTCATTAGGCCACGTAAAAGATACACGACATCCATTCTGGGCATGCCA G TT TT T T 

GGCTGTAGTTGTAGAATTAACATCATCCTTTTGCTTCCTCTTCTTATTACTACTACTACTACTACTACTATGTTTTGAAGA^ 

GAGG AGG AG GAGGAAGAAGC C G TT T CAC AAT C GCT AG TGGCTT T AGC AA GTTCGT AT AAAATT G AT AAAAAT AT AT AATC TT CAT C AGC AAT AAAAATTG 
ATGGAGGTTTTAGAGJ^AACTGAATGCTGTTTTTGATACAGGTTCGTACTTTACAACGCT^ 
TTCATTTTCGTCATCATCATCCTCATTTTCATCGGTTTCTTCCTGATC^ 
TTCATATAAGATACGTGTTCGATTAAACAAAGTCCTCTACCCATTGTAGTGC^ 

AAGAATTCAAAATAACGTGTTTTTCTGCCTTTAATTTACACCTTGTATACATGTCCGTGACCAAATTTATCCCT^ 

ACCATCAATCA AAAA CTACCTGTTATTATCAACGCTAATACAGGCAGTGTTTTCAGCTAACCTGAAGGGAATTGCrr 

GCATTTCCCATTTTTTCAACTTCAGCCATAATTTTAGAAATGAGAGCAGTTCCTGCATTTGTAATACAAAGACGTAT 

TCTOG-GCAGGACACACTGGATTAGCTAAAAATGAGCATTTGTATGCT^ 

TCTATCGTGGAACAACTCGCCTTGTTCTGACTCATTGTCAAACATTGAAGGAGCT 

TCGATCGCCTTCTTGTTATTCTTACCCTTTGATGATGAAAC^ 

ATAAGAAATCGCAACACTTCTTAAAGATTTTTGTTCCTCTAGGAGCGACAAATTTACCCTGCACA 
CTTA TCTGTTCCT<K;TAAAAAGTT AAAATT ATCCTTCTTAATGGCCAGACTAGATTCGTCCAGA 

ATCCTCTTAGCATCCATTGAATCTTCTTGCAGAAGGCAAGTTTTATCTGTACCACCTTGATCGTCATCATCATCTTCA 

TGCC^GCATATAGACCAGAATGGTCCrTATTGTAAT^GATGAC^CGAAGCA^ 

GATACTGACAATTTCAATTTCTGATGAAACAAAGTTG^ACATTTC™ 

GATTTGT TGAAAAAATCTCC TCCTCGTCTT AT ATTTGTCCCTGGTGT AAAC AACACTG AACAACTATT AGTG ATCCAAG AC GT AC AG AAA.TTACCCT CGG 

GATCTAAACATGAAGACAAAACTTCACG^GCGTAAACAGCCTTAGCCAAACTAGCTGCAGAAATTT^ 

AGATTTAGCAGAGAAAGTGCGGTGCAGATTCATAAGAGGGAAAGTGTCCTTCAGTAGTTGGCCATAGTCT^ 

GAAAGTGGAGGGTTTGAArCGGTAACGCCACATGGAGCATTATTCTTTGTCATTTC 

TACACAAAACGCTAAT<nX3ACGCTCGTACAATTTAATCCACAAAATGTTTAGATTTTTAAACAGCA^ 

CATACCATTGTTACACTTCCACATCTTACCCAAACTTTTATAGGTGTTGATA 
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AATG AAGATTGGTT AGAAGAGC C C AAGAT AACGT AC GC G T CT TT AG AAT CGT T AACAAT C TT AAT GT T TTCTAAT T TT T C AGTGG AAGT AGT T AG AGAAT 

CATCATAAAGAGACAATAAATCTTCATCATAACCCATAGGAACATTCTTTAAAGAGTCTTTTTTCTCC 

ATTACTACTGCTATTAGAGTTATTGTTAAAGTTCTTCCAGGGTCTCTTTCO 

TAACAAATTGGCATAACTTGAGACCAGTACGAGTCAGAAATATGATTGGAGAAATAGCG^ 

GGATAAAAGTTCTCAAGGGACGGCCAACTATTCTACCCTCATGGCCTGTATTTGCC^ 

TTTCTTGTAGAAGTTAAACCAAAACGAGCAATCACTTCCTTCATGTTTCAATT^ 

GAGAGAAGGTGACCTGATGAGGATGATCCACCCCCACACCCACCATTTCCTCCCTCGCCAACAGAAGACATGTAATCGCC 
CATAAACGCTTCCAGTATCAGTCAAATCGCGGCACTCTTTTCTTATCTTGTTAGTTTCTTCCTCAACTGCCTTATTC 

ACAATGTCATTATTT AT C TGATTTTGTGTGATCAT GTTCTTAATTCGGT T A\AGTACATAGCCATCGT ACGAAAAC AAGCC ACTTTCTCTGTC TTCTTCGT 

TTCTGGCGTCCATGTGTATCTGGCTGAAACCCGTGCGGGATCTCGAGTCCTCATCAACTACGGATGATG^ 

ACCCGCGTTATTAAAACCTCCTCCGCCAGATTTCTTTGATCGTCCCCT^ 

TTAATCTCTTCTAGAGAAGGATCGATAAATGCTTCTTCGTTAGCGTCCATTACTTTAGCTTGACCGGCAGGAATGTTCT 

TGATTCTAGAGGACTTTTCTACAGCAGCGAGTTGGGCAGTTGTACAGGAATTACTAAC^ 

GCGAGTCTTGGCGCGGAACTCGTCCAAAGATTTAACACTGACATCATTAACGTC 

TCTGCCGCTTTTACTAAAACAAGCCTTGCAATACGTTTCATTACCTTCCCAACACGTTTTTCATACATATCATTGCT 
AAGAGGAAACACTAACCAAATTCTTAACTAGTGAAAGGGGCCTAGAAAGAAGCCTGCTTCGCCGTCCCATTAGGACGTG 
TTGCAACCAATTACTAGTATCAAATCCATTAAAATCAAACGACATGAGAATCTTTCGGGTCATTTCTT 
ATAACATATGCGGCAAAACGGTAAAAGTGGAGAGGATCTTCTTCAATATCGTAAAATCTACCTTCTTTTATGCTAG^ 

CCTTGCCTGTACAGCTG C AC C ACTT GC TAGTGATTTT AAAGC ATCGAATGAGTGGTACC ATTTT ATCCAC AGATAGACGC AAAGG AGC AGTT GCAATTTC 
TCGCACACTTTCTCCCTGCATCATGATACGATAC ATAGCGAC CATTG CT AGCACACAAG AGAGG GACAT T T CTGTCT TGAGAAC CTC TTC TGTTGCAGCC 
ATCCACTCTGCCCGACTGCGCAAAGTAGACTTGTTTCGTGGATAATATTCGTCTTCAACCCAAATTTGATGCCT 
TCrTCAAT^CT^AATTACGTTGTTATTTGTTAGCAAGTTGCT^ 

GGCCTTHSGCGACCTGCGCrTTCCAGTCCTTATTGTCGTCATTCAAAAGACATTCCCACGTATCAGCATCCAA 

GCCCTCGCGCTAACGCCCAGAAATTCTGATAGCTCAAATACGGCGCCAGATTCGTCCACATTTGACAC^^ 

CCCAATCCTTAACTAAACTTCCCCAATCAATAGGAAGGCTGTTATTAAAGCACTCCCTAACAGTCTCTAGAATTGTC 

GGTTCCAGAArcGTATCTTGTTATCAAATTTGAAAAGAACTCAGAATCATTACTTGTTTTTCCATTCAAAGCT 

TTAAAGCACTCOTATATGTTAAGTTTTGAAACGTTATGGTCCTTGAAATAATTTGCTGCATCTA^ 

CAGAAAGCGATTGATT AAACACTTTAGTTGGTTTT AG AGG ATC AAC AAC TGT AGAACCCAGCAC AACAGACTCTAAATCT TGATTCGC AC AC AATGCCC A 
AATACCAGAAAAGAAACCATTATTAACACTCTTATCTGATACACCATCACCTC^ 

ATGAGTTTGTTArrCTTCACTGATCACATTTTCATAGATACCCAAAGGTAATGCTGAAGATAGACATTTTCCTGC^ 
ACCTCTTATTATCTAAAATAGACCTAATATTACCATTAACTTTGTCCTTTTTCACAAATCCTAAAGCTGCAGTA^ 
TGCCATTGTGTTGAAACGGAGAGCTACAGTCAGATCTTTCAGATGTTTAGAACTAGGAATTGGGAGTAGTTTTATGCAC 

TT ACT ACTAAGGGGAGAATCGCTC TTG AAGTTAT T AAG AG AAGCAC AAG AACAAAAGGGAGGGATATTGTTGTTGCTGTT AGC GGT TCCGTCTTCACTAT 
TGACATTAGGCTTGACAATTTCTCGTACAGGCATACTAAAAGGCACTACCCAATTC^ 
AGATTCAGCTGCAGAATTGTCCAATGTTAGTACATCACACAAATTTTTCTCCArrTCCT^ 
CTGGATCTAAAGTTGAATGTTTTGATGAAATTAACATCCCTAAACACAT 

CATCACTA^AACCAAATTCGCCTAGAAAACCATAACAATCATCTTCATCATCACCACC^TAATAATCACCTGTTGCACCTC 
CTCCTCATCGTCACTCTCTTCATCACTTTCTTCATCGCTTTCGTTACTGGCCACAT^^ 

CTTAGAT ATTCTTGTTTTT GTTGAT AGAGT TTCTCC CTCTTCTCCGACT TCTTGTCTGCCAAAAC AGCT T CCTTATC AG ACTTGACGCTGGCAAAAAGTC 

GTTGGA\AAGTCTCCATGGGGGCGCATTCATACATTGGTCCAAAAGACTTGTCAGATTCATCATA^ 

GACCATTGTGCTTTCATCATACTTAATTTTGCACAATTTGCTGTTTGT^ 

CAGAC AGATTGCTTTTCAAGCACACTTTTGGGTACAT AT ATGCT AAGTT C TCGGTTAATTGCGT CGC ACATGTGTCT AAACTTCTCAAAACCTCCTTCAA 

ATTC AAAATCTTCACGAAT TATTTGCTGAT AGTACTCTTCTGGAT GAGAAGTGATGTTGT ACAGCGC ACAAAACCGC TCAGC AAG AGT AACGACTGTCC A 

CGCGAATATGGTGTCGTTCTTGTCAGCACCCGTGGCGAATGTTGACGTGATGATGGACAAGAGCTTCTT 

TCAGATGACGAAGTGGATGrTGCACCTTCTTCTAGACTAGAAACCATATCTGGTGTTGTTGATGATGTTGATGCAGCA 

GACTTTGAGTAGGTGGAGGAGGAATACTCTCCAATTCTTCAATTGTAGGAGGTTTTAGTGATTCTT^ 

CTTTGATAGTGCATCCCATGTGTTATTCTTGAGTCTTTGTTGTAGAATGTCCGTATCTCTGGGCAATTTCAGGTTTCTOT 

TGAAGGATGGCGAAAGGCATGA1TGTGTCAAACTTGATGGTGTTAACGCCAGTCAAATTCCTGATAATGTTCTTT 

AATTCTTCTTGCT^TGGCTTTCTGGGGAAGAAGAAACTCCATTGCCAGAAG 

GAAACACACAATCTGGTCA\AGCTTATCATCGTCATTGArGATGCTATTATAGACATTGGCAGCAAGGGGA 

TTGATGTTGTTCTCGTCTCCAGAACCGCCGCCAGATTTCCCTCCAGATCCTCCTTCCTCATTATCTTCTT^ 

ATCCTTTACACGCCATCTTCTCCTTTAATAGGGTAG^ATTTGTCTTGGTCAATAACAAG^ 

ACAGAAATCTTGATCCATAATCAAACTTT/TGTTTCTGTAAGGCATTTCTGGCCCAAGT^ 

ATAGAACCCATGAGGTTTTCGAGATATAGACACTTTTTCTCGAAACTT^ 

GAGAAGTCACCATGTTTTCGTTGCTGATGGAGAGAATGTTCTTCTTTTCTCCCACC^ 

ATGTCCGGG AAGATTCC AGTTATC ACT CCCGCCAAT ATC A ATCTT GAAAC AGGCAACAGT TTTCTTGATCAT ACTTTCAACACGTTCG AC GATTCTGGGC 
ATACATTCACTCAGTAATTCAAGTTGTACCTTTAAGATTGACTTTAACAACACCACATTTTCTGTGTCTGAAACA 
TGGTTGGGTAGAATATTACCTTGAGCCCCATCAAAGAGGCATTGGTGATGA\AGGCAACCTTAAACACTGGCACGTCTGCT 
AAACTCTCTGAAGGCAGGAGTCACACATAGTTTCTCTGCACTAGAAGTTTCTTCCTCATAAAGCCTCATCATAATAA 

GTTCCAACGTGGXSGTCCTTCTTCATCCATGCGCCTATTTTCATTCTGGCAAGAACGCTCATTCAGCTTGTCCAGAAATTGATCACCAGTA^ 

CCTTTTTTACTTTGTACATGCCTTTCTTCTTGGATTTTGCAACATCCTTTAAAGTTGCAGGTTGA 

CACTGTGTCATATTCAACAACATTTACATCCAGCATACAG 

ATTTGTTTGACCAGAGGX3TTGCCATTCTTGGTCAAGTTATCAGTGACTAGAATTTGTTCCCCTTCGTTGAGAAGGAGAAGAGCT 
ACTCGGGCAGTTTGTTGGAGAGACAGATGCGTAGAGTCGTTGGTTCCTCTTTCATCTGGTTGGCACAT^TTArTAA 
AAACTTGCCTCCTCCTAGGTTGAGCACATTTGCCTTAAACATATTGA 
TGAATGTAGGATAAAATAACGTAACTTTTCTTGTGTCGTTGTTCACCA^ 

GACCGTCTGTATCCTACGGTATCCAGTGTCGGACTCTGAGGCAGAGAGCTTGTGAGGGCGra^ 

TATAGAACAGAAATCATGGACCAGTACCCAGAAGTGAGGGATACTCCTCAGACAGAACAGGAACAGGCGGCGGCGCAACAACAACAAGCA^ 

CAGCTGCCGCTGCTGCTGCCGCTGCTCCTACGCAGTACAGTAACACTGTTTCTGCAGAAACTTTATCCGCCATTTCTGAAGATGGAA 

AATCGCAGCTTCGTGCTGGATCAATAACCTTAACCCTGATGAAAAAATGGCTGAACGTGTCCAAT^ 

GAAAATGTGAAC CCTGGT AGTTCTGTTGTGTTTTTGAAGC CT AGAGC CC T C CCC ACCGGGGGC ACGTGTT T AGCCCCCAACT AC ATTGCAGT GC CT ACTC 
TTCGTGCTGCGTCAGAAATTATCGATTCTATTGCATCAACTAGTCTATAC(^ 

TAGCAAACATAGTCAATTCGGTGATCGTGTCATCAAGAGATCAATGATCCGTAATTGTTTTTCCAAACAAAAAAATGTAGAA 
CGC^GGAGAAAGGTTAACGCCGCCAAGGCATTTTCTCATGCAGTTCAACAGAAATCAGCAGTAA^ 
CAAACCTCGAAAAATTAGTAGATTTTTGCAAATTGAAGTACAGTCCCGATAGAAAATA^ 
ACAATCAGGTACTTCTTCTTCCTCAGTTGAACATACTTCTAATGATTTCT^^ 

GCAACAAACACATTTGATACTGCTCTTTCTAGGGTTTTTACCGAGTTTAAGGAACAGGCTAGAGCAGCCGTAGATGCAGCCG^ 
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CCGCCTCGGACCCTATTTTCTCTATTGTTCGACACAATA 

ATATTCTGTCGCAGAGTACTTGATGGCAGACAGAGACGAGTCTGCAGATATCGCTGCTA^ 

GGAGATAATAATAAGAGAAGACCAGATACTTCJ^TCGATGATCTCAAAGAATCACTAGCGGATTCTAT^ 

TTAATTCGGTCACAAATATTCCCACAGATACAGAGGAGTATGAATTTTCTCTC^ 

TTTATTGAGCTGTGCTTTCGGTGTACAGTTCCCCTTCTCAGATGAAGGTTTTGCGGCTATCGAAAGGATTATACGTA^ 

GTTTCAGAAATGGACCCTTCCTCTCTTTCAGATCAATATCTCTTGTTGGTAGGAAATTTCCAGGTTTCTC 

TTTTCGGTAGACAAGTTACGCCTAATACGCCTArTT^GAGTATCATTACT 

TAGGCTTC7TTGTTAACGATACCGTGTTGAGAGACGCCACGCAAAACGTCTCAACTTCCACCCCTT 

AAAAAGCCAATGCTCTCTGGTTGCCTACCTAT^TCAGGGGACCCCAGGTAGTGACCAGGGAAAGTGA 

ATATCTCTTTGGGGGTCTATTATGCCATGGGTTCATCGGCrGCAGCTATTGCCGCTGGCC^ 

CATGATGAAAAAATTCTCAAAGAAGGGAGGAAAGTATACAGAGGAGGAAAAGCGAATCAAGAAGGCTATGCCT 

TTGGCTTTGTTGGGGCAAACTGATGCCCACTATGGGTATGTCG 

CAAAGGAGGATGCTCTTAGCCGTGCAGAAATCTTGGCAGTTAGGAAACAATTGGACGGAAAATGTTCCTCTTCA^ 

ATATCTTAGAGACTCATTTTTCAGGTCAGTTAATAGGTCAGGAGGAGGATATGAAATGT^ 

AGTGACAACTCTGCTGCCAGGAACGCTTGGCAACAGTACGCAGAAGTAATGAGAGGACTTTCTAAGCA 

TCAGTGCTCrrAAATTCTTTCAAGTTCCCCCTTGTTCCAGAACAGGGGCGTAAAAAGACTGTAG^ 

TAAGATCATTAACGGCATGACAGAGATGACTCTCCAGTCAGCGATCGATGGTA 

ACTAC^GCTCAGCCGTCTCGCGTCAAAGCTCrrTAAAACATTGTCCAArrTCAGCGGAAACGGAAATGTTGTG 

GTGCCGCGGGAAGTCGAGGTGGGGAAACTCTCAAGTGTGTGGACATTCCTTCAGTCATTATTGCAAACCTAATCTCTGATAAGAGAATTCTAGA 

TTGCGGAGGAGGGATGAACCTCGCTCACGAAATCACAAACTTTATCGAGACGATTGCAGGTAAAGAACATAC^ 

AGATTGTCTGTCATTCTTTTGCGTTATATTTGGTTCAACGCAGCGGTTGTTTCCCT^ 

GCACTGGCGATGACATTTATAGGGACTATTTGGCCATCAGAGGCATGGTTAATAATTACAACAGTAGTCTCTCCTCTATTTCA 

TAGGTACAATTGCGGTAGTGGAAATACTTCAACCAGrAATAAGAATGTGACCATTAAGACTCAAGGTGAATTGTTAACTGTC 

GCCTTGTCCGCCTTTACCAACAAGGGAGGCGTGGGTGCAACCCCCGATGCTGCCAATATGGCCAACGTTATTC 

AGAACACCAACGTGGTTGri^CAGGGCTAGATAGGATCACTGAGACCATCAACTTCTTTTCATTTTTGTCTCA 

AGAGTATCTTAGGAGATATAGGCTAGGAGAAGGACTAGATAAGAAAGAATTGGATAATrTTGTGT^ 

GGGGTAAGTGGTTCCGCATTGTCCAGTAATCTCGATACTGATCGTCCAATTACT 

GTTATGCCTCTAACCGCTACGCTAAATTATTCAACAAAACAACAAGAACAGCAGCAGAACAAGCTCAGATGGAGC^ 

TACTATTCCT<^TTAGTAAACAGG?TGACCATCCCTGGATCCATC^ 

AGTAACGCAGAAACACACTTGGGGGTTATGGCTAACGEGATTAATGAAATGCAACCTCT 

TTAACGTGGGTTCAGTTAGTAAGCTGATTCAGAATGGGTTAACCGTATCTCTC^^ 

GCAAGATTTCGCTAAGCTTTTACTGGCAGTCACTGCAGAGACTTCTCTGGTTGTCTC^ 

TCTTCAGGTGGTCTTTTCAAAATTGATAGGGAAATGTTCGATAATATGAAGAC^GATTATGTAGT^ 

CCGCCATCGAAAGGTGCAATGATTCCGAT^CAGCTGCTAGGATTGCCAAGTCAGGTGAAATTTAT^ 

AACTTCTTCCTCCGCTTTAACCTTGTTCGCCAATAATCTCCAGAACCCTGCAAAGGTATGGTCTATGGGAGCTCTCCCCCATTTC 

cc aaaacttc atggcatttctc ac g atcaaatgttcc gcctatc t ac at att atc agggt attc ataagatggaactt aacagcgattgcaaaccagaag 

aatgggataattctcttcctggaaataggkjctagcaaattctttggcctttct^^ 

tttggcttcacctgcagagatttgcgatctggtgacgag<;gaaatggtaaagaccagtaacgatattgt 

gcgct^caaaagagccttcaagttggtgcctcagcagtagjuvaaatacgacgagtc 

ctctggctaagagcaaatctcctctatcttcttcctcatctttgtcgtctga^^ 

TCTTCTCGGTACCGCTAAAACTACATCCTATTCKTrTTCTGAAGACGCTCTCAATGCTC 

GCTACTAAATCTCTOTACGAAGTTGCAGCTGT T TOTAGTGTCATGAGTAAAGAGGAGGATGTGCGTTCT 

AAGAATCGCCCGTTATGCAAGACATTGGCATTGACCGCATTGCTACTCTTGTTACTACAGTTGCTACC 

AAACGATTACAAAAATTATCTCATTAGAAAAGTTGCATCGAATCCCCTTCTCTCTTCAAGATTGGGAGGAATATCCCCTACT 

AACCTTAAAGCTGTATATGATGGTGTTGTTTCTTCCTC^TCATCAATGACCCCCrCGTCCATC 

AGTGCCTAGAGACTGGCCCTTCAATGTTTGCCGATGCTGGTCATGGAGGTAGTAACAT^ 

CACCTACGCA GCTCT GAGCT C TGGC GTT GAGCGGCTG AGAGACT C T ATT T CTTCTGC G AC T C AGG AAAGAAAGAAT AGG ATTGC AAAGAGCATC GAAGC T 

CTGGAAACGTTCCTAACCGATGTGGTGGGGGGAGATACTTTGGATCAATTGC 

CTATCTATAGTGATTTCGGAAACATTGACTGCGCTAAAATCATGAAGAATGTGACGAGCAAG 

CTCTCTCTTGCACGAACTCGCTGGCCTGGTACACAAACAACAACCT^ 

ACTAATGACCTCAATAATATCCACGAGAAGGAAACATTCAGTCAATTGATGGCCGTGGCC 

GTCAGCGTCTAGTTGCTTCCGACGTAAGAATGTTCCTCG^CGGAACCATGCT 

CC AGGTTTCTGATAATATT AAAATGAACG AATTGAACGATGAAAC AAAGTCT CTTT^GGTT AAACTGGTAGG ATTTTGCGGTAC AGTTT CAGATGCGCTA 

GGATCTAGGCACGTGTtZTTCAATTAGACGTGTACAGAACGAAGAGGATAAGAAATTAGACAGGAGT^ 

TGAGGAAGAAGACTGAACTATACAGGGAAACTGATACTATTAACAAACTTTTCGGACATCAAAA 

TTCCTTGGTACATGACGCTGTT^CCGGCCCTAGGCCAAGAAGGTACAGCACCCTTGAGGATGTACTTGAGGCTC 

GTTTCTTACCCAGAGAGGGCAGCTGCTTCTAGGCGAGTGAAGAGGGC^ 

TC TTG AACGATATGAGGTCTTC GGCGGTCT CTTCCGAAATGATGG AT AT AGAGT ATGGT G AGGG AGGATTC ATGATGATGATT AGTGATG ATGAGGATGA 

TATTGCCTTTATTGATTCCGAAGAAGAGTCTGAATCATCTACTGATTTCTCCTCATCAGATGAATATTCCGATTCAT 

GATAATAATGGCCAGTCTCCTTATTCAACTACATCTTATTCGTATGATGCTCTAGACCGTCTGAATT^ 

GCAGGGGAGAAGGTGAAGACGATGAGGAAAATGACCTOTATGAAGAAGAACAAGAAAGGAGGAGACGTCGCTCATCAAA 

TCTTCATGAGAGTGATGATGACGACGATGACTACTTTGATGACGAATTTGATGGCGA^ 

ATTCAATATGGTCC^GGTTTCCTATCTCATTCTAATATTCTTAACCGTCCGGCTAAAGCACGCGCTTTCTTGACACGAGGCAA 

CGT ACGATAGAT TX^TTT ATGGAGG ATGACGATTXXCTCCTCTTC TCTGAC G AATCTACC ACTTCTTCTTCCT CTTCCGATTCTCCATTCTCCTCC TTCAG 

CAAGGGGAGAAAATGCAAGCGCCGAA<^UVGCGAGGACCAATGTGCCTTTGTTAAGAGAGTTGTACG 

GGTCGAGrFAGCATGATCACCCCAGTGACTAGTCAAAATACAGTAGGA 

AAGAATACAAAATTGTTAAGGGTGCTTCAGCTACCTTGCCCGACGAATACGTAGAGGGTAGAGCATCTA 

TCTTATCAAGGCAGCTGCTTATGTTGCCCGCACCCAAGAAAGTAACTTGAATATTATCTTTGACGCT^ 

GACCCATCTACTCTTTTGGGTGATACACTTTTGTTCGCCAAACAACTAGAGGCAATTACCG^ 

CTCCTTCACTTTTCACATCATTCGGTGATGCAAGTAAAGACACCCAAATGATGGCCGATGCCAAACAGA 

CTATCTAGGTGTCCCTCTCAGAACTCT^GCTTCATCTATTAAGGGCACTAATACAG 

ATGACCACAGCCGCTATTGTTTTTGCACGTTCATTCAACGATACTACTTTCCATGCACT 

ACAGCGCTTTCACCAACCTCGTCGGATCGGAACATTCTCAGCGTCTAAAAGTAAAGAGTA 

TGAAGCAGTCATGGGTCTCGTATACCCTACAGCGTOCATCA^ 

ATTCTTAGGGGAGTTAATTGTAGCCAATTGCCACGAAAGGATATTGGAGACACTGCTGGCCTGTTGACCTTTATTACATCAC 
GAGGAGAAAGGGGAGGTTTGTCTCTATACAGAATGTCCATTGTT^ 

AGGAAAGTGGCAGGATATGGGAGAGGAAATCTTCTACAAGAGGAGCAACGATCTGGTCGATTTTTGTTXIAA^ 
GGTCCTATTGCTAGGTTTGTTCCCAATGGAACTAACATCGCTGATATT^ 
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GGCTTAGGCGCGCAGAAGAGGGCGCTGGCGCAGCAGGAAAATTCATTACAGCCTCAGCCATGGGTA^ 

AACTGAAAAACTATACGACTCGTTCGTTCTGX:TCCAAGATTCAGACTCGTTCAATACACCAA^ 

AGGAAACATAAGGCTCTCAAAACACCATTCGGG^GAGATATTGCCACCTATAAGAACTTCCCATCCTC^ 

TGCGTAACTCTATTAGCACTATCGTGATGGACATTTCCAAGTCAAGGGGAATCAACTCGTTCTCATCCCGCAGTGGTTCTACTTTGGCCAAGATTTC 
ATCTGAATTTGAAAGGATACTAGAAACATCTGCTGTTCTTTCAAATACAAA^ 

CT AAAACAATTC AGCCATATTAGT AATGATGGACT TTCCGAG AC ACGCGCAGTCGTTGCC GT AATTGCTGAATCT TT AACCCCCGTGTATGCGGATGAC A 

CCAGCGAGAGAGGAGCATCTGTTAGTGAACTATTGACAGACAATACTCTCCTCAAATTTATTGTTCAAAATGAACTC 

TCACGTGACCGCCGCAATTGAAGGTTCATCCCAACTGCACGAAAAAATGTTGAGCCTGCTCGTTGCCTCAGCCGAC^ 

AACCTCGAATGT AAG AAATTGACTGAAGGAAATAGT AACTTTGT ACCAATG ACT AAC GAC CAAGGTGGTAC ATTC AT AAAGC AC AAAGAAAC AGGTATC T 

GGCTGAAGACCGATGAAGAAAATAACACCAGTTCTATCAAGGACAATGAT^ 

TGCAACC^TCCGTTCTCGTCTACAGTCTCTTTGCTTTGGAAAATATGCCATGA^ 

CTCATTGAAAAACTAGGCGAAGCACTCGCAGAGAAGGCATCTCCTTCTAGCTCGGCC^^ 

CTCCCAGTTCrTCCCCATCATCATCATCTTXCTCTTTCT^ 

CAAACAATCTAATGTCAATTCTTCTGACGCATCATCCTCATCACCATCATCATCTTCTTCTTCATCTGCCAATATTGAT 

GCTCT CCAACAACTTC AAAC AC AAGAATCTAACGATTTGAGTAACGT ACTTT CTGT T ACC AC CAAGCACAGATTTGCG7CTC AT AATCAAGCTGCAACTG 
TTGGCATCTTCAACGGAAGGCAACACGCAGAGACAGTTGTTGCTATACCAAATGCAAACAA 

T C TT ACCCGCTTCTC AGCC CCTGAAAATGTTTCCTCCACCAGCATGC^ ATC AAATGG AGATGATAAT AAGGTACCA 

GT AACTGTC AGGCTT AACC AGT ACGCCAACTCAATCTTATC ATC TATTGAAAACGC ATC AGAAT TT AAGGACTTGAAGGAAGCAGAAAGGAAAATCGAT C 
TGGCCATCCAGGCAGCTTCCACCACAGAAAOUVAGGAAATGGTCACCGTG 

T AAATC CCH^ AAGAAAAGTG CCCTCGAATT ATTGG AAAGAGTTATCAAGGCAGTCGAGGTTTAC ACCCCAGATT C ATCT ATTGCAGCCGTTTCTCTTCC C 
GTTAATGGAGATTCTATGGTTTCTTCTTCCTCGGGATCAGGATCTGCTCCCTCT^ 

ACTATTTC AACT ATGCT TACGGAAAATTG AAGAAC ATTGATG AAAAT AC^GAAG AAGGGGCAGAAAC TGTC C AG AAAAACATGGTCGAACAAGATGCTGC 

CGTTCGCATCCCTCTTCTAGTATCATATGCTCCATTCAG^GAAATGATGAGACGTGCTATTGACAAG 

AAAACAAAGATCGTGTCAGACACTAAACAGGCTTCCTCATGGGCGATCAAGGAA^ 

TTAATAACTTGCAACAAAACTTTTCAAACGAATCAGACAAGATAAAGATGGCTATTACT 

CAAAACTAGGAGCTTTATTGAAACTACAAAGAGCCGTATCGAGGCTGGTGGAGGAGATGTAGCAAACTT^^ 

GAAAATGACAACAATCTCTTCCAGAGCtTTGAAAGCATTCGCTGCTGATAACTCGGG^ACAGT^ 

AATCAGACAGTAAATTTGTCGACATGTAC^CAAACAGATTCTCGAGGGAGGAA 

AAAGGCTTTGGAGGCTTTCCCTAGAC^TCCAACGGTGCTTCAGAGCCTCTATCTTCTTC 

GAGATGTTTATGGCAATAATGATGGAGCGCACCGAGAGTCTGAGGAAGAGiMTGGCAGATTCGGCTG 

CTGTTAATACTGGTATGGTTAACATCAAGAGTGAAAGGCTCA(^GAGATTAGGAATCAAGCACAAATCGCT 

CGAGATTGTAGAGTCTCCTCTCACCCTCTCTTTCGGAGCACGAGTCGACCAGCTCTTGATCAAGGTA 

CAAC AGC AGC AAC AG C AGC AGC TTC C CAAATT GAC AGCT ACAGAACAGAG AAAGG AAC AACAAT ACGCTGC AGAT AGGGT T GTTT ACGATCC TTC AT AC A 
CCTGCTTCCTGCAACCTCTTCACGAGACAATTAAACGTATTTCTTCK 

TAGCGATGCCGACTTGCAACTGATGACCATCACTGACTTGTCTAGGTCTGTACTCGACTCTTCTTC^ 

CCCTCATCAATTGTTCCTGGACTCTGCCAGCAATGCGCAATGATGATCACCAACGTCCACGAAGCCACrCATACTTCTCCTCA 

ACAAAAGATCCCTGAAGCAGCTGACAGAAATGTTGAACGCTGCCACTTCATCCAGTGACGGTCCTGCCGTGAGACAC 

CAACAATGGTTACGTCAAAGATTTTGGATTCACTCACCGCCAAAAGGTT^ 

GTTGCACCT AAT ACTGTT ATCCTTCCTACT TCTGAGTTGT TT AACTGCCCAGGAGTTG AAAATG AC AAATTT AGATCC AT GGTT AACAGGACAACCGAC A 

AGAATGTGGCTGACGCACCCAAGTCATCTGCAAGCATCGTGGAGACTCTTGCTCGCACGTCTCCCAACGCCGA 

G^GGCGACACTTCAACTCCATCACCGACGCCATCATTTCTGGTATGAGCGGCGAATCTTCA^ 

ATTGATCAAACTACTGGCTTCCC^GTGTTTACAGGAAGAAAGCAGGGCGAAAGAAGGATTGTGCAC^ 

AGAACAGTGGCATCCCTTCATGTACAAAGGACCGTCAAACTTATATCGATATGGGCAGAAAATTCATG^^ 

GGAAGAAACTCTCCGTCTAAACAGGCTTTCAGACATTAACAACGTGAGACATTATGGCACTGATGTTCATGTGGCAGGCGCAA^ 

G^TGAGGTGGTGAGAGCCGCCTCCTCATTCCCTGACGGAGATAAGGAATCGGCTATGAAAAAGATGCTTCriTCT 

AAAAATCTGCCAGTCACATTAACGATCCTACTGCTTTGTTGAGCACCAACACGTCTATCCAGAATC^ 

CTCTAATTACTTGGGGTCTGCTGAATCTACGTTCGCCACTCAACTCGCCTACCGC^ 

TCAAATATCTGCCCTATGGATTTGATGGX5AAGTACAAAGCGCTACAATGACGCTTT 

GGTCAAATTGTGAAAATCTACTGAAATCTGCCATGTCTAACGTTCCTGCTATCAACACTGCCTT 

TAGGCrTTTCTCCCCTTTATGAAGACAGCACCAAATATTCGTCCAACC^ 

TCTACTGTTtrrCGGTCGCCAGAATGGCAGAAArACTCTTCTTTCTCTCCCTACTTCTATTACTTCTATCGCA 

CTTCGGACATG AAAT CT AACCT CATC AAGACAATTTCCCGCATC AAT AGAGACGCT AGC CTC C TGTC AATGGGAGAC AGCCAAGTAGCTGCAGGrTC TTC 

CTTCTTTAACTCTTTCCTTCGTTCTTCTTCCATCCCTGTCACCACCAGC^ 

AAGACTGT<^AGATCAACAAGAGATTCX!^GATGCTTGGAGGA 

CTATTCTCCAGATGAACGAAAATGAACTCGCTCGTGACTTGTGCGAAATTGAAAATAAAATTGAGACTA 
GAGGTCTATGCTGATGACTCCAGGAGGCGTGGGAGCCATTTCTTCTGGAGCAAGTACCAACAATGTTCCCCTTT 
TCCAGCGGTCTTCTGATGAACy^CAACAGTGCCAATGTAATGGAAGCTCTGGATAGTTTCAATACTACTC 
GTGGAAAGTCCCCCGTTCCCATGGCCAAGGAAATTAGGAGCATGCTGACCCAACCAAGAGCTCTCACCG^^ 

TCTCACTGAAAT CT GCCTCT AC AAC ACCCGCGACACTOLACC AGAAAGGGC AGTCGACAGAC TACTAACTTCAGC CT ATCTAGT AAAACAAGCT AAAAGA 
TTCGACGGAG7TTGACCCAGCCTTCCCTGCCGCCCTCACCTGCGCTTCTCACCTCAT6CTTTCTTCCATGGATT 
ACATCAAATTG^ACATGACTGATACTCAATGCTTCTTCAAGAACATTGAACGATTTGAGAAATTCTTGGG^ 
CAAGCAAAATTGTAACTGCCCCTTCCATCTCCACCACACTTTTACTCCt:TCAGATAACGAGCATCTGGTAT^ 

TCCATGGAAGAAATTAGAGCCACACCCTATC^GGCCAACAAGCTTATTAGTGACAAACATTACGTGATGAACArGTCCAAGATCG^ 
GATCTTCCCTCCTTAAGAAGGTTAGCGAATGGACTGAAATGAGAATGAACTCCAACTTTAATGGAACAT^ 
TGGCATGACAACGGCAGGAGTCAACCTCGACGTTATTGTCAAACCAAATAATGCAAGAAGT^ 
ACCGCCGACGCCAAGGGAACTGTCGCTTCAGCCATGCCAGCCGTCTTCCAGGCAACCG^ 

CAAGGAACAGAT AC ATC C AAAAGAGC ACAATGAACGCTCAAACTGTCGTGTTTGC T AATGTTTTGGAACAACT TATC GCC GATCTTGGAAAGGTT ATCGT 
GAACGAACTGXSCCGGCACCATCGCTGAATCTGTACCAGAAAGCGTATATGAAAACACCAAGGAAATGATTGATAGACTAGGCTC 
TCTAATAATAATGGAGGAGTAGAATC^TGGATTATGAAGATAGCGAAACAACATCCAACAATGGTCCCCT 
TCTATCACACACTAATTTCCGGCAAGGCAGCTCGCCCGGAAAATGTACC^ 

GGGAGATACATTCGAAGAAAAGAACGCCGAACAAGGTGCAGCAGCT^CGTATCCTCTACCTATTCTTCCTCT^ 

GCTCGAGTTTTCGAAGCCATCTCTAAGCAAGTAACTGATGCTGAATTCMGGATATCC 

GTCCACCAAATACTAACCAAAATGCCTTTGCTCTAGCTATCAAGAGAGAATTCAGCAGAAT^ 

CGCATTAGTCGACCCTAAAGGCGCGITACACGAGAAAGTAGCCATCTATTTGACCCTTCTTTCA 

CTCAGTAATTCGTCCTCAGTTGATCTTAGCCATCTAAAACCCATTAATTGTAGCAACA^ 

ACCCTATTCTTATTATGGCCCTCCCAGAAAATTTCACAGCTCTCTTG^CAACAGGAACAAATGGA 

CACCTTCCTTAATCATCCCAACACTGCTTCAATGGCGAACGGTGCAAGAGCCGCTGTGGGTG 

CAC^TTCTTCACGAGTCTACCGTCACAACATCAAACCCCGTCACAGACACCACAGAAAACGTCAACTATCATTCCTCTG^ 
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TAGTGAACCCCTTCAAGGATTCTGCTAGCTTCATCGTTAACAACA 

CATGCCATCTGAATCATCTGGCCTCGTCACCMTACTGGATGCTCTTCTTCTTCTTCCTCATCTTCGTCTGATACCrrCAAGTA 

ACGCCTGTGAATCTTCCCCGTGTCACACCAGCCGTTCTCTGTTCTGATG^ 

ACATGAACGTGAGATTTGGTTTCATGCCCGAGATTATTGCTGCCGTCTCCAAAT^ 

GAAC AACAT C AACAACAAC AGC AAC AACAACAAC GGAAAT GGG AAGAAAACAACCGTCGATCCAGTCACTGGGG AT ATTGTT AT C ACC AAT GCCA CATTC 
CCCGACACTCGTCCTCTATACACTGCAGCAAATGGAGGAACATCATC^TTCAAATGGGGAGATATCAA 

CCTTCTTTATTGGTAACCCAACCGCCGCCGCAACAGCTAACGGAGTGCCTCTTACATCTGAGGGAATTTCCCTCACTGAAGAAA 

AGGCATCTCTGAAGGATCAATTGGCACGGGGGCTCTGCGTGCAGCCGCCAACACCCGCCTCTCATCCGACA 

ATTGTTCAGCTTCAACAAACATTCAAGAAAGCTTCAGATAAACTCA 

TT ATT AAC AAGATGC AC GAC AGCTTCAAGACATTGGAGG AATGTCGT AGGGTGATCCAAGAC GAGGCTGCTCTGCTCGTTGC C ACC AGCGATCTTTTGAC 

CGGTGGCTACGGAGGAGATGCTGCTCTGGCCATGGTTTCTCCAGTACGTCCAGAAATGACTGGTCTTATTGGTGC 

AGCCACTTGTTGAAACTGGGAGGTGTTTCTGCTGCTAACGCAGCTATCCGCAAGCGCCTCAACCTA 

GAATCGTACACAAATCAGCCAAGACACTTTTGCTTGATTCAGACTCTATTAG^AACCTATACAACACTGATCTTCA 

TAACAACAATTTGGGAAGAATTATGCAATCTTTGGGACTTAAGGGGAATAATGCAGGGGATTTGGTTT^^ 

GTACCAGAATATGGAAACAATCGCGATCrTTACCAAGCGTCAAGCTATCCTTAAA^^ 

TTTACCTTACAACAGGTAAAAATGCTCTCGCACCGGTATCTGCTCAGGAAATGGCTTGTGCGTCTCTGACAGTCGGAGGAAGT 

ATCAGACGACAATGTTCAATCCCTGAACGGCCTTTATTTTCGGGTCTAGACTAGTGTAATATTGTATGTTTAA 

TGTATGTGATTTTTTCGGACAAATAAAAGAATTGGAATAAAAGACTTTGTATTATTTACCAAATTTATTGA 

GGTTGATTGTAGCATTTCCTCCCCCTGGAGATTTCATAGTGGGACATCCAAA^GTC^ 

AAGTGCAGGAGTAACAGAAAGCATGAAAGCGACCATGCTCAATGCGATGACGACAACGGCCAAGCC^ 

TC TGGT ATCT GTTTATAGGGTTTCACCCAGAAGTGTT TT AAACCC AAA 

AGAAAAGAGTCAGTCTTGCTCCAGCC^CCACCGAGGCAGGAGCTCCTCGGACGCTCCTGTATTTTTGTCCAGAAATGGATAAAGTTTGT 

TACTAGGGAGCGCACGTTTAAGGTACCTGCCGATCTACTGTGCGTTGCAACAGAACCGGAAATCTCTACCAAG^ 

ACAAGAGTGGTGGTGTTTTCAAGGTGCGTCTCGGTCCAGGAACTACATACAATAAATCCAAATGATGAAGGAT^ 

TGAAATTGCAATCTGCACAAGGAAAAAAACCCATTG^TTTGTACATCCAAATAAAO 

ATACCTGGATCCGGCAAC^CACCTTTTCTATCTTGATTCCTCCCTTTACCCT^ 

GAAGACACGTACAATGGTGTTGTTTTCTCTAACAGTGAAGAAAAAGAGAAGGATGCACTAGTGC^ 

TTGAAGC AGC CAT AAAAAAAAT AAT GCT AAGG AAAGTGT TTTTC AAGGATGGAGATCTTGATTTCGGGTACTT ACGT AT ACC AAAAT CT AAACTGGACAA 

ATTTACTCCCTATTTTCGGAGTCAATACGGGATAGTAAATGTTGAAAAAAATATCCCTGGT^^ 

TCC^GATGGTACCTTTACAACACCGACTCGGAATGGGAATATAAAAATGTGGCCGAAGAAAGAGTTGGACCTC^ 

AGTGTGAAAATTTATGTTTTAGGGACATAGACCTCAGAAAAAAGGAAGCAAAGGAAAAAAGGGATATAGA^ 

AAC^CTAACCCATAAGCATGAAATGCCTGAAAATATGCCCTATTTTGGACCAAAGTG 

GTGGATGAAATTTCTTATAATGATGAAGATGTAGACGAAATTTTGTCTGAGAATAGATCACTAAGAAATGTTTCTAT^ 

TACACACGTTATTAAAAAAAGGTGTGTCTATTCATGCTAGATTTACCCTTAATGGTTTGGATG^^ 

TTTTGAAGATGAGGAACTACAAGCCGCTTGTGCG(^TGTTAACCTTGAACAGTACGAATGGCTTTGTTCT 

AAGTCGCGGGTAGTGACTCGAGCAGTTAAGCGTAGGAGAAAATGTAGACACTGGA1TTATTTTGAT 

ATAAAAAAGTTACTGCTAGTATGGCATCTAAAATATGTAATGCAAAACACGACTC 

CGC ATATTTC AAGGT AG AACCTTC CCC AAT AAATTTTGCCAAGT T AAAATCTTGCCCGGATGTT AAAT ATGTGC AGAAAAAAAC AGATGGT ACATTTTCT 

GTTATAAGATTCTTTAGAAACATGACAAAGGGTGATCTTATTCAAAGGATGGATCTTTT^ 

GTAGGGCGGATTTTTATGCATGTAAAAGAGGAGAATCTATGCATATGTGCACAAACAAACACCGTATTCTTCAC 

GGCCATCGAACAAATAACCAATATCATCAGTGATACAAGAGGACCTAAGGCTATACACATAGAATACGCGATC^ 

GATGGAAGAAGATATGAAGCTAAATACACTGGAACTTTAACCGAGTACAAAAGAAATGAGGACyiAAACCT 

CTGTCAAC AAACCAT AT AAT AT T AACC ATTTGT ATGAGC AAT AT GGAAATTTTGATGAAGAATT AGAAGACAAGTTG AGG AGTGGTTTC ATTTXTTTA^ 
CACGT ATGT T ACTGCAAAAGAC AAC TGGGGCAGGTGTGC AACTGGAAAGGGGGCGTGCATCT AGGACCCATAAC TCAAAGTC AAT ACTGGT ATGTGTTTT 
TTGAGAAGGTGTCAGAATGTGCACATTAAAAACATACAAAATGACTACTTCAACAGA 
GGAGATTGGTGCAGTAATATCAAGACGGTATTTTCACCCTTCACAGAAGGCAAGGGAAATTT 

CATG TGGTT C AAGAGAGGC GGC AAACGCCAC AG AGCATTTT ATCAC C GT C TTT GC AAAGGACAAAT ATGAGCGG AAAAG AGT AAAAC GT ACAATC GG ATT 
CACCCTCGACAACACAAAGGAGTTGACGCCCAACAGATACTTGGTAGCAGATGTATACTCTTGGCA^ 

CC ACCAGGAAAGTCGGGAAC AT TTGT ACGCT ACTCTAAT GAAGAT AAAAGTTT^CTACT AGCAGAT ACCGGAAG AT ATATGAAAAAGAAGT ACGATG ATC 

CAGAAAATAAGACCAGTAGTGGGGGTGATGATGACGATGACGACGATGATGATGATGACGACAACAACAATGTTGACGTGTATGAAGAAAA 

AAAT GTATT C GAGGTCG AAAAGG ATG AAAAAT ATGCCTGT ACTTTTTC AATTT^ CTCC TGT AT GT AGAGGGTT A 

TTAGTAGAGACAGATGGACCCTCATCTCACCCTAAACGGGCCCCGTCAGCATTTA^^ 

CAGATGCACTAGAAGAAGAGGATGAAGTTGATGGAGTTCCTGAAAGAGAGAGGATTACAAATTTTGCTCTCAAGAGAGGACCTGCAACTGGCCAGA^ 
TGTATCTGTTAAACTGGAACATGATGGATCTAAAGCAGACCTGTACAACGTCACGT^ 

AAATGCJ^CTCATTCTTTCTCATCRTCTAACCArGGCTGGTCGTGTAGAGCTCGTCACTGGACCCATGTTTGCG^ 
ATACCAACAAGAAAATGGAGGCAATAAACATTGCCTGTTTGTC^^CACTCCCTAGAAACTAGGTACGGTTGTGG 
GGAGAAGTGATTGAAGGTTGTACTACAGTTTCTTCTATCAAGGAACTAATC 

TCTTCACGGATTTGGTGCTAGTCAATAGACTGGCTGACAAGGGGAAAAGGATTGTGATTGCAGCACT^ 

TATTCATAAGCTATTGCCTTATACAAATTCCATTGTTAAGCTAGCATCTAAATC 

TTTGGTAATGACAATGATAATAATGTTATATGTG^AGGAGGAGCTGAAATGTACGCTGC 

ACAAGGGGAAACTTGTTGTACTTGAAGGAGGTGACAGGTGCGGTAAGAGTACCCAAGCCAAACTCTTGTTGACCAAT 

AGAATATATGTGCTTTCCCGACAGGAGCAGCCATACGGGTAAACTCATCAATGATTATTTAACTAAGAAAATTGAACTAG^ 

TTATTTTCTGCAAATAGATGGGAAGTTTCTAGTAAAATTAAGCAGTTGTTAGACGATGGAATCCAT 

TTTTCTCTTTAGCTAGAGGAGTGGATACCGTTGAGTGGTGCTCTGCTAGCGATGAGGGACTTCCTCAGCC^ 

TGAAAAGTGTTCAAATAGGGATACTTTTGGTGTCGAAAGATTTGAGACAAATTCCATTCA 

GACGAAAAGAATGTATGGATTAAGGTAGACGCTCGCGGCACCATTGAGGAGGTGCAAACTAAAATTATAAATATTGT^ 
GAATTGTAAACTGTTTCCATGATGTGTTTCTTTCCTAGTGATTTTCACATGTTACCTAGAAAGACTTTGCC 
ACGAGTCTCTTCTGGAGAAGGTGTACTATGATAACAACAATGAACTGATTGTAAGAGTTGGTGGGATTTATATGCAGATATGCA 
CCATCACGATGATCCAGAGAGGTTCTTTTATAGTGTGTTGGAGGATT^ 

TTAGGACCGTGGGAGTTTTTATCGCGCAAACAAGTGAACCTCCAACACGG^TGCTACAAAGCTCTTT 

TATTACCCCAGCAAATGAAAACCAACCTGGAAAAAATGGAAGAAATACAGCGTACTAGACTCATTCACTCTA^ 

CTCTGACCAGCTAGATGGATGTCTTATATGTTAACACTT^TTCACTACCATGTAA 

GTAAAAACTTTATTGTTTCTTTCCTCTTTCTTTTGTCTGTCACTTTATTCCTCAGACATAACAACCATA 
GACATTTTGGGTGTGTTAAAAGTATCACAGGAAAAATCGTCTCAAGAAAGGCAATGGATAATTATAGG^ 
AGGAATTTTCTGTGGATACCAGCAATATTCTACACCTCAGTCTGCACCC^^ 

ATTGTAGGATTCTGGATATCGATGATTATCCAGGCATTGAAAAGGAGTAATAAAGATATAATATAAAA 

TTCACCTTGTACCGAAACCCATTCTCTAOU^GCACTCTGTCTTTGGACACGGGGATCAATATCTTGGAT^ 

CTTCATTTTCAAATACGAGGAGC^ATGTTTTCTCAGTATTCCTGTTGTGTATTTGGAGGTGAACAGGGAAGGAG 
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ATGGAACTCGTATACCATGAATTGTACTCCTCCAATATCTT^ 
TACMTCTCTTGTGGTGGTGGTTACACATCTTTCCACTGGTC 

AATGC GATGTGGAAAATTT T AT ATGTTTCCGACAAGGGG AAAAAT ATGAAT AAACTG AGC AT ACAC ATGTCGT ATCC AG ATATCTAGGTCGGGAAGAAT C 

CAGAAACGATTCAAACCATTTCTGGAACATTCATTTCTGGAAAGGGTTAC^^ 

TGTAATCTGACATTGGGTCGACCCAGAGCTCCACCCTCCGAACTTGAC^ 

TTTATAAAATTTTTGATGAGGAAAAATAGAAGTTAAGATGCTTGTCAATGCATGCTC^ 

ATATCTAGGTCGGGAAGAATCCAGAAACGATTCTAGCCATTTCTGGAACAATCATTTCTGGAA^ 

GTACCATTTCTGGCACCTCTGTGTAATCTGACATTGGGTCGACC 

CTCGAGTGAGCAGAAAAATTTTTAAAAJ^TTTTTTGAGATGGAGATGGAGTGAAAC 

GCGGCTCACTCTCTAGAAACGTCTGCTCC^GAAACACATAAAAAAGATAAAGTACATTTCTC 

ACCCAGCGGTCCACCCTCGGAACTTGACATCGGGCCTACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTATAAAAT^ 

GAAAAACACTACCAGTGGAGCGACCGTACACACTCCTCTAGCTAACTACTAACCGAGAGACTGAGGCAGAGTCTGTCCAG 

CCAGAAACGATTCAAACCATTTCTG^GAACAATCATTTCTGGAAAGGGTT^ 

GTGTAATCTGACATTGGCCTGACCCAGCGCTCCACCCTCCGAACTTGACATCAGGCCTAGCCJU^ 
TTTTTAAAAATTTTTGATGAGGGAATATAGTACTCCGTAGCCAACATATACACATGAACACATGAGGCGG^ 
CCAGATATCrcGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGCCATTTC 
TCTGGTJVTAATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAG^ 

TAAACTCGAGTGACGCAGAAAAATTTTTT AAAAAT TT TTGATGAGGG AATATAGT ACTCCGTAGC CAACATATACACATGAACAC ATGAGGC GGTCT AC A 

ACAGAAAGAGAACTGATAGCTGTTCCAGATATCTGGGTCGGGCAGAACCC^UaAAACGTTTCA 

ATTTCTTIATAACTGGTATATCATTTCTGGTATAATTTCTGGCA 

GGGCCTACCTAGCGGTCCACCCTCTAAACTCGAGTGAGCT 

TACACACACTCGCCC ATGCTCGTCT AGCTGAT AGCTGTTC CAG AT ATCTGGGTCGGC CAGAACCC AGAAACGTT TCAACTC ATTTCTGGACAAGT C ATTT 

CTGGAAAGGGX5GT AC AATTTCTT ATAACTGGTAT ATT AT TTCTGGTATAATTTCAGGCAC CTTCGTGC^ GAC CCAGCGGTCC ACC 

CTCCGAACCTGACATCGGGCCTACCCAGCGGTCTACCCTCTAAACTCGACTGACGCAGA 

TTCTTGACGATAAGAGGAGGCAGTAGGTGAGGCTGCTTGTTTGATGTGTCAGCCACATCTGCGTCATA 

TGGACCATAAAAGGCTTTGTACGTCCAGAGACAAGTTTTAGTCTGATAGATTTCT^ 

GTATAAAAGATAGGTGCAAAGGTAGAGAATCATCATATGGACAAAATCTG7CCATGAGAC 

CAGGCCCAGX^C^GTTCCAACCCAAGAACTTCTTTCTACAAACTTTCTTGAAGCTCA 

TGTAGTTTATTGTGACTCTGAGACGTAC^CCAAACCTATACCGATTTTTCGGAACAAGAGTATAGTT^ 

AATGAAGATGTGAGTTACCAAATGGTTTCTTCCGTCTTAGAAAAATTTCCCTTGCTATTCCACTGCACTT 

CTCTGTGGAAGAAGTTGTACAACAAAAGAAAATTCAAACTCCTCAACTCATTGT^ 

TGACAGGGAGAATATATGTGATGCTTCAGGAAGGAGTGTTCTTATGAGTtSAAATAATGTCCACGTCAACTTTTCA 

TACTTGTTTGATATGTTAAATATGGAACGTGGCAAACAAGGAGGGAGTTTTCTTCACTTCTTTGCATCTAGG 

AAGAAATGGACTCTCATGTGCTCAGTAACATAGCGAAATTCATATGCAATGAAAAGGAAAAACTAGACT 

CCCTGATAAAACTAATGATGAAGGGTACATCCCGGTGGAAATAGCAATTATGGAAGACAATTACCCTGCATTGCT 

TCTTGGGCAAACACATACGGGGATCATAATGAATCTCTCAAAGCGTTTGCAATAAGAAATGATGCAAAAG^ 

ATCACTACAGTTTCAACAAAAATGTGACGAAGGAAGAATTTCTTAAA 

TT-GTTACAAACTCCCATGTGGACATTTCATGCATACATTTTGCTTGTCTAATAAGTG 

GATGACACAATTTTTAGAAAATGTCCCCCAACTATACAATGGAAAATGGOTATAJU\CCAAAC 

TT GAC AC AT ATTT AGAT TT T AT T T GCT C AT AT AAC GT CAAAT T A GAC AAAAAAT C AAAACCT AAAC AC AAACCTGAAAACAAAAA GG TG GAAGAAG AAC T 
AGCAAAAAGGACAGCAGAAATTGAAGAGGCCATAAAGAAAAAGGAAGAAGAACTAG 

GAAGAACTAGCAAAAAGGACAGCAGAAATTGAAGAGGCCATGAAGAAAAAGGAAGAAGAAGAACTCTCAAAATATAATAAAAT 
GACGACTGAATCAAGAATGTGTCAAGCTGAGAGATATTTCAACTGCAGCC 

TTCCGATCAGGAGTTG^CTGAGGCGAAAGAGAGGTTGAGGAAAATTrTATTGCTAGAAGAAGAAAC^^ 

GTAGAAGAACGTATATTCCTAACTAAAGATGATGAAACGTTAGCCTTCAACTTAGCCCTAGAAAAGAAAACGGAGGAC^ 

AAAAAGGCAGTGAAAGAAGAGATGGAGAATATACTATAACTTCTCATATTGAGAAACTACCTC 

ATAAATAAAAGTATAAAATGTAATAATATTGTTTTAATATCATTAATATTTTACC^ 

GATTGGACAAC^CATTCCCATTCGACCTCCTCCATACACGTCAAGAGTGTGATAAA 

AAGGAGAGAATATTGGTATACATTCGTATGAACACACGTCAAAGATTATTGACACGG 

TATATACAAAGCTATAAACCATTTAGAAAATATCCTAAAACTCAACAAAGGAGAAAAAATTATACTGATGGATGTAG 
AAAATTTTAATGAAAGGGATTCTTCCCAAGGGTAAAAATGGAAGTTTCAGTACATGCGTACGC^TTGC^ 

CT GT ATT TGAAACA G AG AAAG AAGC GTTC AATTCT AT AC AAAAT CT AGT AGATT ATT AT AATG AAATT GT AGCTC AC AC C AATG AC C AAATT AAAAT AAT 
AAAAGCGTGCGCATATTTCATGTACAACTTTC^AACTCTCCACCCTTTCAATGATGGTAATGGAAGAA 
GGTAATGGTATCGTACCTCATTTTTCACCCATAACACACCCTAGGGATC^ 
TATTGTATG^TTTTGCTGGAATCAAT AAAAAAT AAGTAAAATTCATTTTGA 

TGTTATCTCAGCAATAATGXIGTAACACATCAGCAGArGTGCAGATrrTATCXXCTTCCCCATC^ 

GAATCTCCAGTAATGGGGTTAGTTACATTCAATACAGCAAAGAATTGGGCAATATCTTTGTCGGCCAACT/TGT 

TCATAATGATAGATTTTCCAGTGGTGTCACAAATGGGGGATTTATCAAACGGCATGGCG«GX3ATATTGGGGTGX;AATGG^ 

GCCGAGGAGGGCAAACTTCCCGTGCTTGTAAAGXrrCCTTCCACAGG^^CTATTCGTCTGCTTGTAGTT 

ACGTGGGACAGCACTTTGAAGCTCATCTTTTCGTCCGAAGAAGAAGACAG^ACACTCTCTC 

TGTACAATGGAGACTCGCAATCGGTGAACATGGCGTCGGTAGGGAGGAGCATGGTTTGCATGAACA 

GGAGAATATGTTTTCCAATTTGCTTATAAGTGAGTCCATTGCTGCTGCTTGCT 

GAGGACTGTTCTTATATACAAATTCGTTAAAGXOTGACGCCCACAGATAAGAATGTGGTTAGACTTGAAT^ 

TTGTAAGGGGTCAATAAGGACTAGACTCCTCTACCAGAATTCATGTGCGTCATTGCAAGCAAAAACTC 

TGCATGCCAGTCTGTATGACATCATTTCCTTATCTGCAATGGATAATTATAATT^ 

AAT AC AC AT AACATT TCTGGGGGT GTATT T AGTGC ATT AT AT CTGGC ATCCAGAAACGGATGC AAGC AGT AT CTGAAACTCTCTCTCACTTTTG AC ATC A 
ATGAAAAATTGGACATCAACACGCATTGGTGTGGTGGATAAAAGGGGGCAACCGGCTATGACTAACAAG 

AT ATT ACACAATGGAGG ACCT AAAATCC ACTATCG AGAG AGT AT ATGAAGAAAG AGT GGAGAATCTAG AACAATGGAC AAAT ACT GT AGAGGAAG AAG 
AAAGGACTGTCTCAGCAATCGATTCTGTCCTGGAGGAACAAAAAAGGGCCCT^ 

AAAAGAAGGGATATCTGCACTCGTTTTCAACGCAGCAGACGCCAAAACACGTAAAGAATTGATAAATACGTG 
AGAAGAAAGGAAGCAACCTCTACCAATAATCAACTGAAGAACCAGATGTCATCTCTAGTCAACACAACCAAAACACT 
ACAGAAGAAGTGCCATACTCAACATGCAATACATCAATAACAAAAGGGATTATGAAGCAACTCAATTTTGGGTGTATACAAAC^ 
ATAAATTTTATTCTATAATAAAAGGTGTTTTAATTATAATTTTTGTCCCT 

AAACGGTG^CGGCGGAAACGAGGAACAGAAGAGCGGACCCAGCCAGAAGGATCATATCCCTGCT 

GGCAGTGTCCTCATCATCATCGGTGTCCTTATCAGTGTCAGGATCGCTGTCCTTCTTTGGTCCCATCCATACATTCA 

ATTGAGATAGGAACGATCAAAAAGAGACCCATGCGAGCCATAGACATGGAGCCTTCAAGATCTTCCATGGTGTAAGAAGGGCCACCGGC 

TCCTGCCGAAAGGAAGAGTGTTAGTCGTGGTGGCCATTTTTTTTCTGTTTGATG^ 

CGGTTTTTGTCAGXaACACCTTACGAACCTATTTGTCTTTTTAGACTTACTCAGCGTTAGTTGTCGCiVCT 
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TCTTCTTCAACGCCGGGTTAACGACAATCTTAGATCCACGG 

CCACCTTCTCATAAAATGAGGGGACCTTCTTATAGCGTTTTAGGACCT<^TC 

TGCAGACAAATAATATACAGGTAACAAAAGAATGGGAATTGTTTTCCGATAAGTTGAGAAAATT^ 

TGGCGAAGGAGAAGAAGATGGAGATGAAAATGAAGACGGGGGTGGAAATGGGGGAAGAATTGAAGAC^GAGAAGC^ 

TrGTCCTTTGTTGGAAGAGAAGATCCAGTCGCTGTAGATTTACCCACGTGGCGAGAAAACAGTACAGA^ 

GCGATTTAATAGTTGAATGTGGATGCATCAAATCAAAAGAGGAACTCTTTGACTTCATTTT^ 

GGGTATGGCAAACAGGAGTAAATTCACCAAGGAATCATTAATTGACTGGTTTTTTTGACT 

AACTGGTACTTGAAATCTCAAGCGTCTCCAATTTCATTGGTACTAGATGATATATAT^ 

GGGCAAAAAACCCATCTT7AACAGTGGCTTCATCCTTTTCTCCCACGCCCGACA 

AGACATTAATATTAGCCAGATGGCATTAACTGAAAGGGACTGCTTCTTCCCTCTTTTAAC^ 

GAGACAATGAAGAGACCTACCTTATCTCTTCTACCTTCCACCTCCTCCTCCT 

TTCTTCCAGTGTACAGGAGTAACTTTTCTACAGCATCCAA 

GT ATGCGAATGGAAAAATAAGCCCT ATAAGGATT ATGGT AAG AAAATCAACTATT AT TC C AG AAGTGTTT AACC ATCTTTTGTTCCCTGTCTTTGCCTCT 
AAAGACACTGGTGC G AATATCTT ATTTTTT AT CAAAATGAAATCCTTTGCAAGTGCATC TTT ACTCCTCCCTGG ACTTT TTAGACACCCCAAACAATTTC 
TCAACGGGCCGTGCAAATGGATGACTCTAGCAGAAAACAACATCAACGACAACAACATAAACTC 

TCCTCTGGGCT ATT AC ACCCAAGAGAGCCCTC AAC CCTATCAGAC ATGCGGCAATTT T ACTT CGACT ACAAACAAGAGACTACAAAACGTGCAGCCATT A 
TACTTTT AAACACT CTTTTGGAAT ACT AC AG<3 ACACCTTC AG AAG AGTGG^AAATTC CGTTT AATCTCTTGCTT AATGT GATGAATAAC AAGTGGAGT AC 
ACTC ATTCCAGGTGT CAAAATAAGTGCAGGT ATCATATCG AAACTCCC ATGGACC AT GAAAACAATGT ACGAG ATTGTTTCTTCGCCCAATAAT AATAAT 
AACAACGGAGACTACTATTCTACATGCAGGCGAATGGTAATGGAATATCCTATCGGGGGTT7ATTGCA 

CC AGAATGGTCACCTGT ACAAAGGGCAAAGAC CAC CAGAAGC T AT ATG ACATCTC TAG ACAAATGTTTGATATAATAGAAGC AAATGGACAACTC TG ATT 
ATTATTATTACG<3TTACGAAAATTCCCCAAAGACTAAAAAATTCAATACTGCACTGATGATTGGGTTCATTTT^ 

GGGTTTATTTAGATCTCGAAATGCAACCACCCAAGAGAGCAAAACT7CTTCCCCAACAATCTCCTCGACCCCAACTACATATTCTGGCAG 

AGGGGTCCAGGTTCTGGATCTGGAAACAAACCCAAAGATGACACATCCGTTGJUVGGAATAGACCCTGGCTTACTGTAACA 

C^GCTCGCTTGCCAATTGTCCTGTTACGTACTCTGTGGTTTCACGAGGTTGTCATCACC^^ 

CTTAATAACCAAGCAACGTTCGATAAAGAAAAAAACTCGTCATGGATCTTTCTTTCACTCTTTCGGTCGT 

TGCTGTATTTATTGTGATTTTTAGGTATCAGAACACTGTGA 

ATTCCTGTGACTGCTGAGGTTGGATCAGGCTACTTCAAGATGACTGATGTGTCCTTTGACAGCGA^^ 

CTGATGCACAGATGAAGGAAGAAGATGCGGATCTrGTCATCACTCCCGTGGAGGGCCGAGCACTCGAAGTGA 

AACATTCAAGCTGTGGAACAACACATCAAGAAAGATCAACATCACTGGTATGCAGATGGTGCCAAAG 

AACACCTCCTCCTTCACCCCCGTCTCTATTGATGAGGATGAAGTTGGCACCTTTGTCTGTGGTA 

GAAATCTTTTCGACATGTACGTGCACGTCACCTACTCTGG<^CTGAGACCGAGTAAATAAATCGT^ 

ACAATAAGAAAATAAAACAATTGAGGAAATTTATACCATATTTTATTGACCTACTTAACCTTCTTGCTATAC^ 

TAGCAATATTATCCTTGAACGGGAAAC^TGCACCAATTACAGGCGCAATTTCATACGCTCTCGGCCTATT 

AATAGACAAAAATGGAATGTTTGTATAGATAGAATTGGCAGACAAATCTGCAGTTCTCTTAATCAAAATGG^ 

CC AAAAGTCATCGC AGTTTCTGAACACAACTCACTGTT AAT AAATT C AGG AGCTGT ATG AGGATGGT TACTAAAGAACC TCTCATC AGTTCCCCAAC ATT 

TAAAATTGTAGTACTTTTTACATGGTACAATTAAACCAAAATCAATCATCTTAGGTTGACCAGTTATTCCAra^ 

CGGATTCACTAATCCTTGTTGAGACAACCGAGTGACAArATT^ACAATTTCTACCAAAA 

ACGATAGCTATGGCTGAAATTGAATCCGTAATG*^TTTCTTGCATTTAGATTGTAGACCTTCAGGCA 

ACAGAGTATCCCTCATCCTACACAAGCTGCCATGATCTTGCATGAG 

ctcgtccattitac^x^tcaggtctccatttaatacatcaatcacagaaatgcctgcaaaatcc 

acacctactgcgcctttcattcttccctcaagagcgtgactaaaaacagtttcaaatacaaattcacagt 

cttttccagtttcaagcatgtaatatacccosactttaacttcgagatata^ 

gtgtgctggtacaggaat atttctgcggtaactt t cccatttctt ac attgggc agtgacc t ccccc aagagtgtagttttctt ctccaat atagattc a 

catgctccagxsggaaaxscttaaaaacattcttt^cc^tgtgatcgatgctcatggtttt 

tcaagagagggctagtgtctaaaacggataggtttttactaccaaacccaacaggccaccaagatga^ 

aaacttggacgct^tttcctttg^gtcaatacagctctgttttatggtctcgtag 

gcctgtctaatciaaatggaacggagattctagaccgxsctgcaatattggtatcagcctctgccactgc^ 

acttgtcaatggaagaaaagttcctctttactacatttgtttcagtcacggtatcacaagaagatttcaagtgt 

gacagac^gtggagtatagtctctagccttcaaattgacagtagagaattttgcacaat^ 

gcgcggtgcttcctcaacacagacgcggcactaacagaaggcaattgctgcttcttgataatgtctgtcgcatac 

tttcatcaatagtaataggcctcrrcttcttatttctattaaagttgtaaggtgtgctctc^ 

ttttcctcccacctctccmtattcttctttagtctcttcatgggcctaagagaa 

caacaagatttcacagctatggctgaagotatcttgaattgccctcctccttctcctccttctccttctcct 

cgcttgctggcgtaagttttgtccgttggtccccaccctc<^ttttaacgacgttttttata 

cactcgtggatagttcctttatatat^aaccaatttttcaaac^g^ 

CAGGTCAGGCCTGCAAAGGACTGGGTATTTACCAGTCCGAATATAACAGTCAAAAGGCT 
TTGCTGAGACATATACTGTCTGGCCGCCCCrrrCCTTATTAGC^ 

TTCAACATATTTG<^CATCCAAACGT/TACAGT^TTCATGGATACTACACCCTCGTCTCCTAACAA 
TTTGCCTCTTGACAGATTCTGTTACAGCGTCCTGGATTTTCTTGGCCCACAGAGCTTTAATAGATCTC 

ACCTCTGGC AGTAATGGGGCAGTAATT AAAACAG A CAAC AGAAC CTCC T TCTCTCCAAATTCTGGT AGATAGATTGTCCGCTTCT^PTATTCAACAAT AAA 

CTTCTTAGACAGTTTGAAATAATGTCCAATTCTTTACTCTTGTGGGAAAAGGGTCCTCCAATAAATCGACCAAA 

CGTTAGAAGATGGTTCATTAAA<^TCATACAAAATTTGACATCAGAAGGAGGTTTCCCGCTCTTACAA 

CTGT AC AGGG TTCAC CTTTGAAGCTCT AACTTCT ATT AC AGC AT TTG AT ATGTCAGTTTCC ATGGCCTTCATTAATTTGGGGAT ATTCCT ATT AACAAAA 

TCACACATAGCTATTTTCGATTCAGGGTGGATGTGGGGTGAAAGAACA^ 

TGTTGATTGGGTATAGAGATTCTTGGTGCTTCTTTTCCCTT^ 

ATCT T CACTGTCCGACATTTGGTGGGC ACTATT AT TATT AGT GT TCTTG AC ACTGTT AC C AATTTTC TTGTACATTT TTTCAAT AGTT AAAAC ATTAGC A 
TTATCAGCACTCTCATTCTTCATTATGTT<rrTCAG 

TTATAAAGTGTATt^CTGATGCATATTTCTTACCCGTAACAGGTTTATGAG^TCAGCGAGGCTTGG 

AGTCGTGT AC AAGGC CTCGATGGTTTCCTTTGCC AGCTC AGGGG ATGT AGTTTT ACTCCCAT ATTTCGAATCC AT ATGT AGTAGTTCTTGl - riXSTTGTCG 

AGAATGTCCTTCAATAAAGGGTCCAACGTACTCTTATTTAAAATTTTGTGTTCGAATTCATATATCAGCGAACAAGTTTTCATG 

AACATCTCACAAGACAGATGACTGCAGTC^TCAGAAACGTATCACTTTTCCCACTTCTTTCTTC 

GTAATCGAGTATAGAITCTGCATTG^TGCGATGGTTTCTTTAACAGCTTGGG<n , AAATTAGATACAGTTCTG 

TCTTCTACCTGGCG^CATTGTCTTTTGTTTGTGTTTTTGTTTCCAGAGAGAGTAATTGCA^AATTTTTC 

CCACTTTTTGGAAGAGAGATTTCTGGCGGTCATCTTCAACAACTGGAGCCATGTCTGTACCGGAGCCAACTTTOTTCACAACTAACCGC 
GTCTTTTAAAGCCATTACGGTCC ATGCCAT ATCTC AAAG GTTGC C AATG ACCAAC AT AAAT ATC GGCACAGCAT AC ATTG AATTGCCTAAT ACTATAAAG 
GAAGGACTCGATAACCCGAAACCTATATACTTTGATCCTTTATCTAAAAAATATAAACCG^TACTTGCACT 

TAG AGG ACT ATAGAAAAAGT AT AGGATGGCGTCCTCGGGGGG ATTC T TT AC AGGAAT AG ATG ACCTTTTCAAGACAGTGATTC AACAAG AAAAACAAGAG 
AAAAATAAACCMCTCAAGCACCAGAAACAGAACCAAAACCAGGGCCATCTCAAGCTCCAGATCCAGTCCCAGACCCAGT 
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TCTGTCCTCCTCCACCTAATCCTCTCCCTCCTCCTCCTCCT^ 

AGCTCTTAGTGATATTGTTGAAGCCACAAACGAGCGTGTTGATGCGTTGAAAGAGAACCAAGCATTAAAT^^ 

CAGGTTTTAAAGTGCTCGATAACACCTTCTGTACCAACAGCTATTATAGGCGCACACGTGAAACAGGTGGCCAAAAGTAGCG 

ACGAACTCGATATAAAAAATAAGTGCTCTTTAGTGTACAACGAAAATGAGTCGTTAAAATTTTTCAGGGACCATGAGAACCTTATA 

C C AGTT ATTCTC T AGGC ACGAT AACACCAAATGCGTGGGGGC AGAAAT ATGTGTT AAAGGCAACG AAJiAAAACAAG^^ 

AAACTCCCCAATGCACCATCCTC^TCTTCAACTGTGCTAGAAATTAGAGGaSCTACCAGAAAT^ 

CTCTCAATGAAAACAAGGACATTCCTCCTTCAGAACGAGCCAACCTGGA 

CAC7AAAAGGAAGCTTTTCTTTAAAGGCAACACTTTTTATCAACGAAAACCAACATTCGATAATAAATTCA 
AGTGAAGCATCAAAACAAACCACTAAATCGCTAGACAAGCCAACGGACGACAATTTATTCCT 
GTTTGAAATTTAAAAACGTCCTCTATAAAAATAGTACCGCACATCCCGGaUUVCGAAATTACTACAAG^ 
TTCGGCGAAAGAGTACAAGATGGTCTTTGCAGAAATCGACAAGTGTTTGGATGTTCTTTTGGCCATAGG 

AT AC AAT AT AGAGG AAAGT TT AGAAGGTATTT AAT ATTCTGC TACGC CTTTT AT GCT CT AAAT AAGGC AAAACATTC TC GC GCAGT ATCC C C TC T ACCAT 
TTAATriCTTTAACCTTTTCTCCTTCATCTATTCTC^ 

TTTCCCCATGGG^CAGCCGCCCCATCCGTCTCAGCCAAGCGGCTCATGGATATCGATTCC<^ 

TTTGGTTCACCTTCAAAAACAAGTCTCCATACAAGAACATTGG^GTCTTTCTrAGGTT^ 

GTGTAGAAGTGCGTGTATCTCCAGCTCTCCAACAAAGGATATCTAAATCCCTFAGAAAGATGGTGTGATTC^ 

ATTCCACAGATTCAGTGGTGCGAAAAAAGTATCACTCGAATCGGCGCTTCGCCTrCATCATGGG^ 

AAGAGATGCCGAATAGAAGCAGCGGAAATGGAAGGTGTGGAAG 

TACAAAAAGCCCTC GGATT ACC TGTCCCTAAGAT AAACCCTCTCATGACAGC AT CTT CTTCT CAAT ACAJ\TTT AGGGGATTTTCT AG^CGTGGAAC AAC T 

TCTAAAGGCTAAGAGAGAGTTTCCAGCCGAAGGAGAAACCGCAGGArrTCTCGGCATGTTTGATAATCT^ 

GAAGGAGCCTTTTCAGACGTGGTOGAAAATGTAAAACAAGGCATGGAACA 

C ATT CT ACGAAG AAG AAAAGGATGTTC CAOU^CAGGAAGAAAATT CT ACAC AAC AAA GAT AT AG ATT G AAT AGAG ACGT GGC TTC 

TCCTATGAAGATGGTGTTTGTGTCTATACTCGATAAAACTAACCAAAAAGAACGTTTCATGTCTGT^ 

AGGAACGTACTGAAAAAGGATTGGAACGAATACGCTATCGCTAAAC^CAACTACGAATGG 

TAGTGAATTTTGGAATATTAGGTGACTTGAAAATAACCAATAAACTTGACACAAATA 

AGATCAGAAAAAATTTATAAAAAACACATCCCTATCTGATCGAAAACAATTGGCCCTTGTTC^ 

AGAGTGACTGCAACATCATGGGCTGTCGATGCGCT TCGT ACCTATACAAGAGGTGAT AAAGACATCtTTGCOSCCCTATCTTCATCGCTGGArATGTACC 
ATCTTGGGCACACGAATTCAGCTAATTTTGTTCCATATTTTAGTAGA 

CTCTGAAAAATTGGCCAAAGAAGAATTGGGAAGAGGACGTTTAGGGGGCCTCAATAAGGTAGG 
ATTTCTTCTGCCTTAGATATGGGGGAAGTAGAAGCTGTAATGGACGACTCTTCTAAAGTGAGA 

AGGTCTCGGCCGCCAGAGAAAAGGCGAGAGAAGCTAGTATTAAAC GTCT^CTTCTGGCCACTAATGCACCJUX^GCTGGTTCATCCAGAAACAGTAACAG 
TTTCTTCATGCTGCTGTTCCTGATTTTGTrTATTGAGTATTCCTTCGAAAGTGAAACCTCTATAGTGAGATTAC 

AAGAC G AAATGGT AT GC C CTT C AAC AGGTC CCGAAGCT AAT AAGAAG AGGAAATT AGT AAGG AAT AA^ CT GT ACT GACGT TGGAT GAT GAAGA 

T AAC ATC GTTAAAT ACAAC AAATATGATATGGTTGAAGACGAGGAAGCGCGTGAAAG ATT AC GCC AC CAGGACAAAC AATCGGTT ATTGCAGCCCGT ATC 
AGT AAAGTGTGTGAGC GG AAAAAT C CAAAGAAAAAAC GT C GTTT AGAAGACC CTG AAT T GCAAAGT G TGGATGAACAAT TGAT AC GGGAACTGGC TGC C A 
TTG^CTACTGACGAGATCTAAGATTTTGTAATATATTATGTGTCTTGCAGACCTAATATCCTGAGGATTAATAATAATAA 

GAGAC GAT ACTTTTAAC CAAGAAACTGCGGTGAAACTTGTAC GATGGT AT AC AG AGT ACG AT TGTTGTTGCCCATTGGTTAACCGCGTGGAGCGCCT TCT 
AGGATCGTTCGGAGGAGGCGTGGACGCCACGTCTGTGCGAAGCCGACCGGCTCTTTATGAAGAA 

ACGTCCCTTATTGAGGGTATACTTTTGGAAAGGGCTCTAACTAAACCCGATTTAGCTGCTGCAGCTTTTGATGTATCAGA 

GT AAT AACACTC AAGGC AATTTTG ATG7TCTCTTC AATG AC CAT AT GGATT GATG G C AAT AAT AGT AAAAAGT AT GAAGT T AC ATGCC CGTCATGCACTGT 

CGAGAAAATTAGTGGAGGTGCCGAATCTATTCAGAAGAAACCCATGTCTCTTCT^ 

ATTGAACTCAAGAAATTGTACCTCTCCTTACTAACGGGCTCGGCAGCCGGAGGAGGAGGTATGTAC 

CTTGGACC^CGTTGTTGTTCCACACATCTAAAAAGGACAAGACTCGT 

GCCTAGGTGCGTCTGTTCCGATCTGTTATATGCCCTATGCTCCACCAC^^ 

GGTGGGGAATTTTTATATTTTAAATACAO^TCTTTGAAGAGAATGGACCTTTCGACTCCAAAAC^ 
CTGAGACAAATTCATCGGCACTGGCCGCTTCTTCTTCCTCT^ 

AAAAACTAAGAAGAAACAACCCAAGAAACAAACCAAGAAACAAAAAACAACAACATCAACA 

ATGAATGTACTTAAAAAAGGAGCTGTTAATGGAAAACGGAAAATGATGGATTCTTTGTCAGGAA 

CTC^TGCTGGTGGTGGTGCTTCATCCGACGTTGTTGCAGGAGAAAATGAGGAAGAGAACAACCCCTC^ 

AGACTATCTGCTTCCATGCCCTCAAATAGAAGAAGTCACTATTTTTTCACAACACAGGATGAACAATAAC 

GTTGTTATTAATGGAAATTGTTTAAACTTGTTTCTTACTCAACACA 

AGCATGTAGGATTTAACAAATTTCGCATTTTGACTGGCGTTTCTTGCTTTTTTGATAGA^ 

GAGTAATAATGCTGCCCATTCG^TATTCTAAO;CTATTGTC!ATACATAAGAGAAAACTCATT 

GATTTrGTCGn^AAATCAC^GGACACTAATATAGGK^TACCTTTAAGCAACAAGGAAATAAGAGAA 

TAGCTGGTTTGGGAAAATGATCACCGACCCGGGGGTCATTTTACCCGTTTCCAAGGAT^ 

ATGACACTGGACCCTCACGATTTGGATATAAAAATCACATCTAAACG<^TAGCT 

CACGGGCAATGGAAAAGGAACTAAATAATAGTAGAAATATGAAAGAGTCAATATTCACGGGAA^ 

CATGTTCAACGGAG^AGGTTCAGCTTTGCGCTTAATTAGATCCCCCGCTTTGAATTCTGCTGra 

ACCATAACCAAATCTCTAAGGAGAAGTCAAGCTAGAGATAAGCAAGTGGATAAAACAAGAGAGAA 

TAGCTGCTCAAGTAATCCACCTCACAGACG<^GAGATGACGTACGTCCCCGATC^GCACTG 

CTACTTGAT<^TAAACGACCCCACTGGTTGGGGATGGAAAATTATGCCTAACAATrrCAATAAAACACTTGA 

GAAACATTAGTCGAGTTTGCCTGCAAGTGCGTC^CATCATCCTTGATTAAAAGG^^ 

TCCTCCCTCCAGCTTCTTCT ACTTCCAAT AATAGTCCTAGAGTAGCGAT AATG AC AT C TGGAAGT AGT ACTACTACGGGCATTGGAT C CTTGTCCATTC T 

TGCAGAAGATGGATCAACACACCACCAAATCAAGCTATCGGAATATAGGACTGGATTATCCATTACTGAAA^ 

CCTTCAATAGACG^GTTCAAGCAGAGCATCCTCTATCCCCTTCTATTCT^AGTGGTTACCT^ 

CAGCAGCACTAGTAGAAGAAGAAAACGGX^CAATAAACCTTCTGATAAAGATAACGAAGACAAGTACAG 

CACACCTCTAATTACACCCAAGAAATG^GAGCGTGCAAAATAAACGATCGGGCAATGATTACT 

GATTGG ACAAAT AAAAC T AC AAAGGTTGATTATTTTG ATAAG ATGGCTGCCTTTGTTGCCCTCATGACC TTTAGAAAATTCC AAGAC AT ACT AGC GG AT A 
ACTATGTTCCTCCTCAAACCCCTTCTCAGGGAAGTGAATACGCAGTGACCA 

TGGAAAT AAGCC ATTGT TTGCCCT AGAACAGCTAG AAAATGAAACCGGGATTGAAAGC AT AT AC GTCCT AAATATCATAGGAAATTCCCCTGATGGT AAT 
TCTGTCAGGG?TCGTCAGACTGGAAAAGGaAATGAGTTTCCTCTTGAAGGCGAAGCAGTACTTTACAGAAA 
AATGGACAGATAAGGCCCCGTCATCTGTAAAGGmGTACAAGTATTTTTGTGATCTAACAGCACCCATTTCAAAGAG^ 
CGGTGTGGA^ATTCTGCGTTGACTTATACACCTAGGTGCATATACCACACTGAA 

C AC GT AT CTTTC AACAA GGATT T G AACAT ATT AG AAAT T GGAAAAAAT ATT ACC AAC C AAT A CC AAAC AAACT AC AAAAGC AT ATT C GAAAT TG T GG ACG 
TTCCCATAATTGTCGCCTCTATGTCATCAACAAAAACAATGACTGTAAAC^ 
GCCAAAAACAGGGAAACAACTTCTGGCAGTTGAAGAGGTCAGAAACTTTAAACT^ 
AAC^CAACTCTTTGTTCACAAATAACTGAACAAAATTGCCCGTCATCTTCTGAAGGTGGGCGTTTTT 
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CTAATCTCTCTAAAAAGCGCGCACTGGAAGAAATTGCCCCAGAGACTGAGACTAGCATTTTGTCACTAGCCATGTAACGTGTCTC^TATATC^CAATTT 

AAACAACC TCTT AAT AAAGAGGATAATTATCAT AAT AGAAAATT AGGTAAAATGGGAGT AC AGAAGAAT ATCCTGGTCG GTGGT GGTGGTGGTGT ATCAT 

TGTTATTGGGTGTTGTTACACTTCTAGGAACAGTAACAGAAGGAGCACCGGC^ 

TGTTTTCTGGGTCGAAGGAAATCGTGTTTTAAGTGGAACCAAGAAGGAC^^ 

TTCAGACATGACTKn^CTGAAACTCGTTCTATTCAATGGC^ 

AACAC CGT ACTCATGAAACAACAGAACCT GACGATTTATTGTG&GACGGATCAAG AAAAACT ACC ACC AT AATTCT ACCT AAAAAAT GGTGGTCTGATGT 
TGTGTGGACATCTTTATGGAGGGATAACGACCIAGAAGTGTGGGTGCGGGCAG^ 

TGGCTTCCTGCTCACACTAACGGAAAGACATCTGAGGGTGATACTAATTCAGCCTACCTTTTCATA^ 

CTGACGTCACAGAAGATAATATGATGATGGGAAGAATGTCAGGC^CACCC^TGAACCCTAAGGATATG^ 

AGGAAGTACTCCTCAGTGTCTTGTATCGAATTCGGACATCCTGAACAAGAGTC 

ACTAAACATCAACTGGGGGAGCGGGAATATGGGAGCGAAGGAAGAAGAAGA 

AAGTAGAAGTTGCGCTACCTTACATTAAGAAAAGTGGAAAACTTATCGGACCTC^TAGAAGAC 

TACTAATCCTATTCTTAGAGAGGTTGTGGAAGATTTTGATTACGAGTCTT^ 

TTTCTTGATCAAAAAAATTGGAGACTTGGTATCATGAGCAGAGTTTCTTCTTCCATCGCCAACTT^ 

GTTTGGCAGTCTGGGTTGGGGATGAACATACCCCTAAATTCAGACTTAGTGW 

TGTAGGTTATTCCTCTGATGTTTTCTGGCATGAAACTCTTAGAAGCAAAATTGT^ 

GGGGAAGATT GGCCT AATAAAAAAC AAAC TGTAGTTGCT ATGTTT ATTTCAGGT ATTGTTTGTAT AACAGTAAC AGTT ATTTCT ATATTTTCAATTGTAA 

TATATTACAAAATAAAAATGCCTAAATTCTAATACAATGGTTTATTCATTCCTT^ 

CTTCCCTTGTTCCTTAAACAATATGTTTGATACATTTTTCACAATAAATT^ 

TC CACCACAGTTC^AAC ACC AGT AAAC ATGCTCTTTCTGATATCCAATGTTGAATACTTGGCAAATT CATGCCCTTGC ATTTC AGTCATGGCAAT GATT A 
TAGCACCATACATTTTACATTTTTTGAATAGAGGATATGTGGCAAAGTTGG^ 
CAGTGCTCCTCCAGTTCTCTTTACATGCAAAGAAAATTGTACCCTTTTCAT^ 
AGATTGTTACTAGGTGAAATTGTCTTTCTTGGTGCCAAAAGACTAATCCT 

GGAATTCACTAACACCAGAAGCTGCTGCTATTTTTGACATGAAC^TTAACATACAATATACATT 
ATCTACATGGATATTTTCTCCCTTCTCTTGAGCTGTGCATCTGGACGTTACAATAGACCCATGTTTCGA 

TTACC AGTCCT ACGC AAGTTGT GC TGC ATACTCTT ACATAC AACATCCATGT ATTTATTAACCTTATTTTGATT AGATGTGT CACT AACTTTTTTCAAAA 

TACCTTCAATGTTATTTGAAAACTCATCCGCGCTTGTTATAGGAAGACGCAAT^ 

CTCGTAACCCAGATTTGGTACATGAGTAGAATTGGTGGTAATTGATTTTGG^ 

GAAGAATTCAAAACGTTCAATTTTTCATACACTAAAAATATATAATGGACATAAAATTGG AACAC AATTGTAGCCTTGTTATG 

ACAACATTTCAATAGTGGACTCCTTTGGTGCCCTATTCACAAAAACTGGGTXSTATTTTC^ 

ACACAACTTTGAAGAGGGCTGTTTTACAATACCACTGGCTATAGAGTAGTGTGTGATT^ 

CATAATTTCTGATAATGTTTAGCCACATCTCCATTATTAACACCAAAGAAAAAGTCTCT^ 

CCTTAGCAGTGTTGGCCATCCAAGACGCATTAAACGATAAAGATTTTCCCGATCTACTTAG^ 

GTCATAGGCAGAATACATTGTTACTCCTCCATCTCCATTATAGTCATCATCCATTGCAGCGGAAGATTCTGCCACCTCTCTAGAGA 

TGGTTAATAAAGTCGCTCCCATTAAAATCAGa^TGCCTCCAATAGGCATAGTATAAGCCTC 

CCCGTTCAACAGAATGATTTTGAATAATGAACAAGTTTTGAGGATTACTATTGCGGGTTACAGCAACATACG 

AACAATAGTGTCTCTAAAGAAAGTGTGCCCTTGGCAAGAATATATCGTCATCGCTTG 

AATCCTCCATTCTTGATATTTGGATTACCGATAATTTGTCGTCCTTCTTTA^ 

ATTCACCATTCTTGTACATTAAATTAGTAACCACTCC^GTGTCC^ 

CACATTTTGGCCCTGATAAAGAGTAATAGATTTAGCCCTAGTTACCTTCTTTTTCTCATCCAAAA 

CTCAAAATTTTCCTGTCTGTAAACTTGTCAAAAAGTGGTGTACT 

CGTGGCTAATGTCAGACAGATCCATATTCAAAACTACAAATTTTACTTCCAC^ 

AGTAGAAGAGGAGGATGAGGAATTGTTATTGTTCTTTTTAGTTTCTTTGATGATAGAAGTATCCACTA 
TTAGTGAGTGTGATTGCACCCATCATCTTCTCAGTTTTGGAAGAAATTGCATCC 
CCTTAGGCTGCCCTTGTTCTTTTGTTATAAACTCTTCAATCTTCTTCCTCCTCTCATTTCC^ 
AGATTTTTCAGAGTATTTAAGTGAGTTATTGTGCCTATTGTTAATG 

TTAACACCCTCC^TCATTTGACCATTATTGAAAACATCTTCCTTTCTGTACATAAGAAATTCGTTA^ 
CTGGGTCAAAGTTGTCCATGAAAACACTCCCATTTCTGGATGATCTACCGGCCATTCGCGTA 

TGAAATACCGTACATTAACGAATAAGCGTGCTTAATAATGTCTATTCTCTTTAACGACGACATGATATATGTTCTCGA 
TATGCAGGTAATAACATACTCTTCCCTACATCTCCTCCTGTTGATATGGGCATAGATAATAC^ 
CTGCTCCTCCTCCTCCTCCTACTCCTACTCCTGTGCTTGTGGCCCATT^ 
GAGTTTTCCATTATCAAATTTAACCTGTTGCATTCTCACTAGTGTATACAAATACCACACA^ 

C ATTTTTTCC AC AC AAC ATG AAAGTCAACT AAATGACCTACACGATCT AGAGTAGATGC G TC ACGGTGC AATTTATC TGGTAT ATCAGTTCTATATATCA 

ATCGAAATAACAATAGTCCTCGTCGTCCTC^TAATAAGCTGCTGCTGCAGCTGTACGTGATGATGCACT 

ATAGATCTTATATTTTCAGATTTCGCTAGTAGTTCTTTGTATGTTAAAGTTTC^TTCTC 

TAGCGATAGGAGTTTCATCCACAAAGTCTTTTCCATCTCTAAAGAAGGCGTTCCTCACACACGC^ 

TGGT ACTGC ATCTATAC ACTTGGGATC AGAACCGTGCAC ATACTTGA.CGTTCAAT AT C TCTT CATCC ATT AAGGAGGTC AGATGTGT ATCCTTAAATCTC 

TCTGATAGGCTAGAAACAATATACAATTTTTCTCGGCCACTCTAATCAATT^ 

CAGTGTACTTTTCTAGATTTGCATTGCTCATTTTCTTGAGTTCAGATATGT^ 

AGATTTCATCATATCCAAAGAAGACTTCTTGTCTGATACAGAAGTTGTTCCACTTCTT 

AATCCATTCAAAGCTCCCGATGCTCTGTCCTTTTCAAGTGCATTTCTTT^ 

CATCAAATTCGTCACAATTTTCTTCTGCCGTGGATTGGGCCGGGCAAGAACTTGGTGCCGT^ 

TTTAATAGCACTTGAGACTAAATTGTTGTTAACTGACTTGACGTTATTACrrCTTCATTAAA^ 

CTTGGGTTGCCAAAAAAGTCTCTAGAGCAAAACATTTTCAGTATCTTTATAAAAACT 

GTTCGACTGACCCTTGGAAGAAGGTACTCTTTTCTTCCTCTTTCrrCGTCTCAGCTTCATCGTCAC 

TTCTTCGTTTTCCTCrCCCTCTTCTTCTTCACCACCTTTCTCTTTCT TC 

TCTTCCTCACTCGXSGTTCTTTATTTTCTTCTGGATATGGTTAGTGTTATCTTC 

CTCCTCCGAAAGGTATATCTGGCCTGTGTTTCGCTATACGTAAAACAGCATCAATAAACACCAATTCCCTGCA^ 

AAGGAACCTTAAATTTGCCAACACTTGTATCACTTTCCATTTTAGTTTATCGGCCACAAATTGGATTTTTT 

ACTGGAATCTCAAATCCTTTGTTGAGGGTAGAAAACACACTTTCTGGGTACATCAA 

TTGCAGAATTAGATTTGCAACACAGGAGCATGTCTGCAGATCCAGAATCAATCAAGCCGTGGTCGTTAAGAACTGCA 
TTTCCCAGCTCCAGGAAGACCACCCAGATGGAACACAGATATTCCTCCTCCAGCATAACTGACAGAA^ 

TGTTGC AAGGAAAC AGTGC TC AAT ACCTTTCTGGCCTCCT TGCACGT ATCTGXK3 AGCTTG AC CGC AGAAGC AGAGGAGGAGGAGGAAGAAGAAGACCC AG 

AAGGGGATGCCGCTGCAGCTGCAACTGCACTTCTGAGCACCTGGGTTCTGCTAGCCAGAAAGGGGTATGAT^ 

ATTATTGTTGTTGTTCTTCTTrTGTTTTTTGGTAACTGGCTCT^ 

ACGGGGGTAAAACCAATCCTTTTATACTAGTTCTACCTCCACGAGGTTGCCAG^ 

AGATTTTGTGTGTAACCCTTATCGTTACGATTGTTACAATATGTCTTATGGCATT^ 
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CGCTACCGTTGCAGCGGCAGGAGGGATATCTTCATCTGCTTTATTTGGTACCTACATAGGGTCC 
TTACCAGAAAAACAAAAGAAAAAGGCTGATAAGGAAAAGGCATCAAAAACCGATAACA^ 
AGAGCCCGCG<^X:CAGAAACAGGACTTTAAGAAGGAAGAGGAAATTATCCAAAAGAA 
TCTCAGAGGAGGAAGAGGATTAAAGCGAGCATCTAGAGGCAAAGGACTGCT^AGAGGGATTGGAAA^ 

AAAGGAT GTTTCTTTT AATT ATCATGTTGATT AT ATT AGCCCCATTAATATTTGT TGGGGTGAATTTCTTCGT ATACAT AAAGAAG AATAACTGGTTCT C 

TGGGGCCATTAGTAATGACACTACTATTAACTCCACAAATTCTGGAGGAGTAGGAGGATT^^ 

TCTAGGGTACJATATTCCACATCAACGAAAAATTTAGTCAATATAGCCATGGATGG^^ 

GGCCATATGAAACCTACAAAGACGTGGAGGATAGCCAGTTCTATTTACATTTTTTCCACGTTAGAAATTTTAAACCTTTAAACGGTGATG 
CCACCTTGAAAG^ATGAAAGTrTTGTCTTGATCGAATCACCATATTATAATG 

TCTACTGAAAAGCCGTATATTAACACGGAGATAACTTCCATCGTCAGCACCACTGGTACAGATGAAAGOTTCTTC 

ATGGTGAAGAAGGAGT T AC AGAAAAT AGGT AC TT T TT AC GCC AC ATGGCAAGT AATT AT G TTGT AAAGGCTAGT T TC AAAT C AGTC ATGCCCAC T AT TGA 

AATTAGTGATCTGACTGGCACATACAATAATAAGAATAGTGTTAAAATTAAGCCTGTAACTGGTACTT^ 

TAAAATACATTTTAAAGTATTATGTTTATTGTTTTATTTCTTCTAAAATTTTATTTTAGGTATCCT 

TTACCCTTCTTTGGTTTTTGGTTAGTTTTCTTGGCCCCACCATC 

CGCCCACTCCTGGTATTGATCCCAGCGC^GAAGCGATCGTGCCGGCC^ 

AGCAATAACAATGACAATTATTGTTATTAAAAGTAAACCAAAAAGTATATTT^ 

TTATTGTTATTATTGTTATTGTTATTATCCCCTTCCATGGTTGTTGCACATAAT^ 

AAGGCACCTTC^^GGCTGTGAAAATATTACATGCAGATAATGTTGATTCTGAAT^ 

GC^TATGGATAGTACAAATTGTCTTGAATTCCTCATCTTCTTTGTCCTCTO 

TATATAGTAATTATTCGGGTTTGAATACAAGTAGTTATGGAAATGGAGATGCGAACCACTA 

TCGTCATCCACTCTTCTCATTTTTAGGGTAATATCTCTC^ 

TTAAAACAATTTTATAATGATTTTTAAAGGGATAACTATAAAGAGACCCTT^ 

GATTGTGAACCXrTCTCAAAAGAGCTATCCATTTTTTTTCATCCCT 

CTCCTCCTCCCCCAGTCTTCTGCCTTTTATTATCGCTTCCAC 

AGCTATTATCAGAAAAATCACCAGGATCAAAACAGGGGCATTCGTATTAGACACT 

AATTTGGTTACAAATAGTTAAAAATATGTAACCATGCAGTACAATAAAAGGTGATC^ 

GCGTATTCTTTGTAACTATATCGGAATATACTTCATCCCATACAGCAGATGAACCTCTTTTTTCAGGCTTAATTGCGATATCC 

GTGGAAGATCTTTACAAAGCAAGATTTATCCGTTTGATTATTGAACAAAAAATCG^ 

GAAAAATCCCTCTTCCAGACTACAATATCTTGGG^ 

TGGTTTCATACCTCCCTTGGTCATCACATTTTACTCTAAAAATTGCCACTCTAAAATCACTCATGTG 

TATATC^TTTCAACGCTAGCTGGCGTTTTTAGTACGTCCCATGCAAACGCA^ 

GGGAAAAC^TCXTGCAGCCACCTCAGTTACCCAGGCACTAATGTTGTCTTCAGACATTGGTA 

TCCCTCTGTTGCCCTGCATCTTATTATTGAATACAGTCTCTCCGTAGGGTTTA^ 

ATCTCACGACCTCOTGTCCAAAACATTTGTCACCCTAGAGGAAATTTCCC 

GTGTACAATAAAAACAATTAATTTGAAACAAAAGGTAATAACAAAATAAAATACATAAGAAAG^ 

TATTTTTTGACGGACAAGTACATTAACATTATCAACATTCAACAGACCACT 

AACTTCCATACATCATTCTTCTTTACATGAATCAAGAAAGTATTCC^AATCA 

TATTAAATTTATCCCATGAACAGTATTCCTGGAGAGAAATGAAGGGGTCTTTCCCGTC 

GAGTCGAATTAGATTGACTGTCGTCTTTTCTTTTGTCTC^ 

GGCGGTGAAGAAGGAGAAGTGACCTGAAGAGGCGCTGGTCTCTTATTATTATT^ 

AGAAATAGTCGATAATGTAAAACTCTTCACCCTCTCC<^GTTCTTC^XACATCACATCACCAGATGT 

TGCTCCACATTCAATCAAATCCATCAGC^GTTTCTTCCCTCCTCTCCCCATTGCGACGAGT^ 

GGCTTGTCCGGAC^IAAGAGTAGACATCATGTACTTGGATTTGCAGCAATCACATTCTACACATACAGCAACTGG 

GCTTGTAGAGAAGACAAGTCTTCCCATACAAAATCCCCACAGCCTTCGCTGCAGCTGAAACAGCAAT^ 

AGCTCCAATGATATCACTAGCCATCTCCTTTACTTTGTCATCAATTTTCTTGGCA^ 

TCTCTGGTCAATTTATTGTCCCAGTGATAGACCATTGGTTTGCCCATCCTGTAG1*TTCCC 

TCTT T AGAG ATTTT ACACC ACATCTTTTGXAT AT ATATTTCAAGTCCTTCTCTGAT AGGTCAAGT AT TTCTCCCC AAT 

CTTTGAGTCGCCAGCCTTTGTCAAGATTCTCAGAGCCAAAAAGTGTTCTA 

AAGCACCTCAAAAATACTACACCGGAAAGATTATTCTCGATCTCTCCATCGGCAGTTCCTTCAAGAATAGAGGGCAAAATAATATCC 

AACTGCTATTCTGCTCTTGGTAATACTTTGTGAATGCCATTTTTGCAATAGAGAG 

AGAGGCGGCTGCACACACAGATTTGATTGCTG£ATAACCTCCTTCTGGCAGACTCAGACGTACA 

AAGTCGATAAACATCArTTCTGCCACATCTTCTCGTCCTCCAAATAGATCGCTGGCQAATTCTTCTC 

AGACTGGGATGGCAAAATATCTGTTGGAGGTTTTTGACGTCTCCAAAAGCACATCTTCAACAT^ 

GAAATCAAAGCACTGTCCAAACTCGAACCTCTTGTATTCGTCCAG7GATAATAATCCTCCTCCTGTATATGAATT 

ATTTTGCCAGCTT<rrPCCAAGGCATCATTGGGCACATAGAACCACATAAACATGGAGG^ 

CATCATAGCTAGCAGATTTCATGTGCTTTCCrrTCTTTTAGGAAGGCATATTCrGCCTTGTTTCCGCA 

CCTCGAGCAC^CTCCTGCATTATTGCCTGTAACAAAGGCATAAATAGCCCCJUVTTTCATCAGGCACAGCACT 

ATCATCATTTTGAAGAGTTTTTCAGCAAACATCTTGGCCCAGTGTACGCGAGCATACTTGCGAGACAACTTGG^ 

CAACACTTGCCATCAATTTTGTACAACCCTTACAATCACAGAGGTTTGTATGG^ 

TGCCAGAGTAGAGGCGGCATTTCCTCCCCCTCTAATTGACTGGGCCCTTTGAGTC 

TGTTGTT GTTGTTGCTGCTGTGGATT AAAGGTTTGCTT AG AGCAAC AAATiGGCACT ATCAT ATGCAC AAT TCAAG AACACGCT ACGCTTGTGGTTGTT AT 

ATCTTCCTCCTCCCTGTACAACTCTTCCAAAATCTAAAGGTGCCATATTTAAAAC^ 

TCCTGCAGGCTCTGATCTTCCTTCACTTTC^TAAAAACATTCAATTC^ 

TTCTTGAAGAGGGAAATAGATTGCCTCATAATAAAGGTTGCGATATTCTTGCGCACATTGC^^ 

TGTTGTCGTATCCAGAAAAATTGATCACATCTTCTTGGCJVCAATTCACTCTG 

ATTCTCCACTAGCGCCTTCCTCATGTCAAGAATAGTCAAGGGTTTTTGTCCCC^ 

GAGTTGGCACATAATTTTCCAGTGTCACTATTGGCAATACGCACATCCTCATCGGTGAACACAAAA^ 

CATGCATTTTCTT^AATGCAGGAATCATGTTGTTAAACACGTACATCAGGGAAGAATGAAGAGCTTCACTGCCACCGCCG 

CTTGCGCATATGCTTCACAAAATCTCCCCCTTCCCACTCTGAACAGTCTGG 

TAGTAAACTATAGCATTTTGAATGCCTGGAGCAATCACATTGAGTGTAGCCATGATGCCAGAACJ^CTATCAGCC^ 

AACCACTAATATCCATCAACGAGATACTTAGACCAGTAGGTTTTGCAGTTGGJ^^ 

AAGTTCAATCTTGGAGGAAGGTGCATTTTCCATCAAAGAGTTAAAAAAAGATGCAGAT^ 

CTTGAACGGATAC^GCCCAACATTGTAGTATATCTTTATGGTAGAAGTTGATGTTGACGGTAGGAAACGATCTCGACA^ 

CAGGCCGCTTTTATACCCCTTTCGTGTGACGTCAATTGACGCTCGTACGTATAAACAGGGTCTATTTTTA 

ATGGTGCAATTTCCTATTACTCAAACTAGTC^TATCCGGCTAAGACAGGATGT^ 

AGCGGTCCACCCOTC^ACTTGACATCAGGCCGACCCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAATTTTTCAA 

GAGGGTGAAAAGTGGGTACGTACTATACACTCAGAGGGTGGCAGATCGGCCCTGTT^ 

ACGTTTCTAACCATTTCTGGAACAGTCATTTCTGGAAAG^GTTA 
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TCTGACATTGGGTCGACCCAGCGGTCC^CCCTTCGAACTTrGACATC 

AAAGTTTTTGAAAC^AGGATGAGG<3TGAAAACACCTGTTAGAAAGTGGT^ 

ATCAGATTCCAGAAACGTCTATAACCATTTCTGGWVCAGTCATTTCTGGAAAGGGT^ 

GTATCTTCGTGCAATCTGACATTGGGCCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTA 
CAAAAAATTTTTC^AAAGTTTTTGAATCGAGGATGAGGGTGAAAACCTGCTAAC^^ 

AAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGATTCTAACCArTTCTGGAACAGTCATTTCrrGGAA^ 
GTACAATAACTTACCATACATTTCTGGT^ 

GTCCACCCCCTAAACTGGAGTGAGCTGAAAAAATTTTTCAAAAGTTTTTGAATCGAGTAAGAGGGTGAAAACACCCTCTTGAAAGGCACGAT^ 

TGTCGTCCGTTCACTGCTGTTCCAGAAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGTTTCTAACCATTTCTGGAACAC 

AAAGAGTTACAATTTCTT AG ATTTGGT ACAAT AACTT AC CAT ACATTTCTGGCAC CrTCGTGCAATC TG ACATTGGGTCGACCC AGCGGTCCACCCTTCG 

AACTTGACATCAGGCCGACCCAGCGCTCCACCCTCTAAACTGGAGTGAGCTCAAAAAATTTTTCAAAAG 

GTAGGAGAAGGTACGCACGCATACTGGCGGCACATGCCTCCAGAAATGGCTGTTCCAGAAATCTAGGTCGACCAGA^ 

CTGGAACAGCCATTTCTGGAAAGTGTTATAT^TTCTCGCATCTTGTACATT^ 

CCCAGAGGTCCACCCTCGGAACTTG&CATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGCGACCCTAAAAAATTTTTCAAA^ 
AGGAAGAGGGTGAAAACCCTTGCAACAGAGGGGGTATAAAGGGAGGTACCCT^ 

ATCACCCCATCTCrAAATAATCCATCATGTATATCTTCGTCGAAGGTTCCCCCCTCACAGGGAAGAGTT:CATGGATGTCCAA 

ATGTGGAATGTCTTTCCTCAATTTTCTTCGTATGAACACTTCTGACTACTA(^ 

GAAACCAGAGTGGTGGATGGAATGTTTGAACCTGTCCTAAAGACCTTTGTTCGACTCGTG^ 

ACTACAACGGCCAAGTCATGGAGATCTACATCGCAGAATGGTTGAGACAAAGGCCACTAGCCTTCCACGTGTTTAC^ 

TGGATTCTTGAACGAGGAGGATCTAGATATGGATACTGCAACCAAGTGGATGGCTGAAATTATTAGAGAGAAGAGGGGCAATATTCAAGA 

ACCCCTAGAGTAGTCTTCAATGGCAATGTTTGTAGTGCATGTTTCTCTAACACTAAGAGAAACTTGTATAACTTTGGA^ 

ATTGTGATTTGTTGTGCCCTTTTGCAAGGCATAGGATTGTACATTTCTTATAAAAAATAAAACAATATATAAAATTCAGTTGTA'r^ 

A\AGT/TACTACACCCAATTCTCCCCCCTCTCTAGTGAGAGATCCCAATCACTTTCTACTGTCACTGTTG^ 

TCTTCCTCTTCATCTTCATCTTCTTTAGTTCGTTCTTCATTATATGCTTTAATTATCCTCAATACATTTAAAA 

GATTCTTTCCCTCACCACCAACATCGACCCTACGTCCATCCTGGTATACTGTTTCACCATCA 

TACAACAATGTCCACTAATGAT^CTGGTTCTG^TGCTAACTGATCATCGCCAATAAA 

AATTTGATCATATAAAGrATTGAAGATGATAGGGTAAGATGTTGGCCCGTAGGCAGACATGATCAAACGG^ 
AATAATTTGAACACATGAATCTGATTCCTCTTTCTGTACCAGAATATAGGGGTGTGTCT^ 

TCTAGTTTTTCTTCTAGGTATTTCCTCCTTGCGGCGTC^GGT^ 
CGGCCGAATGTGTAACAACTGCATTCAAAAATAATGCCATGCA 

ACTCACT GG AACAACT AAGCATTGGTC ATG ACAACTACC C ACT ATAG GATTTTCT ATGCGG AGGC CTTTGT AGG AT AATCGATGTGCTAXATCGT GT AAA 
GGTACCATTGATTCGCCCCCTAAAATACTACCCATACTTAACAGTTTCTTTTCGACTTTGTTGTTGGTTAT 

CAATTAT GT ACTTGCAGTT GTG AATTGCTGCATTGAAT AGTCT ATCAGTGAG AATTCTAT ATAG AGGGGAGJU^GTGAAAGACTTGT CATGGGGT AAAGC 

TTCTCTTTTCACTATATTGTTGATATTCCTCAAATCTTCTGGTAGTCTTTTGAGCCA^ 

ATTTCTGAATCATGTTGAAGAAATAATGACGTTTWPCCAAGCAGGGGTTGCCTGCTGAGAGGC 

GAGT ATT TT T ATGC AAAGAGAC G AT AT AT ACACCTTC AGC GAG TTCGC TTCTCCGT AT AC TT TT G AT AG ACTCGAT AATGGACGGT AT TCTTCT AT AGTT 
TTTAACGATTCCCI^GATAGCAAACCCGGACGCCATCTTTCATGACTCATTT^ 
ACGCACGTAAATACCAGCCATTGACGTCATATCCGCCTAGTTTTTATCCTCAACTTCTGAGAAT^ 
GTCAATGGAAATGCCGATTGACGTCATGTTAGGGGTATAAAACGACTGGCTCGAGCCCTCGGATTCACACTT^ 

(^GTGAAGAAAATACTCCTC^CTTCCAACAAGAATGGGCTCTGCTAACTTGAACGACCGAACTCTACTCGTGGAGGAAGGGCAAGTA 
TCTACCAACAAGAAACAACC^GTGTGCTGTCACATTTGCTATCTAGGA 

TGCTGGTGACTGTCATGACTTTGTTGATTGTGGTACTGGGAACTGCACCACITAATTGTGATGTGA^ 

GGTGCAGTTTCATCC TTACCATTCTTCTTCCACAACC ACC ACT ACTTC CACT AC TACTAC T ACT ACACCAACTCCACCAGAT ACT AAAAAAGTTG AC GAC 

GACTATGATGACGACGTCAATATTGGAGGGCAATCAGTTACTGTGAATAATGX^GGTGTTTTCATCAATGGAA 

CAATGGGTATCAATACAGATAATGGAGGATTTGTTTTGGAJVGAATGGTGTTTT^ 

TTTATTGTTTCTTT AAJU^AAT AAAAAAAAGAGTAACTCCGTCT AC AT AGTTTTTTTT AAATT GG AAGAT AT AAAT AGAATGAGTAAAAGGTACTGTGCC A 
TTACATGTGGAGATCTTGTATTAGCAACATTAGAGAAATGGGGGACAATAAAGATTATGA 

ATTG AAAAT AAAAC T GAAT T ATGTG AAAAA ATC AATG AGC AG AT G AAAAAT ACACAACT AAAAT ATGAT AAATGT TTCGT AG AGGAGGAGACGG AAAAAT 
TCCGGAAGATGGAGGAAAGAGTTAAATACCTCAAAGAG^AGGGAATCCCT 

CAAAGAGTTAGATGCCCTTCTTGAGGAAAATGAACGTATAATAAAGCTCJVTTGATGAAGAGTTGGA 

TC^GTTGGTGTTATAAATATATCCCTGCAAGGAGATACGGTCTCAAAAAATCCATGACTTGTCAGTATC 

ATAGATGAGCAGACAGGAGAGATGCAAAAAAAATATGATAAA7TATTTGAAGATGATAAAAGGTTCCG 

AGAAGGGAAACCCTCTAGACCCAGAAGAAAGACTTGTATTGTCGGCTGATATTGATAGGAGTATGAAAGAG^ 

TATAGAATTATCCATTGATACATTATTGGATGAATGTGAAAACTTGCATTATCGTCT^ 

GTACGTATGCAATTAGGTCCATATTGGTTTGCAAATAAATAATCCACACAAA^ 

CTC^CATAGAGTGTATAAAAGGAATTGGATGTTCCTATGTCCGCATCACTTCTAACTGAGACATCGTTGGAAGAGGAACAGAGAAGAGAGGTTAACAACA 

ACACCAGTAAGGAAAATGATACAATGGAAAAAAAGACTGAGACGGCTGCAACAACAGAAAAAGACCCAGAACCGTCTGTCAGTAAAAG^ 

AGAACCCAAAACAACTTCTACTGTTTACACTTCTGTAAAGTGTTACCTTTCrrTCCATAATCA 

GAAAGGTTTGAG^GAGGTGTAAATCCGTAAGCAAGATGATGGTCAAAGGTTCACTGTTTTTGAGGTTA 

ATCTAGAAGACGAAATCGATAAATG^CCAGATATGACGAAGGATAACTTTTACGTCCAAT^GTTGAGGA 

ATCTACACATCCTGTT^TAGAAGATGTTTGGAATTCCCCCATCGTCCAAGAAACATTCCTATCCCAGCAAGGAGAAG 

CTCATGGATTTCAATACCATCACCTACGCCG^CAAACAACTAAAAACTTGCTTCGAAACAA 

TATCTGGATGGTTAGCTGAAAACGGGrTTCGATAAAAAGTATACGAAACTCGrTACAACACTG^AT 

TGGTGATTTGGAAAGGGTAAAAGAAGGAACGAAAAATTTCGTGAC TCTTC ATAGG AAACATT T ATGTGTT ATTA£TG AT AAG AAGAATGGTACAATTTCC 
TATTCACCTGAAGAGAAATATCCGATACCCTCAATACTAAATTATTACAAGTT^CTACAAACAGAGTATCCACAAAACAAGAA 

TTGTCCC AAAACAC AAACT AAAGAT AC ACT ATTGTAC GTTTG AC C AAACGACCATTC AAGGAATTTGT AAAGATTTGGG AGTGTGGAAGGAT ATGGAAG A 

ACGACACAAAO\ATCAGAAGATATACTTTACAAGCAAGGATGGTACCTATTATTCGACGTTAAAAAGATTAAGAAATTGCGTCC 

TCTATCCAGACGGACG^CGAAGGCGTCTCTGTACTATTTTCCAGAGAAGTGGAAGAAGTAGAGACTGTTTCCAAGAAAAGTAAGA^ 

GAGGAGATG AGG AT AGG AG AAATT ACC CGCCCACTAATGC CAAGT ACGT AGTGGGTGT AGATCC GGGAAGAACT AATGTCGT TTC CTGTTCGGT ATTTG A 
TACC CGTCAAAAAAGGGT AGTGAG AAAAC ACAGAATG AC TGCGAAAC AATACT AT C AAGAATCT TGGATG ACAGATAGAAGAAAGGC AAACGAAACGT AC 
AAGAAGAACAATAAAGAGTACAAAGAGGCGTTAGAGGAAATAACTAGGTACGATAATGGCGAAGAAATTATAAATO 
CCACTAAAAAATTCGAAGCTTACTTGAAGGTAGTGAACGAGCACTACAGGTTACTGTGGAACGAAAAGGG 
AGTATACTCTAGAAAACAAAAGTGCATATCTAACTTTATAGATGAATTAATCCCTAA 

AAATTTGCCTGC AC GGG AAG AGGT GAGCAAT ACGCATC ACCTGCC AGGATTTTC GCC AAG AAGAT AAAGGAAAGAGT CGGAGGTGAT AAG AGGTTT ACTT 
TCGTGGACGAGAAATATACGTCAAAAGTATGCCATCGCTGCAATCAACCTTTAAATATCCTGGAGAAGGAT^ 
GCCGACAATAGTAACAACCACAACAACAACAACAACAGAAGAAGACGAAGAAAATGGAAAATGGAAGAAGGCTA^ 
AGAAGATGCTCGTCCGAAAAGACGCAATTCGGTTACAGTTCAAACCGAAAAGTA 
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CCTCTTCCTTTTGTACTCCTACTTCCATTACATGTGTC^ 

AGGGTTTTGGGATAAAAAGTTAATGGAAi^GAAAGACAAGATGCCGTTGAAGTATCACTTTATTCGAGTTGCCTGAGAAT 

CACTAGTGAAATAGTGGCGTAGCCCTGAAGGATGAGAAGTATGCTATTTTCGAC^ 

TCAACAACATAAAAATGGTACAATATTCTAGGTGTGTTTTACTATTCTT^^ 

TCAACAAATGGGGATATTTCTGCCTTCACCTTTTGCACAACAGACTCT^^ 

TCATATCGTTGTAGAATCTCGAACCCTCGACACGTGATTTTTCACGGGCAAGGAAAGCA^ 

TATCTGCAAATCACTGAACGTGCGGTCTGTAATGGTCATTTCGTTGTGCCTGAGAAGT^ 

AAAAGATATACCATTTCTTTATAAATTAGGCGAGAGGCTATTTCTTTCCTCGCTTCCGTTGCGGCTrTTT 

CTTCGGCAAGTCTTTCTCTCTCCATTCTGGCGACTTCGATTGATGAGGCGAGGATCT^ 

ATCGACTCTG<XK»GCTTACTCGGCGGGTGTGTTTCTACTGATTCTTCT 

GGAGAAACTTCTTTTGCTTTTGCTGCTGCTGCAGCTTTTGCTTCCATAGACCTGGTGGA 

TTAGTGTTCAAAGAGACTGAACAAGTGTGTGCTCAGGTGGGATCCTGCA 

CCAACTAGTACCTTTGTCACCTACACCAGAAATCCACACATGCAGAAGGTTTGATGCTAGT^ 

GGTGGTATAAATATAGCCCACAGGAGGAGAGCCTGTGTCATTCTGTGTGTATGTCTTGTACTGAACATTACTGAAATGG^ 
. GTGGTATTGAAATTGTACATAAAAATGGTGCACCAAAAAGGTCT^ 
GATCG AAGAGTT ACATCACAAAAC AAATAAGCAATTTG AACAGGCT C AAAAGGTCCTCGAC AAAAACGAAGAjGCGAAAGAAGCAT CAGCAACAGCAACAA 
ATAATAATTCCTCTAGACCCAGAAGAAAGACGTGCAATATTGGCTGAAATCGATAAACACA 
TAGGTTTACTTT<^AGATGCAGAAATC^TAACTTGGAAGAAAAGGAGGTTGAAGA 
AAAAATACATTATArGATTTTTTGGATTAATAAAAACAAGTGGAATAATTTTT^ 

ATCT ATTGTAACTTCAATAGAAGGGGTTATATCTTCCTT C ACTTT CTGT ACTAC AGCCT GT AC AAAATT ATCACACTTGTGGAGCAACTGGTTAACTGT A 
AGAGATCCAATCTCCATGTCGCCAAACTCTTTTGGTGGGGGTGTATTACCCCACCTAGAA^ 

CTCC AAAACTAAxAATTGAC AGT ATTTTGT ACCTC AAATT C ATGCT C AT AGTGTG AAACT GCC ATCTCATGTTCCCTT AGGGGGTTTTTCAGTTTGATGT T 
TTTGGTAGTATTGTCCAGTTGCACCAAAAGATTTTCCATTTTCTCTGCAGTGTCGT^ 

TATTCCTCTTCTTCACCATCGTCTTCTTCTTCTTTACATGGGAGGGAGAGGTGATGTGTTTCTTTGTGTTCT 

GCTTGTTACTTGACGAGTCCTCTACCAGTTCTTCTTGTTCCATTTTTACTTCCACrTTCACCTTTACTTCTTCTG 

AAGAGTAGAGGTGGCTGCrTACTGCAGATGAAGATGTTGTTGATGTTCTAGAAGAGACCATTTTGGGGGTTATTTTTGGAACT 

TAGTAGTATTAGACTCTTCATTGTTTGACTTCTTTGTTACTAGTTTGCGAAATACGGGCA^ 

CGCAGATAATGTTAGATTTGATGAAAATCCTGGATCTATAAAAAAGAAATCAAAACACTTAAACG 

AATAATAGTAGATTACCACAAAAATTTGTTAATGCTACAAATAGATTAATATACATAA 

TAAACCCACGGAAATTATTATTAGATGCAAAAGGGACCCAGGAAA 

CGATGGTTTAATGAATACTTTCGGTTTACTAGAATTTGTGACACAAAATTTCATTAACCAGTTAGAAATGT 

TATCATGCAGAATATCTTATGAATACTTTAAGGAATCTCAGAGAAATTGAA^ 

GAGGTTTCAGTTAArAAAGGACTGTTTTGTTTTTTTTTATTTTTTGTCCCAAACCCATTT 

GCCTTGTTCAAAAGGACTACATATTCATTCCCTTCATTTTCTTCTCTTTTCCTTTTCTC 

AAGTTTCTCCTCCACTTCCTCCATCTCCACGTTCTTCATTATACATGGAAATT^ 

CCGTTCGTCTTCGG^CAAGTTTTCTTCCCTTTTGAGTAACTCTTTGTATTCCT 

TTTCCATTATCCATGAAAATTTTGATGTCTGTTCCTGTTACCTCTCTGACCATT^ 

TGCCACTGGTGCGGGAGAAAAGTCTCGTCTTGAAGTGGTTTCGTACACTGTC 

GGGCCTACGTTCTCCCACATCTCTCTTTTTATATTCAGGTGAAGAGACATACCTGATTCCTTCTTCTGTTGCAG 
CTGCATAGCAAG^GAAATCTCGGTCAGAAAAAGCAAACACTAAGCAGGCTCCACCATCATAGCGGGATG 

TC AAAG AAATCCTTTGTTC TTCAGAGACCGCAC AACGTTTCCTC AATTCCTTCAGTTCTTCTTC C TTCATATTT ACTTCATT CTCTGCAAACTTTATTTC 
TCTTCTGAAATTATCCATCTGTAAATTATCCCTTTCATTT 

GGGTGTGTACTGTTGTAGAAAATCAAACCTAGTTTATCCATTGGAACAATTAGGCACATGTTGGAGTA^ 

CTTTGT ATGTTAACC TATC ACAT AT ATCATAC ACTGATATGCGGG AAGAAAAGC C AC ATTGTTCC AC CAATAATTGTTCCAT GTT ACGAAGTGTTCT AGG 

GCAATGGCTCGGCGTGTAGCCACACCCTACTAAAAAGTCAGTGGTTATTA^ 

AGCCTGTAGAGAGGAGAAAATTTAAAGTTTTTGTCACTAAGAAGGGCCTTTTCT 

TCCACTTTTCCCCATAATCCTCACACCCATCATCTTGCAATAAACTCCCTAACCTGGGGAGA 

GCCTGTCAATTGTCTCAGTCTATGCACTCCGTTTTTTATGTGAAGTACTTCCTTATCTTTTAGACGGACT 

GTAACCCTATCAACAGATTTGAATGATAATAGAATATGTTCGTATTTGTCAGGGTTTTCACCAGAAATA 

ATTCTTCCATTTTGATATACTTAAATGTGTCATACrrCGGGTGTCACAGATAA 

CGCAAATTGCCACGTTGCTTGAATTACTTAATCTGTAGATGAAATCGGCAAAAAATTACGTCTTTA 

GTACACAAAATGTGGCACTTTTCAAGCQVCX^CCTCTACATCCCAGTGAGGCGGTGGAGGTGGACGTCATATTTAACTGGAATATGT 
CCAATTCTTAAAAAATCAATAGACCAATAAAACAACAAAAAACATTTTGTAATTTATTTATTCAA 
CTTACATTTCATATCACAGTGTACAATACTATTCGTGCATTTACCAACACGGTACACGTA^ 
TTGTACACATTTCTGGGGATAACTTCAATCTTTACAAACAACGCTCTCTC 

TAATTGGTTCCATTTTCTTGAC^TCTTCTTCXCTGTAAACCATCACATGGAAAACTGGCTTGTATT^ 
TCCATACATCTCCAAATACTCCCTCAATCCAATTTTTGATTCTGCAAACTXn^ 

. AACATTCCATCCACGAGCCTGC^CGTTCCTGGTAATCGGATGCACGCAAATGCTCTTGAArGGTCC^^ 
TGTACATGAAGTTGAGGAAAGACPGTTTGCCCTCTCCGGCAGTTCTCATGTTATC^ 
AATCATGTTGAAGGGTAAGACTTGTCTAACTCTCTCGAGCGAGCTCTCTCTCTTTTATAGA 

GC ATT AATAATATATAATTTTT AT ATTTTGT ATC ATTC AACCGT AGTCC ATCTAAG AGTC AAC ATGTCC TCCTCCTCGTCGTCGTCGTTCTCGTTCCGCA 
TCTCCACCTACCAGACCTTTCTCAAGGCTCTGGCTCACCCAGATCTGGTGGACAAGAT^ 
CATCCAGTTTCTGGCCGACATCTCGCACCTGATCCAAGGAGAAAGAAATGGAGXSAAATCTGTTC 
CCAAGAATAGTGGGAAGTCrrCACGGGAGAACATTGGACAATGACATTGAAGAATCATACTGTTATTTTGTCAAGG^ 

AACTGATGTGGATTTGCTGTACATTTTCGGTACTrTGGTAGTTCTTCCCCCCAGATCTAAAGCGTACCGAGTCA 
TTCAATGAATTCAGTAACAACTGGCCTCCTACAAATATCAAAGGAGCATAC 

tagaaggagaagttgggggagaaagtgaatgggaatccatacacgcatctgtcgtcaagcgaatg^ 

tccatcaacatctagaatttttagagtgtatgtggx;tgaaccagttaatgatgcagtgacaaagatccctata^ 

agactcgccggtattctccagaaagtgtactcrrtattcaatgctaaatcttccatatctcctgtcttcaaattctatagacatc 

GAATTACATTATCTATACCATCTGCAAGAAAATTGGGATTCTATTTACTCCAAAAGGATACAACATTACAATCTTCCCTAT 

C^TAGTTTCAATCAACGCTGGTATTATTGGTGATGATTTCTCTGAAAAAATACGACAGTGCATTGAGGAGAAAAA 

TTTTGTGAAATTGACAAGACGCCCGATTTTTCTTATAGTGAACATGTGGC^ 

AGTGTTGTGGAGCAAAGATTTGTTCCGAATGTATATTCCCGTACATCATTTCCCTGTAT^ 

GTTCCAGTGXXCTGGTTGTAAAAGTGGCATGCTCAATCTAAAGGGAAGATGTTACGA 

ACTCATTGTTCTTCTCTCTTTGATGCTACCATAAATCGCGCAGAGGCTTGTTTTTAC^ 

TTGCGCATGGAGCTAAAGACATTCCCCATGTTTACAATAAGGTAGTAAAGAATGTAAAAGATTTGGATAGACTOTGTGCCT 
ATCTCCTGTCGTATGTGATGAGCCGPATGAAAGTACGGACTATGAAATGGTG<^TGTAACTCCCCCTCT^ 
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GAGTATGATGATGGTCCCGGAAATCATATGTGGeCAGCAAAATTTACCTGTAACTT^^ 
GAGATGCTGTAACTTTTCTAGGAAGAGCACCGAGGAAGAAJUITGGC^GGATGGGATGATCAATCGG^ 
GAGAAAGAGTGGC^AATTGCCCAAAAATATGTTTGATTTACTAGAAGGGGTAAATGCCGTACTTTATAGAGGCGAC^ 
TACCCCTGTGTTACTGGTAGATCCATGAGCCCTAGTCTGGAACTCGTTAAAAGAAAG 

GGGTGCGTCCAGACGCATCTAAAAAGTTACTTGAATGGGCAGAACTATTAGTCAAAAGTTATCTCATGGAAGTTTTA 
ACACCGCGCCCATTCATTTGTAGGCAAAACTCTCCTCATTACTGACGAATTGCTTCACATGCGTCCAGATGA 
CTAAATGCGGCCAGACAGAATGCGGCTGCTGCAGCCTCArTTTCTGGGTCGCTCCCCAAACCTGAA 
TGTATGAAAAGGACAATGATGATGTTAGAGTTGTAAATTGTCCTTCATGTAAAAAGGCTATCC^ 
TGGAACAAACATGTGCTGGATATGTGAAGAGAAGGTTTCTCCTGCTGATTCT^ 
GTTAAATATGCCTTAGAAAGTATGTACGGGTTTGAGATTTGTACCATGAAAA^ 
• TC TTT GATGT AC AAG AAAT GGTTGCT AAGAAAT AAT AAAACACTTGT GAAATGTT AT AAC TTGTTTT ATTCT C AACAAACTTT AATTTC T 

TTGCATGCCTTGTTCAAAAGGGCAACATAATCGCCTTCTTTCTCCTCCCT^ 

GTTCCTCGTTGT ACGTCTC AAT TGCCTT AACTGTGTTGAAAATC ATGCGACGCT C TCGGCGCAGACG^TTCCTTTCCTCCTCTGGTAATtj'r'r'l' C ' r' rCCTT 

TTTTATTATGTCCTTGCCTTCTTTAGAAGAGGGGTCAATCAOT 

GATCCAGTAACCTCTCTGACCATCTCCATCAACAACCTraCATTC 

TAAAATGGTTTCGTATACTATCTCTGATGCATATAGGTGCATCTGAG^ 

GTAGTCGTCCGAAGAGACGTATCTGATTCCTTCTTCTGTGGCAGAGTAAAAGGAACCATTC 

AAGCCAAAGACCAGACAGGCACCGCCGTCATATCTGGATGATGCAATTGAGGACT^ 

CATATACCTTTTTGGATTCTTTAAGTTTTTCTTCCA^ 

ATTATCTAATTTTTGATTACTCTTTTCTAGCGTGTATACAACTGCAGCATTGAAAAGGCATGACATGTA^ 
CTGTTGT AG AAAAGT AATCC AAGT TT ATCCATTGGTACAATT AG<5CACATATTTGAAAAACTGCC AACT 
TTAACCTATCACAGATATCATACACTGATACACGAGAAGAGGTACCACCGTGTTGCACTAACACTTGT^ 

AGGAGAAAACCCACAC C CTACT AAGAAGTCTGTTGTG ATT ATCAT AT AATCACATTTCTT AACGGC C GC ATTTGAAAGACGTTC ATGT ATGATTCTGTAT 
AGGGGTGAAAATTTAAAGTTCTTGTCATTAACTAGAGCCTTTTCTTC 

TTCC AT AAT T AGGGTCTTCATCTTCTAAGAAACT ACC TTGTTTTGGGAGACTGGATGTGTAT TTT AGTCC AATCTGAAGC GT AGGAT CTCCTGTT AATTG 
TCTCAATCTTTCGTTACCGTTCTTGATGTGGAGTACTTCCTTGTCTCTAAGACGAACTATAAAAT^ 
ACTGATTTGAATGTTAATAGTACCCGTTCATAATTI"rCGGTATTTTCCCCAGACAAACAAATACC 
TTATACCCTGTAACCCTAACAATGGTGTGCCAGTATAATAAATCTAC 

TTTCCTATTCATTATTTCCATTACAGGAAAAGATGATAAAATGAAGACGTGCGATGAGAT^ 
ACATATTTTTTAGCGTCAACACTCATCTTCJm3CAAGACTTTTTG 

GACATTAAATTGGTTTGCTCACCCCCCCAACATCGTTTTCATCAACTCTTTCCTCACCTCTATATAA 

ACTTGTACCCTCCGTCGAGATGACATCTCCAGCTCCATCACCCTCTTCCACCCXrCAAATCCAGTTGTACCA 

GACAACAACAAGGAAGTGGTCATCTACGACACCAATTCCAAATTCAAGTGTGAACCCAA 

ATGTTGTTACCCAGArATCCCCCGACCAGATATTTGTGGGAACATATATGGTCAAATATAACTGGTCTAAATCTG^ 

T AAC AACTGTCTGGAC AAT ATT AC ACGCCCTTCAGAAGT GATTGAAAGTGTGATAAAGAAAACGTC C AGC GACTTTAAAATG AAGTACAC ACGTTCCTTG 

ATGGACCACACCGAGAAATACTATTTTTCTGGTGACCAAAAATTGAGCAAAATTAGTAGTTGGTGTACAACC 

TCTAGAAAACTTTGTCAAGGAAGAACATGAAACTGTCGTGGTGCACAACCCTTCTGGAATG^ 

GAAGTGCACAATGAGATGGACGCCCTAATTTTTATGGCGGCYTTCTTGAAGCA^ 

TTGATTATGCGGOTGCTTTTCTGGACGAAAGATGGTGCCACCACGAGAAGAGTTTTTCT 

GAGAAAAATCATGCAAGCCCTGGACAATAACTACAACAACAAGAATAAGAAGAGGAAGAATGTTGGTGGAGCACCTG 

GGAGAGGG AGGAAAGG AAGCCC TAGAAGCTAGTTTCGATGTGATTGGGGGAACAAGAGGAGGAAGATTTGGT GTTGATTC AACACCATGCCCCCATTCT T 

CAGCCATGCAACTAAAACTGGACAATGAAGGAAACTATGGATGTATTGCCTGCTTCGCATCAATGTTCTT 

CTTCATATCAACGGATGCCTCTAAAATTGGACAAGCGCAAGCATGGATAGATGAACGACTACGAAACA^ 

AAAAAGACCTTCCATATGCTGGCTGATATTACCCAAAAGGCTCATGAAACTGCCTATTCCAATACCATCCCACTT^ 

GGCCTACTCACACTGTGGAACCTATTGCCCATGAATTTGTTACCCATTCTCTAGTAAAC^ 

TTTT GAT AT C TTGTACAACTTGCTTAATCCATTTCGAAAAATGTTGCTA G TGTTT AT TC AAAATTGTC AC ATTTT AACTGGACAT AAAAACAATGAAAAT 
GTGGTGCCTCGAG<nTCTGCTTCTGGGAAGTGGTGGACTACTAATTTTGTGGGTGTGAACAT 

ATAGAAAAATATCCGATTTGGCCTGTATGGAAACTCTCCCTCGTCTACCTAATCCAGGAAGCACTACCGTCGATGACAGA^ 
TAGAGGGGAAAATCTAGGGAGTGTAGGTGAAGTCtSTATCCGACATTACAC^ 

GAT AAAG AAA CTGGTT T CAT C TC TT CAGAATCGAT AGT GTCT GAT CCCTT CTTTTC A CTAGAAGT GAC T GGC TGTAGATC T AATCGT GC C CAAGAT AC T A 
TTAATAATGGCCGAGTTAGTGCTCGTCFAATGAGGATCCTAAAGTCACGT^ 

AAACG TT AAC CACG AT ACGGCC AT C TC T AC GGACGCT AT GG AGC G AGCT AT AGGGCAGC AC AAG A T ACT GT ACTA TGAT ATT GAAACAACAGATAAAGAT 
TTCACCGACAAAAAATCAGTCATCACATCTATTGGCTTCTGTTTGTGTACGGGAGGCG^ 

CACCTGGATCCGACGTGGAAAAGGTGAAAGAGACTATAATAAATTCGTACGATCCTGAAGAAAAGGAAGACATC^ 

AATTTTC AC C AACGAGT TTG AAATGT^GCTTGGTTTTGGAAAGT ATATAGAT AAAGTGAAGC CT C ACGTGATT AGTGGGTGG AACAATGTAGCTTTTGAC 
GACCCCTTTGTCTTTACTCGTA7CGTCAAACATTTGAGTGATCACACCAAAGACATGTCT 

CTAGAGCAACAGAAGGAGGAGGAGGAGGAGAAACTCCATATAGATTGAGCACCCCTCAAGAAAGAATACAACTA 

GGGAAAATTTGTAGACAAGAAAACTGGCATGTTGAAACCTGAAATGACTGCAGATTTAT^ 

CGCAACAAGTTATCCTCGAGTAATAAAGGATCAGGAGGATGGTTCCAGAAAAT^ATT^ 

GCGAAAAGGCCTATAAAGAATCCCTCTCTGAATTTAATTTGAACGCCGTGCTCGCCAAAGTGAGTAGTGTCG^CGAC 

AGTAGACCTACACTTTCArCTATTGGGATTCTTGAAGCTGAAGAAGGCCCAGGATCAGGCAAAAGTA 

GGTATAGTTTCTACCTCCATCAACAAGGAAGGGGAGATTTTTAGGCTGTGTATGGACTCTGCTTTAA 

CTCTATGTATAGGAGAAGGAGCAATCTGTAGAAATATGGGAGAAGAAAGGGCAGATAGAAGAGGTGTGGGA 

AAAGGGAGGTATGGTGAGTCAACCTATCGTCAATCATGTTCCCTATCAAACGATTGACATGACAAGTTT 

TGCACCIACTACCTTTGTGACCCATCGACAAATTATGCAACTGAGGGATAGATTX5GTACT 

TGGACGTTATTGACGAGTGCAATCAGATTGTGTTGTCCGAGTACAGACCCATTCATATTGCACT 

TCCAATTACTCGCATAGAGGAAAGTTTGGGTCTAAGATCCATAGAAAATTTGGAT 

AATATGAATGTCACTX5CCGCAGGTATGGATTACTTCCa:GAGATTGTGTGTGACATTAATAT^ 

CCCCAGCAAGTTTAGAGTATATGCTTC^AGTATTGCCCATAATGTTAATCGACAGACCGTACATTGG 

GGAGGATATTCTCTCAGAACTCGAAAAGGACTTTTCTGTTGAAAAAGA 

TTCTGTCATAGTCCTGTGACCCAAATGGCTCGTCACATTATTGAATCTACG^ 

TTAGCTTATCAGACAGAATTTATCGCCGCGTTGGCGCATTTGATTC^ 

GTTGGTTAGGACATGGAACGTAAAAACTGACATTCTTAAGG^AATCATCCCTCAA^ 

GCCAAGGAGTTTGCCAAGATGGGAGACATGAAAAGAGCTGGTCTAAAC^AAGTTGGACAAA^ 

TGGCCCTAAGAGC^CGTTCGAGCCGTAAAGAGTTTGCGTCTGGATCTGCCAATACTGCCTCAAGTATTTCCAACATGTCAGCCACCGGAGGA 
AGGCACAAGGCACTCGGTGACGGCCAATCAGATTACAGAAAATGCTCGATGTGTATTTGGCAATATTGGTTGTGGATT 
AAGCAGACGTACGGGGATACAGATTCTGTATTCTGCGTGCATAATATTGTAGGTGATGGAGGAATGATACCAG 
ATTATGTGATGGATATTGCTCTAAAAAATAAAATCGCTGCAATTATTCCCAT 
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CGCTGGTGTGGGCATGATGAATATCGCCCATGAACGTCTAGCTGTCGXTTGX^ 

AGTGCAGCGTTCAATGACATGATAAAATTGAAATCAACCGATAACAATAATAAGTTTGCATCCTTTATCAAGAGACCT^ 
TTCCCC^TAATCCTTCATTGATTCTTAGAGCGGCCGAAGGACCCGCT^ 

TATGGAGGAATGGTTTACCTCTTCACCTACGTGGATGGCCATGGATG<riTCTGTTATCAACAACT^ 
AACTGGATTGACGCCATGACTTCCCGCCCCATAGAAGCGGGTACAGAAATGATGGAGGCGGT 

G AGC C TTTGTGAAG AAGGG AAT T A C AC C C ACC AC C AAAC T AAAGG GTCTCCAATC AT T GATT GC AAGATTTTT AC C AAAAAT AGA GG AAAAGAAAAG C T G 
TTATTTGGATGTGATGAAGAATCATGTGGAGAATTTTGCATCCCATATAACAAAT^ 
TCAAAAGAACAGAGTAGACCTAATCCTCTAGCTCTAGCGATAAATAACGACCT^ 
CATCACTTTCTTCTTGGAGTCTTTCGGCAGAGGAAGGGGAAGTCCCTGC 

AAGTGTTCCTG^ AT TTTC AGTCAAGAATTTATCTGTTGT GCCCAACGCCATC^ C 
ATGATTGCTAAAAATGTAGAAGTGTTGTGTTCTACATCTGCCAATACTGGATT^ 
CAAAGGACGTGGCrATGGCTTGTATACGATCTCTAAATAATAAACAAATGTTATTGTTTGTTGGAGS 
CGACGAAGAAGCAGAAGAAGAGGACGAAGAAAATGGTGAAAACGAAGAGAACAAAGGTGACTGTGTCACGG 

AAGGATGTTGGTGAAGAAAC T AAAAC AAGCGAGAAAACG GAGGXIAGAAAG AAAGG GCT CTAAGACGGCAAAGGG AAA GAC GGAGGAA G T 

TGAGTAAATGTGGGAAGAAAGATGCGAGAGATGTCATTCTTGATCGTTTACTAAAAGCAACA 

CTTAC AACAATATAGC AATTGT ACATT ATCTTCC T ATAT AACTTC AGT^^TGAAATTGG ACC AAAGAGTAGC^^ ACCAAATGGAAAATTTAATAT CTCAA 
TTGGATCAAATACGTAATCTCTCCAACAAGAAGAGGCAAGAAAAGGGAGG^CCTTTTA^ 

TTTTCCCAGTTTTAGATGCGTCTAGAAAATTGACTCAAGACCATTGGAAAAAGTGCCCCGTGTCCATCCCAGAAACGCG 

TGTGCCTTTTGAAGTTGCACTCAATTCTCTAATAGGAAAACACAAGTGCAC^GATACATGC^ 

TACACGCTAGCTTTAAAATTTGAGAACGAAAGATTGGCCCGGCAAATT^ 

AACTATTGGCCCAGGAAGTGTTAAAAAGGGTrAAAAGATGCTCAAGATAGAAAGTT 

AATTATATTTTTGTTTGAGTCTTTAAGGTTTGCrrCAGAAACCTGTAGCTGGTATGAGTGTTAG 

ACCACTACAGGTACTGTGTGGAATTATACTGATGAAAGATTTTTTGGACCATTGTATAACATGGATGAACTTTGTAACGAACGT 

AATTGTCCTTOATAACTGGTATTTATCATACGGCAGCAGTAGAATTGGCTGCTGCATGTCTATCTTGTGTTTTGTAAGA 

TAATTTCATTCTGX5TTTTTATTTTATATCCTCAAGAGTTGTAGTAGTAGCAGCAGCA 

TTCTATTGATTGATATCTGGTCATTTTCTACAGCCCATGGTCGTTTAAGGAATATTC 

TATTAATAGTAGCGAATTAATTAAAATAAGCATTTTGTCCACATTCTGGCTAAAATAAA 

TTCGGGAAGTTTTGX^GCATGACACACTATTACTCaUULACTATGCTCACrrCGTAT 

AAAGCCTGTTTTTTCTTATAa^rrCTGTATTGArTATAG^ 

ATGAGTTGTCTTGTGCACCAGACGGGAGCCrrGTCATGAAACTCTACTTCTCTCCCACGTGTATTGAGCG^ 
GAGAGGTTGTAAAATAGAGTTATATTTAGGTTTAATTAGGAAACATTTGTGGAAACT^ 

ATGGCTTGCATTTTCAAATCATGTTCAAATCTTACTACATAACCCAAGAAATTATAATCTGCTTCTTCACC^ 

TTGGCTCTTTTAGTACAATTTCTTTTGGTTGGTTAGTCATGTTGTATATGTTCATTGAGXZTATGAT^ 

GATAATGCATGCACCGACAGAGGTGTAGAATAATCCTCCATATCTAATGAC^CC^ 

ACATGACrTCATGGGTCCGATAGAGATATTGGAAGCCGGTTCAGTAGCGGAAAAAATAGACACTTCT 

AAATACCGAGAGGGGAATTATCTTCAACACTATTAAAGTTTGTTGCATAAGAT^ 

TTCAGAACAJU^ATCTCCTATAAAATTGTGATTAGGGCAGACCAACATCCCCTCTC 

ATGGCAGCCGTTCGTGAAGGATTTCTCACGTrcAACTCGTCTCACCTTCACTGGTACTTTTC 

CGTGGGGATCACTTCCATCCTTTAAAGAGAAGAGAGGCACTCTGTAAACaiG^ 

AGTGG^TACCGCTCTGCGGTGCATTCGGTACACCTCCAAATCCGTTTCCTTTACTTGAATCGATTTTGCTCCTTC^ 
TACCTGTCG^TCGAACGAATCTTCACCTCCCATTTTATTTATTT^ 

ATGGCGT ACATTGACC AAGGGGCGT TTGG AGCCAAATCTGTGATGCAGCAAC AAT ACCTTCT TCC ACCC TCTAAT AGGCTCT CC AAACAACAACCTCCAC 
TGGCATCATCTTCTCTTCAACCTTCCTCTTCTAATAAACCTAGGAGTACATCAAGAGTAACAGAC^ 

TGCTTTT ATAAGC AACTCATTT AGT GTT ACAAAAAATGT GGTAAAACTTTCTAAAGAAC AAAC CGAGAAAAT ACT AGAAAAAGATTTGCCTGATAJ^GTG 
TACAAAT TGTTTG AAAATTT AAAG G ATGG AAC ATT TGGT ATAGGGGAAGATGAGGAG^AA 
CAAAAAACATAAGACTTAAAAGAAAGTGGAAAGAAATGACTGAACAAGGCGATCAAGGAA 
AGAAACTOrrCCAG^AGGAGCAGTTGGCCCrrcCAGGX:CCTCAJ^ 

C CTC AAG GAGAAAG AGG A GC AA TT G G ACC GGCAGG AAAGG ATGGAGCAGTTGGCC CT CAAGGCC C TC C AGGAGAAAG AGG AGAAAAT GGACGCCC AGG AA 
GAGATGGAGCAGTTGGC CCTCAAGGAGAAAGAGGAGCAATTGGACC GGCAGGAAAGGATGGAGC AGTTGGCCCTC AAGGAGAAAGAGGAGCAATTGGAC C 
GGCAGGAAAG^TGGAGCAGTTGGCCCTGC^GGCCCTCAAGGAGAAAGAGGAGAAAAT^ 

CCAGGAGAAAGAG^AGCAATTGGACCGGCAGGAAGAGATGGAGCAGTTGGCCXTGCAGGCCCTCCAGGAGAAAGA 

ATGGCGTGGACGGTTCTGTGGGCCCTCAAGGAGAAAGAGGAGAAATTGGACGCCCAGGAAGAGATGGAGC^ 

AGGAGCAACAGGACGCGCAGGAAAGGATGGTG<:AGTTGGTCCTG<1AGGCCCTCAA 

GGAAT ACAAGGCCC AAGAGGAG AG ACTGGACC AC CGX3G AAGGGACGGX^CTGCAGCAG AAAGAGGAG CC C AGGAC C ACCAGGCGAAACTG 

GACCACCAGGAAAGGATGGTCTGGATGGTTCTGAGGGCCCTCAAG€G3tftf^ 

CCTCCCAGGAAGAGATGGAGCAATTGGCCCTGCAGGCCCTCCAGGAGAAAGAGX5AGC 

CCCCAAGGAAGAAGAGGAGCAACAGGCCGCGCAGGAAAGGATG*a^CAGTTG^CCTGCAGG 

GGGATGGTGTGGACGGTTCTGTGGGCCCTCGAGGAGAAAGAGGAGAAACTGGACCAGCAGGAAGGGACG 

AAGAGGAGAAAATGGACGCCCAG^AAGAGATGGGGCAACTGGCCCTATAGGTCCTGCTGGTCCTCAAGGA^ 

GATGGAGCAACTGGCCCTATAGGCCCTAGAGGAGAAACTGGTGCAATGGGAAAGAATGGCGTGGACGGTTCTATG^ 

CAGGCCGCGCAGGAAAGGATGGGGCAGTTGGCCCTGCTGGCCCTCCAGGAGAAAGAGGAGAAACTGGACCAGCA^G 

TG^CCCTCAAGKSAGAAACAGGATTAACTGGCAGCCCAGGAAGAGATGGAGCMCTG 

GGACGCCCAGGAAGAGATGCAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAA 

GCCCT ATAGGTCCTGCTGGCCCTC AAGG AG AAAC AGGATT AACTGGACGCCCAGGAAGAGAT GGAGCAACTGGCCCTAT AGGTC CTAG AGGAGAAAC TGG 

TGCAATGGGAAAGAATGCTGTGGACGGTTCTACGGGTCCTCAAGGAAGAAGAGGAGCA^ 

CCTCCAGGAGJWVAGAGKSAGAAAATGGACGCCCAGGAAGAGATGXjAGCAACTGGCCCTATAGGTCCTCC^ 

TGCCAGGAAGAGATGGAGCAATTGGTCCTCAAGGAGAAAAGGGAGAAAATGGACGCCCAGGAAAGGATGGG^^ACTGGCCCTATG^ 

AAGGGGAGAGACTGGTCCTATAGGTCCTGCTGC<;CCTCAAGGAGCAACTGCTCTTCCAGG 

AGAGGATTAATAGGGCGCACAGGAAGGGATGGGGCAATTGGCCCTGTAG^^OTGCAGX^ 

ATGGAAAGGACGGTTCCGTGGGTCCTCAAGGAGCAATTGGACCTATAGGCCCACGAGGAGAAAGAGGAGAAACTC 

TGGTTCCACAGGCCCTATGGGCCCCCAAGGACTAAGAGGAGCTACGGGAGCTCCAGGACCGCAAGGAGAAAGAGGAT^ 

GGTGAAACAGGTCCTCCAGGGCGACAAGGAAGGGATGGAATAATGGGTCCTAGGGGTCTT^ 

GACCTGAAGGAAGAGATGGTGCACCTGGTCCCGCTGGCCCTATTGGACCTCAAGGAArAAGAGGATTA 

AGAAATGGGACCAGCCGGCAAGGACGGAATAGAAGGCCCTAGAGGTCAAGATGGJVACAACTGGC 

AGAACAGGAGAAACTGGTGCACAAGGATCTAGAGGAGAAAAAGG^GATAGAGGGCTAACAGGCCCTCAA 

CTCAAGGTCTTAGAGGAGAAAGGGGAGCACCTGGCCCTAGAGGTCCTAGAGGTATTCGTGGCCGT^ 

TCGAGGTCCCCGAGGAACAAAAGGAAGAACAGGAATACAAGGCCTCACTGGCATAGAAGGTCCTCGAGGTCCTAGAGCT 

ATGGGGAAAATTGGACATCGAGGAGAAAAGGGTGATAAAGGAGACCGTGGAGAACAAGGCATCGCTGG 
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GAGGAATTCGAGGCCCTATTGGTGCTCCTGGTAAGCCTGGC^ 

AGGGGiUVTTAGGTCCCCAAGGACCAACAGGrCCTCAAGGGCCAGCAGGTCCTCIUVGGGCCGATGG 
G6AGCAGT6GGACCAAGAG6AGAGAAAGGAGGTA6AGGAA6AAAGGGAAAAAATGGCCCTAAAGG 

AAT AT T CAATCACCCAT GC TC GTGC AG AGAT AAT GT GGGAAGGAA7VTGAAATCGG AGAAGC AT ACATTGG AAGATCTTATGG AAC T G AT ACAATC C C TGT 

GATGJITAGAAAATAG^LATAGGGATGACAAATGAGGACAAAAAAAACG^ 

T^GGGTGTTT^CTGGTAAGCAATAAGACAGATTATATCCTTTTAGTTACITTACTGATGCC^ 

CGAGGTCA^AGAGG<3TGJ^rGCT<!rWAGAGAAAG AGAAAGCAAATCGTACAGATT TATT AGGCCGTCTGACC AATCT ATAGGT ACTC ATTCACGT TCAAA 
AA?7GCCGTGGTAATGTATCCAGACGCAAGCATGAGTTACTCAGTTGATAC^ 

GCAGAAACCATACACGGGGAAAAAGATAGAGGTTTCTATGCTGATAGAGGAACTGTAGGGAGGTTGAT^TACCTCCCACT^ 

TGCAAGC 

(SEQ ID NO:l) 
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CT1035 
Nucleotide 

Genomic coordinates: 
Start: 130589 

Stop: 131444 (SEQ ID NO: 2) 



Amino Acid 

MAAAKMDAIIADINGNDTDLSKLITDVIQKRAKAVMDRNRAKMDMNRRVDEAIQEAVAAK 
KQKALWFDKLVEETDSGQSVPPTLSGSDYDAWVDRAMPSHIELVESVEGDSLYDKLPPF 
NVQDI DDQI G DE I DT P I S YLAMWVKVDCETGDI EE EYNIAPTFGVTQNNKI YRDERDQI 
FTKADKS VR I FKLAKL DS I S GKSRQLT Y AVKNNNEYTE FVCS VFAEFES DS DTTKSG I G I 
REYDKPKNEFE YEEREI FTFFI PIQPAGTKLLLYFLVDVRSRI I 
(SEQ ID NO: 3) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q65326 (Q65326) DNA POLYMERASE (FRAGMENT) 36 0.31 

MYSB_CAEEL (P02566) MYOSIN HEAVY CHAIN B <MHC B) 34 1.6 

002244 (002244) UNC-54 PROTEIN 34 1 . 6 

004327 (004327) CELL DIVISION PROTEIN FTSH ISOLOG 33 2.1 

Q04009 (Q04009) MYOSIN HEAVY CHAIN 33 2.1 

002077 (002077) COSMID C48E7 33 2.7 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 765 of CT1035: this 
corresponds to nucleotides 13064 9 to 131413 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=350 

5'stop«373 

3*start=405 

3*stop=427 

5 • primer =TCCCTCCTTTTAACGTACAAGACA (residues 350 to 373 of SEQ ID NO: 2) 
Tm5=57 . 95 

3 f primer*CCATGGCAAGGTAAGATATTGGT (residues 405 to 427 of SEQ ID NO: 2) 
Tm3=57.97 

probe 1=CGGTGATGAGATAGATAC (residues 387 to 404 of SEQ ID NO: 2) 

probelstart=387 

probelStop-404 

direc t ion 1— Forward 

Trol=69.08 

scorel=l . 91 

length=78 
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CT1037 

Nucleotide 

Genomic coordinates: 

Start: 131480 

Stop: 132941 (SEQ ID NO: 4) 

Amino Acid 

MFVISIATSLVLFFFLLFVSITILDGAKTIDSQPFRKRRKRKRYRTSESGDGIDGGTGTT 
NGGGGGGGEGGGGGTNGNGTGTTNGGGGGGEGGGGGTNGNGSGTTNGGGGGGEGGGGGTN 
GGGNGNGGGNGNGNGNGGDTDTDDFEPTPALLKERLLNSISSKPKEYYEAFVSAEVETAL 
QLSRDDSTQTIIIDDDQLELDASDTLQGKPRDYLFKLAGVSSAFLEGTTIRKAEDRARNI 
NEEE I AQT I LS Q LREKH I NDE Y DGK Y AT PEE RADFSN S LNLVTKYTNHE VGLL VGET I EK 
AFPHEIEFERCIILVEDFNSGTITSNTMQYRSNAYKIRWEGSTTDPGEWPDDCLVFAV 
VVNKEQHSLEISATNRCQDICFVIIPRLSAIGKNATMVIRKGDEIKQETYLFVANKNDTT 
HFSIITDKDESVGIELNMLIFSERILPTLSDPATVPRPLTDANVLSAYGKRLGVGAFTDK 
NLLSSQ 

(SEQ ID NO: 5) 



Top Blast Hits 

Sequences producing significant alignments: 

065514 (065514) PUTATIVE GLYCINE— RICH CELL WALL PROTEIN 
Q21835 (Q21835) R08B4 . 1 PROTEIN 
Q43522 (Q43522) TFM5 GENE 
053553 (053553) PGRS- FAMILY PROTEIN 
065450 (O65450) GLYCINE-RICH PROTEIN 

GRP1_PHAVU (PI 04 95) GLYCINE-RICH CELL WALL STRUCTURAL PROTE 
Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 392 to 1365 of CT1037: this 
corresponds to nucleotides 131892 to 132865 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=723 

5 p stop=74 5 

3'start=792 

3 , stop-811 

5 ' primer— TGAGGAAGAAATTGCACAAACAA (residues 723 to 745 of SEQ ID NO: 4) 
Tm5=58.39 

3 • primer =CCTCGGGTGTGGCATATTTT (residues 792 to 811 of SEQ ID NO: 4) 
Tm3=58-37 

probe 1 =CAACGATG AATACG ATGG (residues 774 to 791 of SEQ ID NO: 4) 

probelstart=774 

probe lStop=7 91 

directionl=Reverse 

Tml-68 . 98 

scorel=l . 98 

length=89 



Score E 
(bits) Value 

120 3e-26 

112 6e-24 

110 2e-23 

108 le-22 

107 2e-22 

106 3e-22 
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CT1038 

Nucleotide 

Genomic coordinates : 

Start: 132993 

Stop: 133896 (SEQ ID NO: 6) 
Amino Acid 

MGDKQKVEQLLRELKAEANDDWLSANVDPIVERFVTTKSDETAQVVKQAVDEKYDELLED 

KVEEMRPDIINEASETYDKLAADMIREVDTSSVIAPAIAGTVARTINNLRDKRKEYEKRL 

WTIAYKPWRRYVQAITVMEFRLSYKDLWHANSDTYLTFPFLRIKKIAYINNDRASPVNC 

SLSVSYPNKSEWGNDNGVGRKVDIHIRRNDLQEKDLYLSVICMLDTDFSGYDKAVEVDAH 

KFHFEAGNRTMFLPKTSNLFNRSHIVNSKICTIVFPPASASSASTTELDNVYYRITCTCS. 

{SEQ ID NO: 7) 

Top Blast Hits 



Sequences producing significant alignments: 



Score E 
(bits) Value 



RESA_PLAFP (Q26005) RING- INFECTED ERYTHROCYTE SURFACE ANTIG 
Q9X9G6 (Q9X9G6) HYPOTHETICAL 35.5 KD PROTEIN 
CAA21397 . (CAA21397) ORF74 

Q9ZFZ9 (Q92FZ9) HYPOTHETICAL 21-1 KD PROTEIN 

RESA_PLAFF (P13830) RING- INFECTED ERYTHROCYTE SURFACE ANTIG 
Q9ZH03 (Q9ZH03) LAMBDA HOST SPECIFICITY PROTEIN J 



38 
37 
37 
37 
36 
36 



0.12 
0.15 
0.15 
0.15 
0.26 
0. 45 
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CT1039 

Nucleotide 

Genomic coordinates: 

Start: 133968 

Stop: 136344 (SEQ ID NO: 8) 



Amino Acid 

MEEESQRVQRRIGVLPEEAASQILKDTKLRVSYLGVGHWGYSVSVIKSALQKGCRRNDED 
I TAW S I REAYLY YHLGLN Y I EN VKPAAKSLNTNMVNR I KI I AVE DTS PRSMV ASNECVRT 
LEKYEKGNFRQPSYLMDAAMRLVHASSSRVCSHMRALCCKEEDSDKLGGIYYANFNELET 
QCVSAVNFSPIERIKHVFREIESVKLGKKSVQLLNLRSVAAYHVLRYYGDKVKDTNKKHS 
GPFKRKEFEQFWGLCFKFVTQHVKTDPELRCYFNELTYAINWRRDFFCSKGFFREESLFL 
TSIVELIIAMCIGDRKQFAKIQKRDLKRFNKGEEGRKEEAATFDWIEGHVKRMPQMPVWV 
LDKHTNKNTHGVSFALESSMVSGGDKRWSPGVWLHSYTKMRLDSPPPPEVGQFLDQAFNT 
LKREAASHCVTRNICTTTGFIKASSFTANINSEPMEIKEEIKKRKIEIKDDNTTATVTVS 
ATTSSS I TSTPP PTKKQKTTPSGSNKVDS IQLNNLPTLNMEDLDRVLEVHNQNSKKGVAA 
TVLMKDGNKVVFKEMRKSFGWGSHQNFVQVLKDEDVCKLDYLLPCPDSGPYRGLYRCYFK 
I VKDE I S STAAR IEKVKWGENAMCYFI SGCVTRQEG I GKI IT DVRLSHMG PNKQYV YDNY 
RQLIHILIFRLLTGVSDTNTSNILVGDGGNLFSVDENYVGAKDPRTALENRKIKELQLLL 
KTSFKVNKVTKEDIDSCLPSWLFDTSKSDKIMNGVCNIGKNMGIGPTTLDIVKNNCTCIL 
GWN DLL YDNK 
( SEQ ID NO: 9) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9Y1H4 (Q9Y1H4) GAG 43 0.011 

RAP 1__Y EAST (P11938) DNA-BINDING PROTEIN RAP1 (SBF-E) (REPRE 41 0.031 

SLY 1_Y EAST (P22213) SLY1 PROTEIN 40 0-070 

P90603 (P90603) MUC.CL-1 38 0.20 

052224 (052224) GLUCOSYLTRANSFERASE (EC 2.4.1.5) ( DEX TRANS U 38 0.35 

Q20202 (Q20202) F4OE10.5 PROTEIN 38 0.35 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 1092 of CT1039: this 
corresponds to nucleotides 1338 66 to 134957 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1255 

5 , stop=1277 

3*start=1353 

3'stop=1375 

5'primer-AATACATTGAAGCGAGAAGCTGC (residues 1255 to 1277 of SEQ ID NO: 8) 
Tm5=58.71 

3 1 prime r-CCTTAATTTCCATCGGTTCAGAA (residues 1353 to 1375 of SEQ ID HO: 8) 
Tm3-S8.18 

probe 1 =TAGTCATTGCGTAACGAG (residues 1278 to 1295 of SEQ ID NO: 8) 

probelstart=1278 

probelStop=1295 

directionl^Reverse 

Tml-69.03 

scorel-1 . 96 

length=121 
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CT630 

Nucleotide 

Genomic coordinates: 

Start: 249215 

Stop: 247358 (SEQ ID NO: 10) 



Amino Acid 

MALKDAFTERLVVNKVGSGTDMAPVVEDDRQKSLFQKVENLYRVLVVEQKNSAITLSGNK 

NTNKRQCRQVEEDKV I FEGE DRT VSNLPQAVKET IAAN AESI LDYWYKNVI PLLDTKKER 

SGKSDTFLRTAVICLVRCCVSYKI»4KTCSLIYEFEHKILNKSTLDPLLKDILDNKQELLH 

MDSKYGSKTTSPELAKETIEALYTTVYNHWTNAFKLYQASLTHKPVTGKKYASVIHFIRT 

WRKIVKAYVSKHNNVERDLSLKNIMKNESADNANVLTIEKMYKKIGNSVKNTNNNSAHQM 

SDSEDDDDDDDDDCEGMDVCDEASEREKKHQESLYPINTPVTTITGDYIFKVLLELVLSP 

HIHPEWKIPMCDFVNRNIPKLMKAMETDISNAVIEVRASKVNPVQILPIAANFWDFCKSG 

KPPSDVKFCMMFNEPSSNETLSSGAGVFGRFIGGPFSHKSKELDIISNCLRSLLLNKEAD 

NLSTRIWREGGSWCFNYCPITARGAVLGYGEQLSERSIKALWAKKIQDAVTESVKRQRN 

AADKNSRNCDLLGDEGVVSMKTVTFGCAN^5LKTQNGMGKFNVWSFEDSIQANKEGAARQ 

YMSQQVFTHS F PALDQGK 

(SEQ ID NO: 11) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



097318 (097318) PFC0865W PROTEIN 45 0.002 

YBO 0_YEAST (P38114) PUTATIVE 126.9 KD TRANSCRIPTIONAL REGOL 44 0.003 

Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 42 0.010 

Q92271 (Q92271) 12.8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 42 0.010 

062235 (062235) F36F2.3 PROTEIN 41 0.018 

Q83970 (Q83970) (CPV) 41 0.023 
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CT631 

Nucleotide 

Genomic coordinates: 

Start: 264975 

Stop: 259164 (SEQ ID NO: 12) 



Amino Acid 

MGSKRPCSSGQE PVTKKQKKNNNNNSN PVPVINIKSYP FLATRTQVLRSAVAAAAAS PSG 
SSSSSSSSASAVKLPDTCKEARKVLSTVSLQQSLAVRYLCNSISVSYAGGGISVFHLGGL 
PGAGKTTMVKELIAVLNDHGLIDSGSADMLLCCKSNSAKESLMCACKKPGGSSLMYPESV 
FSTLNKGFEIPVIFRKDEITLEKIQFVADKLKWKVIQVLANLRFLVIDEYTMASCRELVF 
IDAVLRIAKHRPDIPFGGVFVILLGDNRQNSAVVEDNTNHIQKKIKNPSEEEKPQKNNKN 
NKNKKKKKEKKEKGGEEEEGDENEEEEGEEEEEEEESDDEAETKKEEEKSTFFQGSVEQD 
NFGQEDNAKLYTEVFIKILKMFCSRDFFGNPSNLRNIVNKRHEAILMKSNNVKSVNNNLV 
SSAIKVEDCGNASNKKEVTAPSSSPAQSTAEENCDEFDDEEDDDFFNNEAFLKLMERNAL 
EKDRASGALNGFSLRCKSISDANEKIRSGTTSVSDKKSSLDMMKSLPLSALIEEGICSEL 
AHISELKKMSNANI^KYTENVCSIVFDMMAKAMREIDYSGREKLYIVSSLSERFKDTHLT 
SLMDEEILNVKYVHGSDPKCIDAVPFNSAHNRASAVAACVRNAFFRDGKDFVDETPIANY 
FKDNLRT VAS FLEN ETLT YKELLAKS ENIRSI LLKKETGNNNS ASSRTAAAAAYYEDDED 
YCYFDEEEAMDLEDGGSGGSGMKSSGGGGDDDDEESGEMIYRTDIPDKLHRDASTLDRVG 
HLVDFHWWKKWLTENKPSDLVRARVWYLYTLVRMQQVKFDNGKLPSLDKSALSGRLFHS 
PSEWATSTGVGVGGGGGAGADKPLHDEYWLRVLSMPISTGGDVGKSMLLPAYSSYLSALS 
RTYIMSSLKRIDIIKHAYSLMYGISLFDMTANLQDLVDTRMAGRSSRNGSVFMDNFDPVQ 
YFDNIFPSMVNEFLMYRKEDVFNNGQMMEGVKGSLKISRVLQTAHTENNNINNRHNNSLK 
YSEKSIVIAMQMVTSISKGNERRKKIEEFITKEQGQPKDMCERLMANSKAKQEKDAISSK 
TDKMMGAITLTKKHVLKNAVSNLVDTSIIKETKKNNNNSSSSSSTSLAAAAAVENSVPAL 
RVEVKFWLNMDLSDISHEKTISHKYRQQLINAIKTRSTPLFDKFTDRKILRAAESPRAL 
TTILLDEKKKVTRAKSITLYQGQNVI FTTSNRMIHGTQERFVTKDTGWTNLMYKNGELT 
VFVYVERLGQKCLEIKEGRQIIGNPNIKNGGFGNNVYVQYLPFESSQAMTIYSCQGHTFF 
RDTIVDLSGASTQDAYVAVTRNSNPQNLFIIQNHSVERGNLCNIKCAMSKDKAYTMPIGG 
IADFNGSDFINHDTVSVSREVAESSAAMDDDYNGDGGVTMYSAYDPSKDVVAAAEEFILS 
R5GKSLSFNASWMANTAKVIQQHGLETELKNIRDFFFGVNNGDVAKHYEKLCNKKMIELY 
TAIVRSITHYSIASGIVKQPSSKLCEEYETKQKNKKDYIKIHPVFVNRAPKESTIEMLLF 
DIAPHNKATIVFQFYVHYIFLVYEKLNVLNSSFAFLPSPNPCLNQYVRPKSITTNSTHVP 
NLGYESKDFAHCKDGGERDVKLEU.PITSADEFSNNIEGILKKVSDTSNQNKVNKYMDVVC 
KSMQHNLRRTGKFCRPTETCGLSKHGSIVTSTCTAQEKGENIHVDAEKGWLCMSDEANVY 
CMLMFMSKIAAASGVSEFPIKDKSILESNPETPSDTISLLAPRKTISPTNNLHFSMSEDV 
LFCGQVHPMKRVQFSLHVKRTGGALKSTFEEEEGLPTKI FSPNFATYPLFKKCKMYGAI I 
IAMTEMQGHEFAKYSTLDIRKSMFTGVGTWDLEKISGEGNEVMDKVDKFIVKNVSNILF 
KEQGKRVSFFVSCAIH 
(SEQ ID NO: 13) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



09YTL7 (Q9YTL7) ORF 4 8 68 Se-10 

035788 (C35788) CYCLIC NUCLEOTIDE -GATED CHANNEL BETA SUBUNI 68 5e-10 

NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 68 8e-10 

Q07034 (Q07034) RNA BINDING PROTEIN 68 8e-10 

096229 (096229) HYPOTHETICAL 78.6 KD PROTEIN 66 2e-09 

096134 (096134) SER/THR PROTEIN KINASE 66 3e-09 

Comments : 



EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 830 of CT631: this 
corresponds to nucleotides 259304 to 260133 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

S , start=2408 

5 f stop=2429 
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3'start=2489 
3'stop«2509 

S'prime^GAGCCCGTGTGTGGTATTTGTA (residues 2408 to 2429 of SEQ ID NO: 12) 
TmS=58. 49 

3 , primer=AGCGTCCAGACAGTGCAGATT (residues 24 89 to 2509 of SEQ ID NO: 12) 
Tm3=58. 56 

probel— CACTAGTGAGAATGCAAC (residues 2432 to 2449 of SEQ ID NO: 12) 

probelstart=2432 

probe 1 S top=2 449 

direct ionl^Forward 

Tml=68.99 

scorel=1.99 

iength=102 
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CT632 

Nucleotide 

Genomic coordinates: 

Start: 268485 

Stop: 267717 (SEQ ID NO: 14) 



Amino Acid 

MAGVDLYGGHIKPYGETVFNNKMQGNRGKIRALINEKAAATLPMSEDNISAWVTEVAADV 

FPDPKSALTFrvPNKSLNAFAWDVLKTPASVEIDIGKRIPQLIENLHMSDFTVAIFRVKC 

DDQGRYETSYNLSPSMGGKINRGLIRTLAKAQDIWWKRDFSLTIENFEVDNGKKRLDFL 

FNNQTDKSCFVKIFHEMESEKDIAIKPEKRGSSAVWDEVYSDIVTKNTRNAKFSLRYRNE 

KPVDHLLLYCMVTYF 

(SEQ ID NO: 15) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

097036 (O97036) PLC-BETAH2 34 1-5 

045329 (045329) F09C6.2 PROTEIN 33 2.5 

P97868 (P97868) PROLIFERATION POTENTIAL— RELATED PROTEIN 32 5.6 

P70287 (P70287) RETINOBLASTOMA BINDING PROTEIN 6 (PACT) (FR 32 5.6 

AAD49229 (AAD49229) EHEC FACTOR FOR ADHERENCE 31 7.4 

CAB55629 (CAB55629) LYMPHOSTATIN 31 7.4 



Comments : 



TaqMan Primer/Probe Sets: 

5'start«378 

5 , stop=403 

3'start=452 

3'stop=477 

5 "primer— TGAAACCAGCTACAATTTATCTCCTT (residues 378 to 403 of SEQ ID NO: 14) 
Tm5=57.68 

3 • primer =CCTCTTCCAGACTACAATATCTTGGG (residues 452 to 477 of SEQ ID NO: 14) 
Tm3=59.4 3 

probe 1=TCAGAACACTGGCTAAGG (residues 434 to 4 51 of SEQ ID NO: 14) 

probe lstart=434 

probelStop=4 51 

direct ionl=Forward 

Tml=69.15 

scorel=l .84 

length=100 
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CT633 

Nucleotide 

Genomic coordinates: 

Start: 266790 

Stop: 266442 (SEQ ID NO: 16} 
Amino Acid 

MASPLVASLGGGKNILFGLLLITIIVIVIAVIIIKAPLIASLIAGTALAGTIASALGSIP 
GVGGAFKKAFGKGKGKGGPKTPDGGAKKTNQKPKKGKKKPPTRRSIFKRIPKIKF. 
(SEQ ID NO: 17) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q02391 (Q02391) CYSTEINE-RICH FIBROBLAST GROWTH FACTOR RECE 36 0.092 

Q91019 (Q91019) MUTANT CYSTEINE-RICH FGF RECEPTOR 36 0.092 

Q69526 (Q69526) GLYCOPROTEIN B 36 0,12 

Q9X215 (Q9XZ15) HYPOTHETICAL 29.3 KD PROTEIN 36 0.12 

Q14113 (Q14113) AORTIC CARBOXY PEPTIDASE -LIKE PROTEIN ACLP { 35 0.21 

P79922 (P79922) MODIFICATION METHYLASE (EC 2.1.1.73) (CYTOS 35 0.21 
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CT634 

Nucleotide 

Genomic coordinates: 

Start: 283360 

Stop: 282673 (SEQ ID NO: 18) 
Amino Acid 

MVSSRTSTTSSSAVAATSTLLPTKRKREPEEVKVKVEVKMEQEELVEDS5SNKRPRIKEE 
KEEEHKETHHLSLPCKEEEDDGEEEEYEEEEDEEEYEDRVDDDTAEKMENLLVQLDNTTK 
NIKLKNPLREHDMAVSHYEHEFEVQNTVNFSFGVLSDIGFLINREAVSRWGNTPPPKEFG 
DMEIGSLTVNQLLHKCDNFVQAVVQKVKEDITPSIEVTIDSLIDDPCW 
(SEQ ID NO: 19) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

035788 (035788) CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 60 2e-08 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 57 8e-08 

Q9YTL7 (Q9YTL7) ORF 4 8 56 2e-07 

094922 (094922) KIAA0835 PROTEIN 55 3e-07 

008995 (O08995) MYELIN TRANSCRIPTION FACTOR 1 55 3e-07 

Q93424 (Q93424) HYPOTHETICAL GLYCINE-RICH 37.0 KD PROTEIN E 55 5e-07 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 575 to 1 of CT634 : this 
corresponds to nucleotides 282678 to 283252 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=210 

5'stop«231 

3'start=287 

3'stop=308 

5 ' primer =CCTCTCCCTCCCATGTAAAGAA (residues 210 to 231 of SEQ ID NO: 18) 
Tm5=58. 19 

3 ' primer ■vTCGTCGTCCACTCTGTCTTCAT (residues 287 to 308 of SEQ ID NO: 18) 
Tm3=58.4 0 

probe 1 =AAG AGGAGGATGAGGAAG (residues 266 to 283 of SEQ ID NO: 18) 

probelstart=266 

probe 1 S t op=2 8 3 

direct ionl=Reverse 

Trol=68 . 99 

scorel=l . 98 

length=99 
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CT635 

Nucleotide 

Genomic coordinates: 

Start: 285773 

Stop: 284075 (SEQ ID NO: 20) 



Amino Acid 

MARSVGLLSVTPEYDTFKYIKMEEFKTLKVKNGFTISGENPDKYEHILLSFKSVDRVTKS 
ELRDGLYIVRLKDKEVLHIKNGVHRLRQLTGDNTLQVGLKYTHNLPRLGSLLQDDGCEDY 
GEKWNESLPIDMQNINKIVKEKALLSDKNFKFSPLYRLLHERLSNAAVKKCDYMIITTDF 
LVGCGYTPSHCPRTLRNMEQLLVEQCGFSSRISVYDICDRLTYKGAYIANPITGSYSNMC 
LIVPMDKLGLIFYNSTHPSAKSIGNYMSSLFNATVIYANERDNLQMDNFRREIKFAENEV 
NMKEEELKELRKRCAVSEEQRISLRDVHKKSSIATSRYDGGACLVFAFSDRDFSLLCRTN 
GNGSFYSATEEGIRYVSSPEYKKRDVGERRPRLIMSITGSDAPICIRDSVRNHFKTRLFS 
RTSGNSITFAVPPGERELMEMVREVTGTDIKIFMDNGKVYQNGAEINVIDPTSKEYKELL 
KREENLPEDERKRLRRERRMIFNTSRAISMYNEERGDGGSGGETSEDGDGNGSTSSKGEK 
RKREENEGNEYVVLLNKACKDIKVC 
(SEQ ID NO: 21) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q21885 (Q21885) COSMID R09H3 
077336 (077336) PFC0425W PROTEIN 

000164 (000164) RIBOSOMAL RNA UPSTREAM BINDING TRANSCRIPTIO 
UBF1_HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 
YPT2_CAEEL (P41880) HYPOTHETICAL 21.6 KD PROTEIN F37A4 . 2 IN 
YG2K YEAST (P53253) HYPOTHETICAL 106.5 KD PROTEIN IN CTTl-P 



37 


0. 


32 


37 


0. 


42 


36 


0. 


71 


36 


0. 


71 


36 


0. 


93 


35 


1. 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 501 to 1 of CT635 : this 
corresponds to nucleotides 284 04 7 to 284547 of the genomic reference sequence. 



TaqMan Primer/Probe Sets : 

5'start=686 

5 f stop=706 

3'start=799 

3'stop=822 

5'primer-CAAACCCAATCACAGGCAGTT (residues 686 to 706 of SEQ ID NO: 20) 
Tm5=58.55 

3'primer-GGTTGCATTGAAAAGAGATGACAT (residues 799 to 822 of SEQ ID NO: 20) 
Tm3=58.19 

probel=ACTCCAACATGTGCCTAA (residues 707 to 724 of SEQ ID NO: 20) 

probe Is tar t=707 

probelStop=724 

direct ion l^Forward 

Tml=68.89 

scorel=1.8 9 

length=137 
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CT636 

Nucleotide 

Genomic coordinates: 

Start: 286706 

Stop: 286076 (SEQ ID NO: 22) 



Amino Acid 

MIVFVEGSPLTGKTSWVDNMRTAGKGKQSFLNFMYTNYRDYLPIFPWTIQEHliRASDYQE 
RPRLVDGMFGSSLNFFTGMWRHDTEQFPESKIGLREYLEMYGEEFKACVAEWVKYKPVFH 
VMVYREEDVKKMEPIIQELNDAHNWFIDVLKEERALFVKIEVIPRNVYKGNICSSCFSTS 
KNYVYRVGKCTNSIVHCDMKCKFIAEKII 
(SEQ ID NO: 23) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P87968 (P87968) ENVELOPE GLYCOPROTEIN, VI -V5 REGION (FRAGME 33 2.0 

P88315 (P88315) ENVELOPE GLYCOPROTEIN (FRAGMENT) 33 2.0 

P88314 (P88314) ENVELOPE GLYCOPROTEIN (FRAGMENT) 32 2.7 

DYHC_PARTE (Q27171) DYNEIN HEAVY CHAIN, CYTOSOLIC (DYHC) 32 4 . 6 

040068 (040068) GP120 ( FRAGMENT ) 31 6.0 

040067 (040067) GP120 (FRAGMENT) 31 6.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 594 of CT636: this 
corresponds to nucleotides 2 86111 to 28 6704 of the genomic reference sequence* 

TaqMan Primer/Probe Sets: 

5*start=239 

5 f stop=259 

3*start=317 

3'stop=335 

5 * prime r=GGAGGCACGACACAGAACAGT (residues 239 to 259 of SEQ ID NO: 22) 
TmS=58. 81 

3 * prime r=CACTCGGCGACACATGCTT (residues 317 to 335 of SEQ ID NO: 22) 
Tm3=59. 42 

probe 1 =TGGAGATGTATGGAGAAG (residues 293 to 310 of SEQ ID NO: 22) 

probel start =293 

probe lStop=310 

d irec t i on 1 =Rever se 

Tml=68. 95 

scorel=l . 95 

length=97 
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CT637 

Nucleotide 

Genomic coordinates: 
Start: 300432 

Stop: 299085 (SEQ ID NO: 24) 



Amino Acid 

MGGEDSFDDRYDSDALWENEGAKSIQVKETDLEVYRMHRRAVPTLEEKNRTALRYYSDWS 

PVYRVPLFSLKDGSDPHERDFSFWDPRRFGKVPVKVRRVDVRNPSRTAAIFVPTGPGLH 

VSSYTGDGMLVCPNHNFIGDLCSEIASDITIYNTSSSGRLSYATNFNSVEDNSPVGILFE 

TLPDDKMFQQVSIFSATEPASNISIGPMSHVKIKLGYYDEENATAVGVIRYGGLFYTSVG 

ACIIPEGVFFDDWGNHSSMNIYNMTNQPKEIVLKEPRGEDAMEEDDGEEADYNFLGYVV 

RFEHDLKMQAMS SAYS SVS I DINSSSFHKCFLIKPKYNS I LQPLVSSEVVLNDLSLNTRG 

REVEFHDRLPSGAQDNSYSIVKYMKIVSLKEGLKVVNPI INTELYKKKQALKVHVLNMTR 

DWGLDTSEHSFGVIVCHAAKLPEVIGQ 

(SEQ ID NO: 25) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

023592 (023592) CARBOXYL-TERMINAL PROTEINASE HOMOLOG 34 1.6 

YGA1_YEAST (PS3199) PUTATIVE 3 BETA-HYDROXYSTEROID DEHYDROG 34 2.1 

VILIDICDI (P36418) PROTOVILLIN (100 KD ACTIN-BINDING PROTE 33 4.8 

Q9Z5B9 (Q9Z5B9) PUTATIVE TRANSFERASE 32 6.3 

Q9YUY4 (Q9YUY4) ENVELOPE GLYCOPROTEIN 32 8.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 3 to 1254 of CT637 : this 
corresponds to nucleotides 299132 to 300383 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 f start=678 

5'stop=702 

3'start=788 

3'stop=812 

5*primer=CGGTGTCATTAGATATGGAGGATTA (residues 678 to 7 02 of SEQ ID NO: 24) 
Tm5=57.26 

3 * prime r^TCTTTTGGTTGGTTAGTCATGTTGT (residues 788 to 812 of SEQ ID NO: 24) 
Tm3«57.75 

probe l^CT AC ACCTCTGTCGGTGC (residues 705 to 722 of SEQ ID NO: 24) 

probe Is tart =7 05 

probelStop=722 

di rec ti on 1= Forward 

Tml-68.85 

scorel=l .85 

length=135 
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CT1040 

Nucleotide 

Genomic coordinates: 

Start: 137588 

Stop: 139940 (SEQ ID NO: 26) 



Amino Acid 

MAAAAVSGEGRISADLLLLEQLTPDGDVIRYDSEQYTKPRKIFGDKSVIETIGHFLIHNH 
NQGESYQIASSVI^KFPALLNCIWNGESGGMALWKALYRAKKYRLLNSLLVHKIKNWPSV 
AVIPIYGSVCDREERPIIMSEIIDKETLQTICKSDIRSLLGMMNAKHGTLGGNFLHFYAR 
STKPFENFQYEAMGANAVLMAAEAIYDGFRDHGLNPSEYTFPGLESADVYGNNPVEIAIS 
GDDDNMLLNLICNYGVSYEKTRGRVNRSLLDFLKMNTASKCLSVLKFVEKHFKIESNTPK 
GEFEEKAETCVNCLDRNNVLTKGSEQESYKLSCGHFLHVKCLRNICIVSQHLRCEKCLKR 
FDES I LRKCT PNLNWWLTMP AG AGNEEE I C FMRNKKL V DDFRKLLSPVS I PHFFKNSRQR 
NLDMLCPYSDHTIIPNKEDPKKNEDGNRVRVNHTAISEKQNKEEEDARIKRVAVRTFTAI 
REKQNKEEEDARIKRAVDMAVAAINEKNKEEEDARIKRAVDMAVAAINENNKEEEDARIK 
RAVDMAVAAINEKNKEEEDARIKRAVDMAVAAINENNKEEEDARIKRAVDMAVAAINENN 
KEEEDARIKRAVt^VAATNEKNKKEEDARIKRIIDLTVMRIQRIVDMAIAAATKKDKK 
EEEKRTKREQELRADLRRAMDMVNEVQKKLEDMELEKGCNKDEAKNTSNWSSSSVVAYS 
KEIVPCLGNNNNAVIGMTSTNYSANNTKNNVFGSPHKFSFNDASRFSNIVETPKMSFNFS 
FKT 

(SEQ ID NO: 27) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 71 4e-ll 

Q26775 (Q26775) TB-292 MEMBRANE ASSOCIATED PROTEIN 70 5e-ll 

INCE_CHICK (P53352) INNER CENTROMERE PROTEIN (INCENP) 67 5e-10 

YDF3SCHPO (Q10475) PROBABLE EUKARYOTIC INITIATION FACTOR C 66 7e-10 

Q26774 (Q26774) TB-291 MEMBRANE ASSOCIATED PROTEIN 66 9e-10 

P91257 (P91257) SIMILAR TO C. ELEGANS UNC-89 65 2e-09 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 853 to 415 of CT104 0: this 
corresponds to nucleotides 138912 to 13934 9 of the genomic reference sequence. 
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CT1041 
Nucleotide 

Genomic coordinates: 
Start: 140110 

Stop: 141616 (SEQ ID NO: 28) 
Amino Acid 

MVYKGFVSPSFITIRTMTSNRPTTSPLSFSEGFSLSGDKYDTYEDILLEQFNCFKTSSPS 

SARKSEIEDKTLIFQLKEGEKFHLAKGIEELREILDDNSATIEPIISPTTFNDRNELLNH 

EGDISSSPLYTQIMKHISPEHDiyELDLIVGTDLLFGLGVNLRNVSKLMKKISYGTLNVV 

DVCHRKFFNNRI IVNPISSSFSKNVCI I PLFS AAEE FS S LGECRDLFNG I CDDVERYINS 

YFFYPENTTTTTTTAPSSPEMEIADEEEQSPKTIKRNDNASRNWSGVCLIFEVFKNTYYI 

INRGDRGGSFEKAVKSAISSIKEKRCKITDINGNKPRLVMVITGCYTELYFKDAI*KQIGE 

NRRKFLKMNGNYFSLIDEQADLIEFAMSVSGAGERIFVNGLGMFQNRKMIPVIDPLTYEN 

VVCGEHDIQKEDAILSVRRAIADYNDFVSKNKRGKKRSAEEENEDEDADASSSSSSSPPP 

SSPPARKKSRLPDEGEKCTLC 

(SEQ ID NO: 29) 

Top Blast Hits 

Sequences producing significant alignments: 

Q26258 (Q26258) BR2 - 2— BALBI ANI RING {5* REGION, REPEAT UNIT 

Q23804 (Q23804) SPID PRECURSOR (FRAGMENT) 

097324 (097324) MAL3P8.1 PROTEIN 

Q99112 (Q99112) HOMEODOMAIN PROTEIN BW2 

Q99111 (Q99111) HOMEODOMAIN PROTEIN BW2 (FRAGMENT) 

YA55_METJA (Q584 55) HYPOTHETICAL PROTEIN MJ1055 



Comments : 

TaqMan Primer /Probe Sets : 
5'start=822 
5 'stop=846 
3'start=92 9 
3'stop^950 

5'primer=AAAGAGAAATGACAACGCAAGTAGA (residues 822 to 84 6 of SEQ ID NO: 28) 
Tm5=57.46 

3 f primer=GCACTCTTCACAGCCTTTTCAA (residues 929 to 950 of SEQ ID NO: 28) 
Tm3=58.24 

probe 1=AACTGGTCTGGTGTCTGT (residues 847 to 864 of SEQ ID NO: 28) 

probe 1 star t=8 4 7 

probe lStop=8 64 

direct ionl=Forward 

Tml=69.07 

scorel=1.92 

length=129 



Score E 

(bits) Value 

38 0.12 

36 0.82 

36 0.82 

35 1.1 

35 1.1 

35 1.1 
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CT1042 

Nucleotide 

Genomic coordinates: 

Start: 141695 

Stop: 142541 (SEQ ID WO: 



30} 



Amino Acid 

MAVNLDNVLVNINNKDEDLTKLVSEAIKRRAKTVFDTKNQAGFDMRRQVEAALYEAISKKK 
EKAIKAFDELIQERGDEITPLTTMQYEEWWRTITPSLTTENLLGDVEHADFLLDRMTPVS 
EEDIEGFAASTFKEVS DSKTATVI VKADCETGDI DEVYNLAPS FGVTQE I KI YRSNNSSEL 
DNVADSFHIYKISATDSDSGNTKKLLYGLRNKKAGYTCLCRIFAEIESDGIMANTNIGVAE 
NNRDE I DENEEGKYGFLIPKQPAGAKLI I YFFLNCWTX 
(SEQ ID NO: 31) 

Top Blast Hits 

Sequences producing - significant alignments: 

gi|2492980 | sp I Q10043 I YRP1_CAEEL HYPOTHETICAL 37.6 KD PROTEIN Rl . 
gi|401691 | sp I Q00933 1 YSCI_YERPS YOP PROTEINS TRANSLOCATION PROTE - 
gill706480|sp1P51892|DNLl_XENLA DNA LIGASE I ( POLYDEOXYRIBONUCL. 
gi|267570|sp|Q01250!YSCIYEREN YOP PROTEINS TRANSLOCATION PROTE. 
gi|3121979|sp|O07597|DAAA_BACSU D- ALANINE AMINOTRANSFERASE (D-A. 
gi | 6686325 | sp | P71018 I PLSX_BACSU FATTY ACID/PHOSPHOLIPID SYNTHES - 



Score 
(bits) 

33 
32 
32 
31 
31 
31 



E 

value 

0.79 

1.8 

1.8 

2.3 

2.3 

4.0 



Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 700 to 1 of CT1042 : this 
corresponds to nucleotides 141748 to 142447 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=350 

5 ' stop=367 

3*start=392 

3 , stop*=414 

5 • prime r=GGACCGAATGACACCCGT (residues 350 to 367 of SEQ ID NO: 30) 
Tm5=57.70 

3 * prime r^CCTCCTTAAAAGTAGAAGCAGCG (residues 392 to 414 of SEQ ID NO: 30) 
Tm3=57 . 66 

probe 1 = AAGCGAGGAAGAT ATTGA {residues 368 to 385 of SEQ ID NO: 30) 

probelstart-368 

probelStop=385 

di rec t i onl=Forwa rd 

Tml=69.01 

scorel=l . 88 

length-65 
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CT1043 

Nucleotide 

Genomic coordinates: 

Start: 142610 

Stop: 143699(SEQ ID NO: 32) 
Amino Acid 

MTVLAVrTAPQIKKSKKRKIEDENEEEPVKTLEDFVKGRLLNAVKEKPAEYFELLISADT 
EAALKTAEETALRDFVIENDSVEIDVEEVLEEKPREYVFKLAGATSETLTNTIIAEVQKK 
AALITEEDITIKMLKQFRAANKDNKDGEATPEEKEDFTNNSDLVGLYLNEVVEKTTNIVI 
NKIFPHEMVFERCAILIEDFDTGWTDQAIQIPSNKYKIRLVEGDEPEVFPGDCLDLAVS 
VDKINHVLKI SAKNGCENNCFVI IPRFSPVGSVSSMILGSTDQVKPKTFLFLANKNDSTH 
FQFTMDKQHSVGCELDMLIFSERNJ^NLPDSKPRPLSDADILASYGKRLGTGVFTTENLV 
DD 

(SEQ ID NO: 33) 
Top Blast Hits 



Coramen t s : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 982 of CT1043: this 
corresponds to nucleotides 142640 to 143621 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=54 3 

5'stop=567 

3'start=635 

3 f stop=659 

5 ' prime r^CAAAATATTCCCTCATGAGATGGTT (residues 543 to 567 of SEQ ID NO: 32) 
Tm5=58.37 

3 ■ prime r=CTGATTTTGTATTTGTTGGAGGGTA (residues 635 to 659 of SEQ ID NO: 32) 
Tm3=57.33 

p robe 1 =G AC TG ATCAAG CC AT TCA (residues 615 to 632 of SEQ ID NO: 32) 

probelstart-615 

probe 1 S top= 6 32 

direct ionl=Reverse 

Tml=69.01 

scorel=1.98 

length=117 



Sequences producing significant alignments : 



Score E 
(bits) Value 



Q26938 (Q26938) KI NETOPLAST-ASSOCI ATED PROTEIN (KAP) 

023332 (023332) CENTROMERE PROTEIN HOMOLOG 

Q92ES5 (Q9ZES5) CTC PROTEIN 

AAD45753 (AAD45753) ANON1A3 (FRAGMENT) 

066878 (066878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG 

AAD45759 (AAD45759) ANON 1 A3 (FRAGMENT) 



40 0.022 

39 0.037 

39 0.049 

38 0.083 

38 0-11 

38 0.11 
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CT1044 
Nucleotide 
Genomic coordinates: 
Start: 143759 

Stop: 144689(SEQ ID NO: 34) 
Amino Acid 

MSSS SSETPKTSTDTGEERI KDI VNALDNNGEWLSSYI DPI I NNHISRKTAETVQKINQE 
VDERYDRKIADKINEIKSSIFTSAQTMYDQYAIDTFQEGKGANGTGPVMGPVNTVIDTTL 
NKMRGNMLEYAETO1WDGDDWKRFSSSMTTLEFDLSYSDLTMMRGSDGYFAFPFRGTKKIK 
MDGSRKKEDPINCIISVTYPNKVGDEWEEGKEREVNFNLERVDDYERDIHVSILCMLHAQ 
LDNFEQALGENANSFYFKKGQRVMFLPKKSKLFNRPTVEDSDMFSIIFPPASDQDFADDI 
YYRIIVTCS 

(SEQ ID NO: 35) 
Top Blast Hits 

Sequences producing significant alignments: 

013706 (013706) HYPOTHETICAL 11.8 KD PROTEIN C13F5.07C IN C 
Q20960 (Q20960) COSMID F58A6 
CAB59514 (CAB59S14) HEAT SHOCK PROTEIN 70 
Q45851 (Q45851) NEUROTOXIN TYPE F 
O17208 (017208) C01B12.2 PROTEIN 
BAA83026 (BAA83026) KIAA1074 PROTEIN 

Comments: 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 1 to 954 of CT1044: this 
corresponds to nucleotides 143768 to 144721 of the genomic reference sequence. 

TaqMan Primer/Probe Sets; 

5'start=330 

5 1 stop=350 

3 'start^Ofi 

3'stop=427 

5'primer=GCCAGTGAACACGGTTATCGA (residues 330 to 350 of SEQ ID NO: 34) 
Tm5=59.57 

3'primer=ATCGTTTCCAGTCATCTCCGTC (residues 406 to 427 of SEQ ID NO: 34) 
Tm3=59.04 

probe 1=CTCGAATACGCTGAAGAT (residues 382 to 399 of SEQ ID NO: 34) 

probe 1 sta r t =382 

probe lStop=3 99 

direct ionl=Forward 

Tml-69.00 

scorel=l . 99 

length=98 



Score E 

(bits) Value 

36 0.47 

35 0.61 

34 1.8 

34 1.8 

33 3.1 

32 4.1 
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CT1045 

Nucleotide 

Genomic coordinates: 

Start: 150687. 

Stop: 154344 (SEQ ID NO: 36) 

Amino Acid 

METTMDNVVQNNDVTKPTPDVATVTTATEKRQSCKEKKDQLKAECPQVLRALKLSNTLKA 
NFGKSMSAIFAQHLVDMTNAKHFKDPKTKKILELDGSSSSDSEEEEETSSSSKRKRGSGA 
RSASSKKEKCPNTIKNWLNDAQGVFRQFADI I INLPSFDDLRDEVKDEQTELKTIYDLYR 
QDMEKWEEVLGRQDLFDHKSEIAKGLARFDTHVSLLPSDRSAVLDSSISKELEKNSKGP 
NSNIFDTLNTLKEEIKELLCHHVKYLLQNLTPEDANFVFNSSVOTVKKSYQYYIQTSEME 
SDEFKSLLTGVNIKILEKIISSDNNVATPYKHITNPRNIISSLQKVRETKPVSKDYPFRV 
DTARDIVLLPETGGISDLPIKPVTLLQLVSYINALFSLERRNVFTDGFFNAACVLISQCL 
TNANLLSNDFPKPIE1AANVTRHNLLSMKMLQEGSSSEKKSKKKEKKKDKKKGGGGGDDS 
DSETDSSSSSSSSSSSSSSSSSEDEEEEKGEAVEKGKKTKRKTKKKPSKDDDLDTISKLI 
LKTGGYFHDTSELGNKIRNLIDKDDFAGVAQYAVTITEMQSTPMNQRLVSSLLDLIMRLK 
EQVKYSVOTESTSSTAKSNNALDSAKLTSQQVVT^VDSGAEIARIAAFFFVWDNT^ 
RHEAFILTSKLLPSNENRGLKTVVESFFKNLTISNKVSTSNEEMMSVMPFEDEQQQQQCP 
OHEQOPDLKRVVGEVFLEMGKSIVNSFPSNKSVQLTADAFKQNYSPMGRRINLAAKIKTA 
ISIGSNISPNILFSNLPESVGNNTVTGLRLTNLLKNISQSAQANNIIKNANTLVNNTMDQ 
QNSAAMSILLFPPTSKETSIFPGNDPSSIKLQDMTTMSNLARGFYSIAEGCIGVVRSREF 
DEGGVKAYTLLVDSNTMDMAVNFAAQSLEKSMSEALTNNANMNPSNVLEGGSFVDGALSY 
MFEKNGSDCEPTPLAKYTMKDVSNRYLKKFNNDKNTQDLYKNRAERALVEQVTNKPTSW 
HSQLANAMGVAVIGAASIKLMEAEAAESEMRAANYQATSKSTNAINITNTIGMIRNTTHL 
CTTIAVSAAADMSKLANNHFMSVLNTANNSHSSRRGDRSSLMLQQQQPTHSAFLEQTRGR 
GGGVLGSGTEQTKDHVERMKRDWILNMISPEDKNTTTTTPSNAGRTLGYGSNITGINTIK 
QDDKSMMDKLSEMSSFRT 
(SEQ ID NO: 37) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XYZ6 (Q9XYZ6) HYPOTHETICAL 75.5 KD PROTEIN 59 2e-07 

055035 (O55035) PEPTIDYLPROLYL ISOMERASE MATRIN CYP <EC 5-2 59 2e-07 

VTA2_XENLA (P18709) VITELLOGENIN A2 PRECURSOR (VTG A2) [CON 57 5e-07 

Q07034 (Q07034) RNA BINDING PROTEIN 57 9e-07 

N AB3_Y E AS T (P38996) NUCLEAR POL Y A DENY LATE D RNA-BINDING PROT 57 9e-0"? 

095367 (095367) CBF1 INTERACTING COREPRESSOR CIR 57 9e-07 
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Comments : 

TaqMan Primer/Probe Sets: 

5*start=1624 

5*stop=164 6 

3'start=1702 

3*stop=1720 

5'primer=AAAACAGGAGGTTACTTCCACGA {residues 1624 to 1646 of SEQ ID NO: 36) 
Tm5»57.88 

3 1 prime r^CTGCATATTGGGCTACGCC (residues 1702 to 1720 of SEQ ID NO: 36) 
Tm3=57.81 

probe 1 =CACGAGTGAACTCGGCAA (residues 1647 to 1664 of SEQ ID NO: 36) 

probel start— 1647 

probelStop=1664 

direct ionl=Forward 

Tml=69.0Q 

scorel=1.99 

length«97 
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CT1046 

Nucleotide 

Genomic coordinates: 

Start: 154556 

Stop: 156932 (SEQ ID NO: 38) 



Amino Acid 

MSLVENNTQEEMILETTVEGWEGAEVAPRGVKRPI»PS5SSSSSASDSEDDEGGEQPQTK 

PPKKKRNINSGKYWKIETIEPASPEMLSAVNDIDNVSKTIPLIDNSFGVQFKKSVSEEQI 

KTLLTETIAVEYGTITNVKYSTFNQLERTGEPLKKKRSNNGNNNYRYWQIRIEAAAAENV 

TQAVLDAIVEGNDTVIKAILLPEGEGIGLQFNKSVSSQQAKNIVQAADIEFGQVAHMKCN 

LFHKMEKADESSNSSGESPKVKKVRRNKSQPTNSYYTFTMIGDSLQERIDNAIKVIEMSP 

VKRPFSNSAAAAEEDTTTTTTTSTGVVNPRGIKDIHFFDSSISKGCFTVRNIVAANGEVP 

QEEFVSELYTNLLKVEEKVDHPTFKKLIHDRTMNRHIKAWYCICPYYTTGGVPPAADKVS 

AKGIATYRIYEDRTGVFQFDGAHTSTTPAQAAEATGAIHKSMLFQSPGTDIQKFLDAKKA 

EGLEPISSGEIVYRSKWSPNDSRATRCFKFYSSSDEKMNIADVLSIVHTDGLFSSVHFRK 

DTMEYGVAKSKSKIIPKTIKIKKGGDTFHSEEDIEVPVKFTAITSEELNRECNTKGMNSL 

RAHKKRKSNSSTTTTSTTSTSTTANTPKKTKKSASAASDPFAKLTLDYVDSTSFVFYNIS 

KEMVQRI1AQERVKTLKAVKNEEKMEIVEGEEAQETYRGIVKIKTNAKAYNLANKTSGVL 

FPADKVCLKHTLEDLGDVLDFDWREDNVNKTVASTTTTSSENKASGGDDEETPMEFETD 

GEKLLHELLNE 

(SEQ ID NO: 39) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q4 57 5 9 (Q4 5759) CRYSTAL PROTEIN 41 0.023 

097003 (097003) L1156.7 PROTEIN 41 0.030 

002061 (002061) B0041.7 PROTEIN 41 0*039 

AAD55361 (AAD55361) XNP-1 41 0.039 

KI67_HUMAN (P46013) ANTIGEN KI-67 40 0,051 

P79065 (P79065) N0C1 PROTEIN 40 0.066 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 935 of CT1046: this 
corresponds to nucleotides 155963 to 156897 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 , start=1338 

5'stop=1355 

3'start=1419 

3 ' stop=1440 

5 * pr imer «TACCCCAGCACAGGCAGC (residues 1338 to 1355 of SEQ ID NO: 38) 
Tm5=59. 36 

3 1 primer-TGCCTTCTTAGCATCGAGGAAC (residues 1419 to 1440 of SEQ ID NO: 38) 
Tm3=5 9.4 6 

probe 1=GAGGCAACTGGTGCTATT (residues 1357 to 1374 of SEQ ID NO: 38) 

probe 1 star t=1357 

probelStop=1374 

direct ion l=Reverse 

Tml=69. 01 

scorel=l . 98 

length=103 
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CT1047 

Nucleotide 

Genomic coordinates: 

Start: 159378 

Stop: 161256(SEQ ID NO: 40) 



Amino Acid 

MSCSSSSSCSSSSEEENEVGVEGGGGRIGPTEAKKKILRKRKRSSVKSTSSSSSSSSSSD 
DSDSDREEKEGRKLYVDIADTRKPPKVRKLDTPSQTLENDLYMSSSSSSSSSSSDSSSSS 
GEEESDDDDDDDYDPDNVHVLGCKKEKSPQDIEAEKEKEEEYEEEFKRMALPSRINTSVD 
DCVIPDRILTLFSTLLKKNSFQFSQPVSFLRLVMKQVNEAMNSAFSSMLSSSGMRLVEDS 
I^DTSKISSFITPQTDTSNSSSSSTFVNNCTDEDIKKRNIAMGRVAELLSNIAASSNEEN 
NFRPWSLMRGPTCGGSNASNKKLNSNRQTIPQVLNKVIFFREIHSVIALYLSSVCVQRA 
MNNDNTNSSGYAEGMVTKILNIIGKIPYNEMSREKFISVGRDALYLYQNVITDMTGPKHN 
KRLRIPQQQADFCYIIAMLVNDVPITSDLLLTGKATNLVQFASAMVDPAYRLAVHKMASV 
FNSSYSVYKVLDLDHKMLLRANLILSILSARNKCLSERKPRTLTQSVYLFLNHLLRNKLR 
SSGLTSEESSLGTAVKLVSQQLMYEGVTRQTIEDGCSMISGNFEDEDGVTLKCLGADVKD 
VKTVGLSALLSDRLRBCNIRRNVPFY 
(SEQ ID NO: 41) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 74 2e-12 

NSR1_YEAST (P2747 6) NUCLEAR LOCALIZATION SEQUENCE BINDING P 63 6e-09 

VIT2_CHICK (P02845) VITELLOGENIN II PRECURSOR (MAJOR VI TELL 61 3e-08 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 60 4e-08 

VIT_ICHUN (Q91062) VITELLOGENIN PRECURSOR (VTG) [CONTAINS: 59 le-07 

GAR2 SCHPO (P41891) GAR2 PROTEIN 58 2e-07 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 2 to 888 of CT1047: this 
corresponds to nucleotides 160277 to 161163 of the genomic reference sequence. 
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CT1048 

Nucleotide 

Genomic coordinates: 

Start: 161717 

Stop: 165020 (SEQ ID NO: 42) 



Amino Acid 

MFGSSANNFNGDKKSSSSSSAAASSDDQQLGPLGLSTADFKKVAAIIANRTESLYLLPDS 

PNFKNVINHPNQISIVPFLGSSKAAESGSANKNENQAENSSKGGSDGKKSSQQNKFNLLN 

KVEAEEMAFKRVAELIADTPPSKDNPLRDDPDAIPSRNPWVKLTQKNLEYLFWEAVTIEV 

SNDRS I RSGRYLQASEVGEN P FLMT ISVDIRI LQRMALNWWFFNRFFRMVSGLGVENRA 

NSTYVATSDAIAQIWVEMLLKNFISGENVPQALKYLKEHYEHVYNKISKCGRQPSYFWE 

FERVDNTIGETTOSDTEHNGSSYMEYRCFDTIRKNASSGPSGGGKSGVLSSGTFFIDNEMG 

NNNSSAAAASAPAVS AGVSPSLS PFSS DGDDDDDDCSGDDVWGKKMI FNTSGDGSGESSG 

QNGGGASTYKRFRCGENTASLSQKENVRLMAMPKGNEDKQLLKNIINFLNSALNSVENHV 

MCTDENIFDEDQAEHYTSNKELYKAIVCSNPANVYRVMVELFVNLILPRLRNPIVSDIET 

VQNLPSNNGSVRTKKMVEHGCTDMRYDIPPYAKGKIRLSAKRACECRKLCKDVRCFDKSR 

EANLTPSQKAGREVEEPFPRNHNSHRSNAHDFTFYDKYRARMNKLKKDSKKKVKKIDTFT 

TTDDFLLQDRNAFDLLRKCFLSASLHHIFCPDVLMVHRGDSFNINFANNKLECYNERNGI 

EEVTSSQTVNAKEALEDITKIKMKRGDDIIDWKSKGLSLREFSKKVSKIVRRFNEITNQ 

LCNNCNVNSSNGDVDFHVFTS VCVY I HN 1 1 PVLEDIS I FAELGEELTKLVKECRDVAGED 

KTYDDIIRNYEITVKYFKLFNALVKFCHRNYNVAVTSAINRRGYMCMVSNLVGYYCKLSD 

NAIQYHESLCSLHSSISYADYYTSRNNNSEDGGGNSSSEKSNADVAKTMASFYDQFDKSE 

DSKKNKNKTSNEILIKMFQMDRVLDGMDDDDDEDSDSSSSENEEEEEEEEIVKKPAKKRK 

VEDVDSNKKTLPKEPAVKKVKQEEDVEMEEVKEAAAEEEKKEEQEAKEEDATEYDDDTEE 

DEKA VAS DE DE DDE DS KAI F 

(SEQ ID NO: 43) 



Top Blast Hits 

Sequences producing significant alignments: 



Score E 
(bits) Value 



BAA83091 (BAA83091) HHNBV-XIA 
Q9YTL7 (Q9YTL7) ORF 4 8 
Q18401 (Q18401) 
035788 (035788) 
096127 (096127) 
096229 (096229) 



COSMID C33G8 

CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 
PREDICTED SECRETED PROTEIN 
HYPOTHETICAL 78.6 KD PROTEIN 



427 
88 
77 
74 
73 
73 



e-118 
3e-l€ 
7e-13 
5e-12 
8e-12 
le-11 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 109 to 282 of CT1048: this 
corresponds to nucleotides 164 907 to 165080 of the genomic reference sequence. 
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CT1049 
Nucleotide 

Genomic coordinates : 
Start: 170115 

Stop: 170733 (SEQ ID NO: 44) 



Amino Acid 

MAGIGRRDNRPVLHLDIDPNKEIPYNVPPTPIICEKNPFVFNMQKCSDCAPFPPYPGTEK 
PFPPYPGTAVEEEEKQKEIEELLVDQSFPPPFPGNKLRDIPRTYPLEFPEKKEKDFPCVD 
TTGHSDI PFI DLEKTPPRSDVRHGYHYLINPNKVGELNH I VGKLTEKQENLNKLVLDVDD 
VVINLSSTLKELEKLRAGLCKFSKN 
(SEQ ID NO: 45) 

Top Blast Hits 

Sequences producing significant alignments: 



Score E 
(bits) value 



PGCA_BOVIN (P13608) AGGRECAN CORE PROTEIN PRECURSOR (CARTIL 39 0.020 

CAB40774 (CAB40774) EXTENSIN-LIKE PROTEIN 37 0.13 

AAD45972 (AAD45972) WISKOTT-ALDRICH SYNDROME PROTEIN INTERA 36 0.30 

Q22807 (Q22807) SIMILAR TO E. GRACILIS MAJOR MEMBRANE SKELE 35 0.39 

014686 (014686) ALR 34 0.67 

Q28226 (Q28226) MUCIN 34 0.87 
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CT1050 

Nucleotide 

Genomic coordinates: 

Start: 170831 

Stop: 171461 (SEQ ID NO: 46) 
Amino Acid 

MASS SSSPVALSSVASSVMMERDEENTLSLRNRNVNKPTPVSAAWVPV DEE DEDREEMRR 
LEDFSSDEEDDDNKSCHCDHSDDDDDDEEDPSCFKGFSAGLCSFVRGFFGFLRKSLTKKQ 
VFLLTSAAVAAI FKTRDVAKTEEGAATMEENSTDVI TGGDGDSGIAADVVSLASEGEGEN 
GSLLESIATTLIKTTIENLVDGGEETTEL 
(SEQ ID NO: 47) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 637 of CT1050: this 
corresponds to nucleotides 1708 64 to 171500 of the genomic reference sequence. 



Sequences producing significant alignments: 



Score E 
(bits) Value 



077320 (077320) PFC0335C PROTEIN 

CAQC_RAT (P51868) CALSEQUESTRIN, CARDIAC MUSCLE ISOFORM PRE 
049209 (049209) PUTATIVE HI STONE DEACETYLASE 
Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN 
Q98148 (Q98148) ORF73 HOMOLOG 
O40947 (040947) ORF 73 



45 
43 
43 
43 
43 
43 



4e-04 
0.001 
0.001 
0.002 
0.002 
0.002 
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CT1052 
Nucleotide 

Genomic coordinates: 
Start: 173177 

Stop: 175862 ISEQ ID NO: 48) 



Amino Acid 

MTRHGVLVPK GRSRHVILGN VDYTFCTTDN NCVSLDIDFK DNITDQNIQL 
LNKKLGKKTA KKIKKEDAPE TKENSDEDIY ATKEFEQTIK GLQTKKGATE 
ENAIAAAAAA AT AAA V EKAM LSESEGKSMV INRARMVLSK RDTSQKQFTA 
LKNRESFFSV LIFETGSVIV VGLQDPSLTK LCVIKATTDI ADILQKNISV 
ANVSIVNTVS TFNRFHLNFI RLGKFFERNC ISYSYNPETF PGM FFKLRV P 
AKPLLPGETI GEYYTKVAMM RDSKDPNFKM SDWLRIKTAL TFKVGKITVL 
GEGESGCGDV SVVSKLLFGL FHYFMDNNIK MSPKEAQRVR EKYGIPHLEW 
YLYIDMLLHS YPYVKPSAEQ VKRAMVDQQH ISEVDRTYYG TKNSMDAAMS 
ANLVPSKEES ISFIKKIRSO QLFGHLCKPS KETTRRAIDT LSFDPINQDR 
WWNKNDQYYG KERCDPFSVA RLVSVSENTN SMMNSRISCQ GKWWLDENEY 
KDKLDHIVDL CTEEIVEECE SKGFIASPFL RKHQKEKIPT PYVLLARACN 
QKNGNKMSIN NNSNYLSGSS RAKRNAKLQE KHRVTLARLN TMMASYRFLN 
NYISTDIAPD FAKLFGNDVY SLLHLMTNLP KSRGHALTYN ERALSSNEST 
YKTPGNAYFS TLFEKSIINN QETANKGNNR KRKFSRIGQE KSSFLCNACG 
VNLNKGSDEI IKGICTSCDQ NSTSYIENAL SDINRDKKIK RFKAAATHPP 
VKQELVDSLS SSSSPSSSSS QTSNKNNRCT PSDFIDYVYK FTDETTGAPK 
VGLVFKMCDI LASLASRRGM EDRPTANYRT SLHSATQNKT NLNKLLVSAI 
KETGATETEA QIFNKIIGSE KGLSILCQLV ERRNKDNNVF D 
(SEQ ID NO: 49) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



gi|6647869|spI029874|TF2D_ARCFU TATA-BOX BINDING PROTEIN (TATA-, 
gi | 664 7868 I sp (027 664 | TF2D_METTH TATA-BOX BINDING PROTEIN (TATA-. 
gi|3122925|sp!Q12731|TF2D_EMENI TRANSCRIPTION INITIATION FACTOR. 
gi|135643|sp|P13393|TF2D_YEAST TRANSCRIPTION INITIATION FACTOR . 
gi|135627|sp|P28148|TF22_ARATH TRANSCRIPTION INITIATION FACTOR . 
gi|135626|sp|P28147|TF21_ARATH TRANSCRIPTION INITIATION FACTOR . 
gi I 333 4 374 I sp 104 3133 I TF2D_CANAL TRANSCRIPTION INITIATION FACTOR. 
gi|13S639|sp|P17871|TF2D_SCHPO TRANSCRIPTION INITIATION FACTOR . 
gi I 2833518 I spl Q57 9301 TF2D_METJA TATA-BOX BINDING PROTEIN (TATA-. 
gi|417896|sp|P32085|TF2D_CAEEL TRANSCRIPTION INITIATION FACTOR . 
gi|417 882lsp|Q02879|TF22_WHEAT TRANSCRIPTION INITIATION FACTOR . 
gi|121568|sp|P220lO|GR78__KLULA 78 KD GLUCOSE-REGULATED PROTEIN . 
gi|3122941 | sp | P9334 8 I TF2D_TOBAC TRANSCRIPTION INITIATION FACTOR, 
gi I 3915894 | sp | P52653 | TF2D_ENTHI TRANSCRIPTION INITIATION FACTOR . 
gi 1 1729908 I spl P50159 |TF22_MAIZE TRANSCRIPTION INITIATION FACTOR, 
gi I 1351224 I spl P4 8511 |TF2D_MESCR TRANSCRIPTION INITIATION FACTOR . 
gi 1 172 9907 1 spi P50158 |TF21_MAIZE TRANSCRIPTION INITIATION FACTOR . 
gi 1 1356401 spl P2 6357 | TF2D_SOLTU TRANSCRIPTION INITIATION FACTOR . 
gi|1709903|sp|P54 637|PTP3_DICDI PROTEIN-TYROSINE PHOSPHATASE 3 . 
gi)28334S9|sp|Q55031|TF2D_SULSH TATA-BOX BINDING PROTEIN (TATA-. 
gi|586175|sp|P32623|UTR2_YEAST UTR2 PROTEIN (UNKNOWN TRANSCRIPT, 
gi I 2833446 I spl 052366 | TF2D_PYRKO TATA-BOX BINDING PROTEIN (TATA-. 
gi|3915739|sp|P18428|LBP_HUMAN LI POPOLY SACCHARIDE -BIN DING PROTE . 
gi 1 135634 |sp|P26354|TF2D_ACACA TRANSCRIPTION INITIATION FACTOR . 
gi|135636|sp|P20227|TF2D_DROME TRANSCRIPTION INITIATION FACTOR . 
gi|417102|sp|P32103|Hl_EUPEU H I STONE HI, MACRONUCLEAR 
gi 12833477 | sp|Q57050 | TF2D_PYRFU TATA-BOX BINDING PROTEIN (TATA- . 
gi|3041729|sp|Q03410|SCPl_RAT SYNAPTONEMAL COMPLEX PROTEIN 1 (S. 
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gi|3122946lsplQ42808|TF2D_SOYBN TRANSCRIPTION INITIATION FACTOR... 
gi|126745|sp|P14873|MAPB_MOUSE MICROTUBULE-ASSOCIATED PROTEIN 1... 
gi|4033393|Sp|P78695|GR78_NEUCR 78 KDA GLUCOSE— REGULATED PROTEI . . . 
gi|12 6511|sp|P127 44|LUXB_PHOPO ALKANAL MONOOXYGENASE BETA CHAIN... 



35 
35 
35 
34 



0.74 
0.74 
0.74 

0.97 



Comments : 



TaqMan Primer/Probe Sets: 
5'start=12 68 
5*stop=1291 
3'start=1327 
3 f stop^l350 

5' primer=TTGGACATTTGTGTAAACCTTCAA( residues 1268 to 1291 of SEQ ID NO: 48) 
Tm5=57.29 

3 1 prime r=CCTGTCTTGGTTTATAGGATCGAA (residues 1327 to 1350 of SEQ ID NO: 48) 

38)Tm3=57.45 

pr imer Score— 0 .74 

allelel= 

probe l^AACTACTCGACGTGCTAT (residues 1296 to 1313 of SEQ ID NO: 48) 

probelstart=1296 

probe 1 S t op= 1313 

direct ionl*Reverse 

Tml=69.02 

scorel=l . 97 

length=83 
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CT1053 

Nucleotide 

Genomic coordinates: 

Start: 175839 

Stop: 177107 {SEO ID NO: 50} 



Amino Acid 

MSSTDLSKNAFHDWWSKTDCEVFDVHCETDRDCGAACENTYSVDGKEVTKFSCNQQSGR 

CARSVYSASSLERAANDLGHIIGIIKKNPKLEEELPESFLWFINHNGGDLFVNKRAAYYD 

TMHLSIGBCLDNVDTLAQGLDKRMASSLREHLLRKLDSILLQIDKVKYEKAKKWILDITQE 

AGTEEDNKEEEDAKKECKJSLSVSEIVDVLTGTHDPMPLRARGFIQKKIYPLSRNELRELA 

LKELFPEETTSPQVLSRQHDVSTREDLCNESMNAGRAESIFSDPDSGEYVATCACLYSEY 

LTGPACKHKTYRYVIDYDKWKRTGRPEFLTDPVLHFKKAEAVCKSTNPNLRAIYSPDNKG 

FLCAPVAELVKTALTFRGSHEPSLIVERDINQAENLPSNSFGVNWPYVNLLNRIQDQYT 

(SEQ ID NO: 51) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q26648 (Q26648) TEKTIN Bl 37 0.30 

RRPP_VSVSJ (P03520) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 36 0.52 

RBB 1 _H UMAN (P29374) RETINOBLASTOMA BINDING PROTEIN 1 {RBBP- 36 0.68 

CENE_HUMAN (Q02224) CENTROMERIC PROTEIN E (CENP-E PROTEIN) 35 0.89 

RRPP_VSVIM (P04880) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 35 1 . 2 

Q89487 (Q89487) PHOSPHOPROTEIN 35 1.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 1109 of CT1053: this 
corresponds to nucleotides 175933 to 177041 of the genomic reference sequence. 
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CT1054 

Nucleotide 

Genomic coordinates : 

Start: 177123 

Stop: 178524 (SEQ ID NO: 52) 
Amino Acid 

MSASLILDEYLKKTASAVLDVADSFEKIKGEIQSPEEAAALSVALYGAPPKPSASAVASII 

TGERTSLNDKYLSDNVLLKMSVARVGQENNRKRADQAADEIRTIMEDITGSLSGAYRQYSP 

I^ENKVHIGIMNNKTPSIVCGYYTMDTSISSEPLSLTDFQNPTVIANVTKRMESIFSKVD 

SARSTRFDAFVNGVANNMDIKSSIDWANMVENVIKLPDSTPNPCSVDTIVSRDASWKTAV 

NDIYASVGKSYCRFATQLTFMSEIEKLRKAAVVCFEALMSDTRERAFVEFLFYVSFKEDAS 

NTNSKLFVQNKLSSMSGNPRQPIKLVRRSAEETLFGLCFMFKVMPPEFMNCIFNFPTIPHS 

TQYHGLYGTCLTPLLRKYGSSFEKSWAHFEEILSERANAVKKFGVNDTRIDCLDAVANLTG 

PVYVLI LDLVRTLSAQRSCST KFLRE I KEN YLLWNRF VSX 

(SEQ ID NO: 53) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Valu< 

gi | 126194 | sp I P17279 | LE02_RHIRA 3-ISOPROPYLMALATE DEHYDRATASE (I... 33 1.4 

gi I 6685598 |sp I 095613 I KEND_HUMAN KENDRIN (KIAA04 02) 33 1.4 

gi|24 97227|sp|Q04893|YM96_YEAST HYPOTHETICAL 113.1 KD PROTEIN I... 33 1.4 

gi|134393|sp|P17065|SEC2_YEAST PROTEIN TRANSPORT PROTEIN SEC2 32 2.4 

gi | 547908 | sp IQ02455 | MI*P1_YEAST MYOSIN- LIKE PROTEIN MLP1 32 2.4 

gi |24 96893|sp|Q09462 1 YQ52_CAEEL HYPOTHETICAL 30.9 KD PROTEIN CI... 31 4.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 899 of CT1054 : this 
corresponds to nucleotides 177561 to 1784 59 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start-678 

S'stop=701 

3'start=754 

3*stop=777 

5'primer=CACCTAACCCTTGTTCAGTTGACA (residues 678 to 701 of SEQ ID NO: 52) 
Tm5=59.08 

3'primer=CAATAAGATTTTCCAACAGAAGCG (residues 754 to 777 of SEQ ID NO: 52) 
Tm3=58 .08 

probe 1— TAT TGTGTCCAGAGACGC (residues 703 to 720 of SEQ ID NO: 52) 

probelstart=703 

probelStop=720 

direction l=Forward 

Tml=68 . 92 

scorel=l . 92 

length=100 
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CT1055 

Nucleotide 

Genomic coordinates: 

Start: 178529 

Stop: 179348 (SEQ ID NO: 54) 



Amino Acid 

MAQTSKMGTNKRCFEEEVEEERQQPFTKKSKSEPPSFEDKSSSTSSKKKSKSNKHTKTKE 

EQLLEFVKDLERSDPTVPDEKVKQEVEEKSPEAIAEIFSMFGIAQDSKFKSLLPIERIKS 

ITTKIVIDAINQPVRKMLVDHLYHFKEMQNWEBCYKDDSDEKLSVILKSKKSPKEFDLSF 

SDYVDRLNRILVGVIKRVAGAIESKELLQSNSMIMNSVLGTVVSNIPYNMKINICVFLTN 

FI CT FAN DDL Y T FFRDDEKFVMSQVTR Y I SKD 

(SEQ ID NO: 55) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

001761 (001761) C. ELEGANS UNC-89 (GB:U33058) (NID: G1160355 45 8e-04 

Q17362 (Q17362) ONC-89 45 8e-04 

Q17595 (Q17595) SIMILARITY TO MYOSIN HEAVY CHAIN 43 0.003 

031329 (031329) ERPM 42 0.004 

FKB3YEAST (P38911) FK506-BINDING NUCLEAR PROTEIN (PEPTIDYL 41 0.007 

AAD55361 (AAD55361) XNP-1 41 0.009 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 661 of CT1055: this 
corresponds to nucleotides 178612 to 179272 of the genomic reference sequence. 
Hit to public sequence gi | 6856160 t gb I AF173992 . 1 to CT nucleotides 647 to 819 of 
nucleotides 910 to 738 of the public sequence with a 100% homology, a score of 
343 and an Evalue of 6e-97. 



TaqMan Primer/Probe Sets: 

5*start=269 

5'stop=289 

3'start=340 

3 , stop«360 

5 1 primer— CCCCTGAAGCTATTGCTGAAA (residues 2 69 to 289 of SEQ ID NO: 54) 
Tm5=58 . 07 

3 ' prime r=GCTCTTTATGCGTTCAATGGG (residues 340 to 360 of SEQ ID NO: 54) 
Tm3=58.32 

p robe 1 = AAGTTCAAGAGCCTTCTT (residues 322 to 339 of SEQ ID NO: 54) 

probelstart=322 

probelStop=339 

direct ionl=Forward 

Tml~69. 00 

scorel=l . 88 

length=92 
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CT1056 

Nucleotide 

Genomic coordinates: 

Start: 185432 

Stop: 186830 (SEQ ID NO: 56) 



Amino Acid 

MAVGDYLSMSSVGEATLVGFMILNFINFVTILSLIIYAVTDVYRRCKRPSTNGYSGCTTN 

WSSTLQEANLVTTEKDKPVQFVRGLVPRKMMEKYRSDLSPKNVGEYILPSEKETDKLKS 

DYKKGKKVGLLTALSNGHDSNKRIIGPRDLISRDDVKDKSYVFKRLSKDPLVYYSSATSK 

YTOKFSPFRAKKFMTSTQLGSKLVYPHPIRYGTAFVLPTGYVINKAYGMDNEDLHTWNPP 

SSSVLVPDSNNDRLTVECAKTDPTHRIGIYGFGGSDDNRRAKEEGYVEMLLCNCDNHKDL 

LKAPLITEYSTNPTEIQVDVAAKRVLFPAPGSEPVKSSQVTSAAHQLDGATGEHDISHEP 

VKLSDTGDYAVGSPIVFKPVYGTSLVNLPETGSPLALNCPCTDKADGIYQVNQKGGILYR 

DMVGYLNANPVEAASLSSSDSSSWLTTGNKISSVTCEGEKIKKIV 

(SEQ ID NO: 57) 



Top Blast Hits 

Sequences producing significant alignments: 

Q99175 (Q99175) HYPOTHETICAL 58.5 KD PROTEIN 
CAB4 9723 (CAB49723) HYPOTHETICAL 52.6 KD PROTEIN 
Q9Y6X0 (Q9Y6X0) SET-BINDING PROTEIN (SEB) 
Q9ZXH3 (Q9ZXH3) INT44 

Q55105 (Q55105) MULTIPLE LIGAND-BINDING PROTEIN 1 PRECURSOR 
Q9WXH8 (Q9WXH8) PYRUVATE ORTHOPHOS PHATE D I KINASE 



Score 
(bits) 

34 
34 
34 
33 
33 
33 



E 

Value 

2.2 
2-9 
2.9 
3.8 
5.0 
5.0 



Comments : 



TaqMan Primer/Probe Sets: 

5 f start=663 

5 f stop~684 

3'start=705 

3'stop=725 

5 * pr imer =CGTGATCAACAAAGCATACGGA (residues 663 to 684 of SEQ ID NO: 56) 
TmS=59.59 

3 * pr imer=GAAGAGGGTGGGTTCCAAGTG (residues 705 to 725 of SEQ ID NO: 56) 
Tm3=59.41 

probe 1=GGATAATGAGGATCTACA (residues 687 to 704 of SEQ ID NO: 56) 

probelstart=687 

probelStop=704 

directionl^Reverse 

Tml=68.99 

scorel=l .88 

length=63 
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CT1057 

Nucleotide 

Genomic coordinates: 

Start: 190875 

Stop: 193236(SEQ ID NO: 58) 
Amino Acid 

MEYMEEGDI AERRSEGVDY I LDENSACWNVKS I RNRLGAMDAEEAQY AQDI SAQLVTH I 
IRLAHCSESNKIKDTIASIAGLFINNIFDNNSTKNKLKTYNQFKAESQNKSSVLNIFGSL 
DPLSMLSSFMGSDPAKSGGENLDKSLGVLFEVLQNYNPCKIDDIVLLEMCPSKCAACTGL 
KEAIROEQPMEAMLLFFKCINHNRFNFGSDIKSAYASETCMRYSQDERAVWPLRSILLG 
CLDRDDPAHTLSSFGDTIEYADSDNAWVSSLFAAVSRMPMVDRAVIAHFYVYTMLSRHRR 
VSGDSFKQFVYTVFVRMIYSAIEILFCDTENSSVECDGKHFLSYVNAMVNVSVLGSTFNV 
LKAYRSWWDQASVAPVLDI I SGGWKKN YPS PDH IKRVAYDI SQVINHLASPSRMVKGNN 
KASNVTSGLDSIRSVRQAEKYIPFGILENKAGYGVINIAKHNISRPAREQSNGRNFNCNA 
LHILPSIKGCEALGAQKGSADQTVNVFDNFVASHMDIAMKKQGSGKILGLLTSMIDRQGL 
TTSFPSSEAEYKKRIHDFTRYVIFSSTPINDELVNSRCILPHSNVLNSPISLRNIDPESV 
PDTRFHFLLMMWQRPNIDEPNLSALTTSQLELLLSKNQKWDKLTTRAFFNIDRINFQMAD 
AIIKNVSGSGFLDGSKTASSSSSAPNFFQIFSGAECTAKQLQSIRKFI GESMQHVQKEWS 
SAVNNGNRGVENYDGLNAQFSEELFELLYKLIIEEDMRPSSLIASSEFLSNYVNAMDELL 
I RAN AS 

(SEQ ID NO: 59) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB52133 (CAB52133) RCOP C7 (FRAGMENT) 34 3.0 

IDHC_SOLTU (P50217) ISOCITRATE DEHYDROGENASE [NADPJ (EC 1.1 34 5.1 

Q23615 (Q23615) ZK822.4 PROTEIN 34 5.1 

YSW1 CAEEL (Q10017) HYPOTHETICAL 63.8 KD PROTEIN T25D10.1 I 33 8.8 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 636 of CT1057 : this 
corresponds to nucleotides 192639 to 193274 of the genomic reference sequence. 

TaqMan Primer /Probe Sets: 

5'start=1214 

5 f stop=1236 

3 , start=1281 

3»stop=1303 

5 * primer=TCATCAATCATCTTGCATCACCT (residues 1214 to 1236 of SEQ ID NO: 58) 
Tm5=58.13 

3 ' primer=CAGACCTGATACTATCCAGGCCA (residues 1281 to 1303 of SEQ ID NO: 58) 
Tm3=59.10 

probe l^AGGCTAGCAACGTTACAT (residues 1262 to 1279 of SEQ ID NO: 58) 

probe lstart=1262 

probelStop=1279 

direction l=Forward 

Tml=69.01 

scorel=l . 98 

length=90 
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CT1058 

Nucleotide 

Genomic coordinates : 

Start: 209615 

Stop: 227849 (SEQ ID NO: 60) 
Amino Acid 

M DQYPEVRDT PQTEQEQAAAQQQQAATTTAAAAAAAAPTQY SNTVSAETLSAI SE DGKLE 

RSIAASCWINNLNPDEKMAQRVQFHPLSSTTTYDSENVNPGSSVVFLKPRALPTGGTCLA 

PNY IAVPTLRAASEI I DSI ASTSLYQCSMFNSWNLI PI FMSNSKHSQFGDRVIKRSMIRN 

CFSKQKNVENLLKELRRRKVNAAKAFSHAVQQKSAVNTAIAAWNAGSAANLEKLVDFCKL 

KYSPDRKYKAGGLFSASATAQSQSGTSSSSVEHTSNDFLLDILKRHKGTSLDLDSATNTF 

DTALSRVFTE FKEQARAAVDAAADS DH LSAS DPIFSIVRHNSRREGILDSVPNI GMLAPR 

SKYSVAEYLMADRDESADIAAKIGTKIATDFEALRGDNNKRRADTSVDDLKESLADSIEK 

TSIKNTGDINSVTNIPTDTEEYEFSLHITQLFAQAFLETMGSLLSCAFGVQFPFSDEGFA 

AIERIIRKTDPDTGKVSEMDPSSLSDQYLLLVGNFQVSPFHVSDPKDIVFGRQVTPNTPI 

LS 1 1 TRS KNDKNETST 1 1 NFRDRLL VN DTVLR DATQN VSTST PSQRRV PTAAGEPKKPML 

SGCLPIIRGPQWTRESDDMISGLVGDWYISLGVYYAMGSSAAAIAAGHQRALASAESIN 

SPMMKKFSKKGGKYTEEEKRIKKAMRRNADRSARILALLGQTDAQYGYVEHNSTLDSFWS 

SNAAIRAKAKEDALSRAEILAVRKQLDGKCSSSRDEYSMVERYLRDS FFRSVNRSGGGYE 

MFDQGFDMGRFADFLSDNSAARNAWQQYAEVMRGLSKHEKRVFNIEGLFSALNSFKFPLV 

PEQGRKKTVGGRHRLNNLKAANKI ING I TEMTLQSA I DGTG I S DI IGS VSDGWGNTTAQP 

SRVKALKTLSNFSGNGNWSIPVSRAVKCAAGSRGGETLKCVDI PSVI IANLISDKRILD 

QLCGGGMNIAHEITNFIETIAGKEHTGKESVFLSPRLSVILLRYIWFNAAWSLTDSNIK 

MPLNTMSEGTGDDI YRDYLAIRGMVNN YNSSLSS I SVKAI S DRYNCGSGNTSTSNKNVTI 

KTQGELLTVLQQTANALSAFTNKGGVGATPDAANMAN V I S P I ANADWKNTNV WSGLDR 

ITETINFFSFLSQIKTMNENIEEYLRRYRLGEGLDKKELDNFVYPNIAAIVKRELGVSGS 

ALSSNLDTDRPITIDLNTEQPLIVKASKGYASNRYAKLFNKTTRTAAEQAQMEQYNAQMA 

ANTIPQLVNRLTIPGSITADTAINWKAFTENGEFSNAETHLGVMGNAINEMQPLFTDGF 

NVANKRLTVNVGSVSKLIQNGLTVSLILAHSKASPYVFKPLVQDFAKLLLAVTAETSLW 

SRSQKSFFPIPPSVFSSGGLFKIDREMFDNMKTDYVVEVIRQLSKNATAAIERCNDSDSA 

ARIAKSGEIYNKDVASTTAAPGTSSSALTLFANNLQNPAKVWSMGALPHFDMAWPKLHG 

ISHD^FRLSTYYQGIHKMEI^ISDCKPEEWDNSLPGNRASKFFGLSSVSDNNRSFNLALD 

TLLASPAEICDLVTREMVKTSNDIVHNIGSNSNTDALQKSLQVGASAVEKYDESTLSTKE 

TDVYSLVSALAKSKSPLSSSSSLSSEGHLTSKEIDRTWNTPALLGTAKTTSYSVSEDALN 

APLSAVLDFRRNWDATKSLYEVAAVCSVMSKEEDVRSSSRKIMGMVEQESPVMQDIGID 

RIASLVSTVATPKQHRRFLQTVNDYKNYLIRKVASNPLLSSRLGGISPTSGNT DYNLKAV 

YDGWSSSSSMTPSSMSVSDRFWSGVFSQCLETGPSMFADAGHGGSNMFQITAPKLYGSR 

VNTYAALSSGVERLRDS ISSATQERKNRIAKS IEALET FVTDVVGGDTLDQLRKAQNMYN 

KLSDI TSNS I Y S DFGN I DCAKIMKNVT SKKMTARQQS DT I LSS LLHELAGLVH KQQPQLA 

TQFAIAS HVI KAKYVTN DLNNI HEKET FSQLMAVAGVADYYNVSAAAMCQRLVAS DVTMF 

LGGTMLQQGLFVSFLLNNVLFSQVSDNIKMNELNDETKSL.LVKLVGFCGTVSDALGSRHV 

SSIRRVQNEEDKKLDRSFVTSLYSAYRDLRKKTELYRETDTINKLFGHQNFMSYESSMLK 

RTSLVHDAVSGPRPRRYSTLEDVLEAPSTVHKSFMVSYPERAAASRRVKRAGLRALADNR 

MESLYGEEVLNDMRSSAVSSEMMDIEYGEGGFMMMISDDEDDIAFIDSEEESESSTDFSS 

SDEYSDSSDEYDFDDDNNGQSPYSTTSYSYDALDRLNSAAKPLTAIYGCRGEGEDDEEND 

LYEEEQERRRRRSSKMGKILRDLHESDDDDDDYFDDEFDGERSMSETIATRRAGRIQYGP 

GFLSHSNILNRPAKARAFLTRGKKFRPSAYDRFFMEDDDSLLFSDESTTSSSSSDSPFSS 

FSKGRKCKRRTSEDQCAFVKRVVRAFVPTRVTMI NGRVSM I T P VTSENTVGFYENYQKAN 

KRERARLIEEYKIVKGASATLPDEYVEGRASKQVSPRELRRSLIKAAAYVARTQESNLNI 

IFDALTTTSNATLVNDPSTLLGDTLLFAKQLEAITERRhJRLMKDLTEISPSLFTSFGDAS 

KDTQMMADAKQIVSGGNFKSAGYLGVPLRTLASCIKGTNTVDRLLATKNKNHLEWMTTAA 

IVFARSFNDTTFHALEDTLKMTSALTDMYSAFTNLVGSEHSORLKVKSTLLDS I FNTRMA 

HTEAVMGLVYPTAFINHEMPSDYTQRREMQSLALNILRGVNCSQLPRKDIGDTAGLLTFI 

TSRKFAGYGGERGGLSLYRMSIVDALSCPSDNRLKGAVSLEVGKWQDMGEEIFYKRSNDL 

VDFCSKNNISLENAVGP I ARFVPNGTNMADI GMT DI I SRTVKDDASM I RLRRAEEGAGAA 

GKFI TASAMGNLYGG I DT WNLTEKLYDSFVLLQDS DS FNT PT EM AT A I INRMKSRKHKA 

LKTPFGGDIATYKNFPSSSEAIVVRAKEMRNSISTIVMDISKSRGINSFSSRSGSTLAKI 

STSEFE RI LETS AVLSNTKANLRTI ENRLAEH YNKLKQ FSH I SN DGLSETRAWAVI AES 

LTPVYADDTSERGASVSELLTDNTLLKFIVQNELKNIEEAKRHVTAAIEGSSQLHEKMLS 

LLVASADINRMSAQNNLECKKLTEGNSNFVPMTNDQGGTFIKHKETGIWLKTDEENNTSS 

IKDNDQRRVAKTILAIVEDNRNATIRSRLQSLCFGKYAMNDIFALDDADIKNMDKLIEKL 
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GEALAEKASPSSSAISSSSSSNTTSSSSSPSSSPSSSSSSFSMDYSNNLAKTIPYMPIVF 

QNKQSNVNSSDASSSSPSSSSSSSANIDNVEHKKVALQQLQTQESNDLSNVLSVTTKHRF 

ASHNQAATVGI FNGRQHAETWAIPNANKANNNATVSAGQGILTRFSAPENVSSTSMQLP 

PSSSSSSNGDDNKVPVTVRLNQYANSILSSIENASEFKDLKEAERKIDLAIQAASTTETK 

EMVTVSKCPSANQTAITAISQAKSLKKSALELLERVIKAVEVYTPDSSIAAVSLPVNGDS 

MVSSSSGSGSAPSSSSSSSSSSSSSNVTDYFNYAYGKLKNIDENTEEGAETVQKNMVEQD 

AAVRIPLLVSYAPFSEMMRRAIDKLNEYYQLIDAIKTKIVSDTKQASSWAIKETDKELDM 

DKEQV ISKINNLQQN FSNESDKIKMAI S VLDNKRNELELQNNKTRSFI ETTKSRIEAGGG 

DVANFKEIIDYENTSENDNNLFQSLKAFAADNSGTVYTPTDMSNGRDTKSDSKFVDMYNK 

QILEGGIKLINEGQNTVKVDFSKALEAFPRQSNGASEPVSSSWERRQRERLQAVEMFMA 

IMMERTESLRKRIADSAAQWNTVNNVEETVNSGMVNIKSERLTEIRNQAQIAESTALNSI 

NDEIVESPLTLSLGARVDQLLIKVDRVGSIQQQQQQQQQQQQLPKLTATEQRKEQQYAAD 

RVVYDPSYTCFLQPLHETIKRISSVYNSKNKGPLSNTRGVPTSDADLQLMTITDLSRSVL 

DSSSTSSKKMLYENVPSSIVPGLCQQCAMMITNVHEATHTSPHSFNFENKRSLKQLTEML 

NAATSSSDGPAVRHDVLTMLESNNGYVKDFGFTHRQKVACITPVNTLLGGTFSGNVAPNT 

VILPTSELFNCPGVENDKFRSMVNRTTDKNVADAPKSSASIVETLARTSPNAEHLYFPFK 

DQRRHFNSITDAIISGMSGESSSQLNTTCDQNLVNIDQTTGFPVFTGRKQGERRIVHTEN 

TMEGARKDKNSGIPSCTKDRQTYIDMGTKFMVAPGSLLNANKEETLRLNRLSDINNVRHY 

GTDVHVAGANSAWRIGEVVRAASSFPDGDKESAMKKMLLLGSVSAISAQKSASHINDPTA 

LLSTNTSIQNLVKEAFPDPVCSSNYLGSAESTFATQLAYRQRLFPNGDDENVTTVSNICP 

MDLMGSTKRYNDAFNNIFGSKMTSTNKKGSNCENLLKSAMSNVPAINTAFGAFEEASSSV 

RNRLS PLYEDSTKYSSNQLAVQAMTDTAVDALSAVST WGRQNGRNTLLSLPTS I TS I AT 

SGRPSLSYSSDMKSNLIKTISRINRDASIXSMGDSQVAAGSSFFNSFLRSSSIPVTTSQD 

GNVAAAEIVLGTILDICTVEINKRFEMLGGGKMVAGSPEARAIQRNTMSSILQMNENELAR 

DLCE I ENKI ETRQLRDAFQDLKRSMLMTPGGVGAI SSGASTNNVPLSLLMSRVDASSGLL 

MNNNSANVMEAVDSFNTTPLLVRHMMLDSGKSPVPMAKEIRSMLTQPRALTARALLSESS 

PLLTEICLYNTRDTQPERAVDRLLTSAYLVKQAKRFDGVDPAFPAALTCASHLMLSSMDS 

HTKSSFMDNIKLHMTDTQCFFKNIERFEKFLGRYGDEYAMSHKQNCNCPFHLHHTFTPSD 

NEHLVSSFAFARPEVSMEEIRATPYQANKLISDKHYVMNMSKIDSRVTGSSLLKKVSEWT 

EMRMNSNFNGTFEPSRLALSNSGMTTAGVNLDVIVKPNNARSVLGILECHRQHVCTADAK 

GTVASAMPAVFOATDGNGNESELIQNALPRNRYIQKSTMNAQTVVFANVLEQLIADLGKV 

IVNELAGTIAESVPESVYENTKEMIDRLGSDDLFKSNNNGGVESMDYEDSETTSNNGPVL 

ISEAMKNAVYHTLISGKAARPENVPFASCASGPLAFDFLLSKGDTFEEKNAEQGAAAAVS 

STYSSSSNTTLRKHLARVFEAISKQVTDAEFKDILNDIERNISSDYTNCPPNTNQNAFAL 

AIKREFSRIVSFLTILRKNITPALVDPKGALHEKVAIYLTLLSTKSKLENFFQYGLSNSS 

SVDLSHLKPINCSNNVKNIEDTFMYRNVHPILIMALPENFTALLQQEQMDPDTAIESRRS 

LTTFLNHPNTASMANGARAAVGAGGGNPMGLYLSSHILHESTVTTSNPVTDTTENVNYHS 

SVTQDPVMVVNPFKDSARLIVNNNNTGIDVLNDKSCNYLQVSMPSESSGLVTNTGCSSSS 

SSSSSDTFKYVRRDNTPVNLPRVTPAVLCSDASSNLLDVFSRADIVLENMNVRFGFMPEI 

IAAVSKFKGI*TKEEVIKC»1VSQNNINNNSNNNNGNGKKTTVDPVTGDIVITNATFPDTRP 

LYTAANGGTSSFKWGDINDRKMHAKAFPTFFIGNPTAAATANGVPLTSEGISLTEEKRKK 

I AG I SEGS I GTGALRAAANTRLSSDME P VMKGWNN I VQLQQT FKKAS DKLTH LLRSGG I P 

PRSQETNAIINKMHDSFKTLEECRRVIQDEAALLVATSDLLTGGYGGDAALAMVSPVRPE 

MTGLI GAI S AP VRGI SHLLKLGGVSAANAAI RKRLNLPTSNGKTLPEHG I VHKS AKTLLL 

DSDSISNLYNTDLQDVVSNARDNNNLGRIMQSLGLKGNNAGDLVYSARQI/TDLITVPEYG 

NNRDLTKRQAILKMLISNPEILENVADTIYLTTGKNALAPVSAQEMACASLTVGGSGGGK 

LSSDDNVQSLNRLYFRV 

(SEQ ID NO: 61) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40YEAST (P32583) SUPPRESSOR PROTEIN SRP40 80 5e-13 

094317 (094317) SERINE-RICH PROTEIN 80 5e-13 

095815 (09S815) DENTIN PHOSPHORYN (FRAGMENT) 80 6e-13 

YM96_YEAST (Q04893) HYPOTHETICAL 113.1 KD PROTEIN IN PRE5-F 76 9e-12 

AGA1_YEAST (P32323) A- AGGLUTININ ATTACHMENT SUBUNIT PRECURS 73 8e-ll 

Q9Y07 6 (Q9Y076) PROTEOPHOS PHOGL YCAN PRECURSOR (FRAGMENT) 71 3e-l0 



FIGURE 3, Sheet 35 of 160 



WO 01/38351 



77/201 



PCT/DS00/28888 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 704 of CT1058: this 
corresponds to nucleotides 22724 0 to 22794 3 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5*start«8829 

5'stop=884 8 

3'start=8894 

3'stop=8917 

5 * pr imer^CATTACAGCCTCAGCCATGG {residues 8829 to 8848 of SEQ ID NO: 60) 
Tm5=57. 78 

3 ' prime r=GCAGAACGAACGAGTCGTATAGTT (residues 8894 to 8917 of SEQ ID NO: 60) 
Tm3=57. 7 6 

probel — GAGGTATTGATACCGTTG (residues 8861 to B878 of SEQ ID NO: 60) 

probels tart =8861 

probelStop=8878 

direct ionl=Reverse 

Tml=69.00 

scorel=l .99 

length=89 
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CT1059 

Nucleotide 

Genomic coordinates : 

Start: 228374 

Stop: 230564 (SEQ ID NO: 62) 



Amino Acid 

MDKVCVISNTRERTFKVPADLLCVATEPEISTKEEDAGIEIETRVWFSRCVSVQELHTI 
NPNDEGFSVQLFKDYLKLQSAQGKKPIGLYIQIKAGEDLERRLISGGTAYLDPATHLFYL 
DFSLYPNYS I FNDISSRLKI I DEDTYNGWFSNSEEKEKDAXVLIRVTFSTHEKAIEAAI 
KKIMLRKVFFKDGDLDFGYLRIPKSKLDKFTPYFRSQYGIVNVEKNIPGYIWGEIMKQRV 
RCSRWYLYNTDSEWEYKNVAEERVGPRQLVKJCYGAKCENLCFRDIDLRKKEAKEKRDIER 
ETESRYWVTLTHKHEMPENMPYFGPKCSVVRLDETRILLCFVDEISYNDEDVDEILSEN 
RSLRN VS I RH KEN V P VHTLLKKG VS I HAR FTLNGL D DALI I LKRI PKT YFE DEE LQAACA 
HVNLEQYEWLCSNNRGNKVEHVKSRVVTRAVKRRRKCRHWIYFDKDTLNLNYKYFDKKVT 
ASMASKICNAKHDCLVFHRKMELEDLTESAYFKVEPSPINFAKLKSCPDVKYVQKKTDGT 
FSVIRFFRNMTKGDLIQRMDLFCRFIPDSHTITLLSRADFYACKRGESMHMCTNKHRILH 
YKFSNAPHAAIEQITNIISDTRGRKGIHIEYAIENVQEMYEEDGRRYEAKYTGTLTEYKR 
NEDKTFKSLLAPHLTPVNKPYNINHLYEQYGNFDEELEDKLRSGFISYDTYVTAKDNWGR 
CATGKGACI 
(SEQ ID NO: 63) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Y061_CAEEL (P34600) HYPOTHETICAL 84.7 KD PROTEIN 2K1098.1 I 36 0.95 

Q9ZAJ8 (Q9ZAJ8) BONT PROTEIN 36 1.2 

Q01794 (Q01794) MAJOR SURFACE ANTIGEN MSG1 (FRAGMENT) 36 1.2 

Q9X708 (Q9X708) BOTULINUM NEUROTOXIN TYPE B (FRAGMENT) 35 1.6 

GLND_HAEIN (P43919) [ PROTEIN-PII ] UR I DYLYLTRANS FERASE (EC 2 35 2 . 1 

BXB CLOBO (P10844) BOTULINUM NEUROTOXIN TYPE B PRECURSOR (E 35 2.1 
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CT1060 
Nucleotide 

Genomic coordinates : 
Start: 230616 

Stop: 231582 (SEQ ID NO: 64) 
Amino Acid 

MCTLKTYKMTTSTEISKNLSDVLSIKATGDWCSNIKTVrSPFTEGKGNLPNSLPFrRSPN 

TTCGSREAANATEH FI TVFAKDKYERKRVKRT I GFTLDNTKELTPNRYLVADVYSWQEEK 

MVFEGFCVP PGKSGT FVRYSNE DKS FLLADTGR YMKKK Y DDPENKTS SGGDDDDDDDDDD 

DDNNNVDVYEENDPRNVFEVEKDEKYACTFSILVYRAMKKSPPVCRGLLVETDGPSSHPK 

RAPSAFNPFGGSSMLNGYGAGADALEEEDEVDGVPERERITNFALKRGPATGQNFVSVKL 

EHDGSKADLYNVTCFSKQRGV 

(SEQ ID NO: 65) 

Top Blast Hits 





Score 


E 


Sequences producing significant alignments: 


(bits) 


Value 


097300 (O97300) PFC1035W PROTEIN 


51 


le-05 


KEX1_YEAST (P09620) CARBOXYPEPTI DASE KEXl PRECURSOR (EC 3.4 


47 


2e-04 


Q83970 (Q83970) (CPV) 


46 


3e-04 


SIS2_YEAST (P36024) SIS2 PROTEIN ( HALOTOLERANCE PROTEIN HAL 


46 


3e-04 


YB00_YEAST (P38114) PUTATIVE 126.9 KD TRANSCRIPTIONAL REGUL 


45 


6e-04 


077384 (077384) PFC0760C PROTEIN 


45 


8e-04 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=477 

5'stop=500 

3'start=555 

3'stop=578 

5 1 pr imer— CGATGATCCAGAAAATAAGACCAG (residues 477 to 500 of SEQ ID NO: 64) 
Tm5=57.85 

3 'primer— TCGTTTTCTTCATACACGTCAAC A (residues 555 to 578 of SEQ ID NO: 64 > 
Tm3=58.34 

probe 1 — ATGACGATGACGACGATG (residues 515 to 532 of SEQ ID NO: 64) 

probelstart=515 

probelStop=532 

direc t i onl =Forward 

Tml=69.04 

scorel=1.95 

length=102 
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CT1061 

Nucleotide 

Genomic coordinates: 

Start: 231602 

Stop: 232799 (SEQ ID NO; 66) 



Amino Acid 

MQLI LSHHLTMAGRVELVTGPMFAGKST YLKN I YQQENGGNKHCLFVKH SLETR YGCGTG 

TIVTHAGEVIEGCTTVSSIKELISVLPEVVDVILIDEGQFFTDLVLVNRLADKGKRIVIA 

ALDGTSDQQMFSPIHKLLPYTNSIVKLASKCMICKIDTKEAPFTVRFGNDNDNNVICVGG 

AEMYAAACRDCYKKINKKKNKGKLVVLEGGDRCGKSTQAKLLLTNKNSPLYGGEYMCFPD 

RSSHTGKLINDYLTKKIELDDHAAHLLFSANRWEVCSKIKQLLDDGIHVVMDRYYYSGIV 

FSLARGVDTVEWCSASDEGLPQPDLVLLMLLDVEKCSNRDTFGVERFETNSIQERARALF 

L DL AN K DEKN VW I KV DARGT IEEVQTKI INI VYNIVEE 

(SEQ ID NO: 68) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



074528 (074528) THYMIDYLATE KINASE 165 6e-40 

KTHY_HUMAN (P23919) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 159 3e-38 

KTHY_MOUSE (P97930) THYMIDYLATE KINASE (EC 2.7.4.9) ( DTMP K 157 le-37 

KTHY_CAEEL (Q22018) PROBABLE THYMIDYLATE KINASE (EC 2.7.4.9 148 8e-35 

KTHY_YEAST (P00572) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 143 2e-33 

KTHY SCHPO (P36590) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 141 6e-33 
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CT1062 

Nucleotide 

Genomic coordinates: 

Start; 232848 

Stop: 233334 (SEQ ID NO: 68) 



Amino Acid 

MLPRKTLPDTENGYFVLDESLLEKVYYDNNNELrVRVGGIYMQICKSKYIFHHDDPERFF 
YSVLEDYHPIKEIVERIAEEDGVFLGPWEFLSRKQVNLQHGCYKALLSLPEDKYCNLLLP 
QQMKTNLEKMEE I QRTRLI HSRT YNTPQ I ELS DQLDGCV IC 
(SEQ ID NO: 69) 

Top Blast Hits 

Sequences producing significant alignments: 

060678 (O60678) PROTEIN ARGININE N-METHYLTRANSFERASE 3 (FRA 
P95966 (P95966) ORF C04027 
Q9ZW94 (Q9ZW94) F5A8 . 4 PROTEIN 

058387 (058387) 358AA LONG HYPOTHETICAL PROTEIN 



Score 
(bits) 

31 
30 
30 
30 



E 

Value 

5.4 
9.3 
9.3 
9.3 



TaqMan Primer/Probe Sets: 

5'start-223 

5*stop«245 

3'start=309 

3'stop=330 

5 ' prime r=GAACGACTAGCAGAAGAGGATGG (residues 223 to 245 of SEQ ID NO: 68) 
Tm5-58.17 

3'primer=TGGCAATGACAAAAGAGCTTTG (residues 309 to 330 of SEQ ID NO: 68) 
Tm3=58.99 

probe 1=AAGTGAACCTCCAACACG (residues 284 to 301 of SEQ ID NO: 68) 

probelstart=284 

probelStop=301 

direct ion l^Forward 

Tml=68.99 

score 1=1 . 96 

length=108 
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CT1063 

Nucleotide 

Genomic coordinates: 

Start: 236678 

Stop: 238604 (SEQ ID NO: 70) 
Amino Acid 

MVASTPCPGPGPVPTQELLSTNFLEAHKLVVELLLPSYSSDWYCDSETYTKPIPIFGNK 

SIVSTIGDYVLSNPNEDVSYQMVSSVLEKFPLLFHCTYKTNEEDKGIPLWKKLYNKRKFK 

LLNSLLVHNNKNWTPVPAIPFDRENICDASGRSVIMSEIMSTSTFQTICKNNTHYLFDML 

NMERGKQGGSFLHFFASRKNSFTN FENEEMDSHVLSNIAKFICNEKEKLDSFIPANGKIP 

CPDKTNDEGYIPLEIAIMEDNYPALLYLVCRYGASWANTYGDHNESLKAFAIRNDAKDCL 

EIIEFISDHYSFNKNVTKEEFVKEKTVECVGCLYDIEDEKRCYKLPCGHFMHTFCLSNKC 

SKANFRCVKCFQTFDDTIFRKCPPTIQWKMGINQTTNHKEMDLFNRAFDTYLDFICSYNV 

KLDKKSKPKHKPENKKVEEELAKRTAEIEEAIKKKEEELAKRTAEIEEAIKKKEEEIAKR 

TAEIEEAMKKKEEEELSKYNKIIEKGKRRLNEECVKLRDISTAAINMYKEKVRINGVLLK 

DSDQELAEAKERLRKILLLEEETKLDRFLFRPKRVEERIFLTKDDETLAFKLALEKKTED 

IIAKKNNQKGSERRDGEYTITSHIEKLPQSTALASVCVLNE 

(SEQ ID NO: 72) 

Top Blast Hits 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 362 to 1 of CT1063: this 
corresponds to nucleotides 238163 to 238524 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=775 

5'stop=797 

S'start^SSO 

3 , stop=872 

5 1 p rimer *=GAAGACAATTACCCTGCATTGCT (residues 775 to 797 of SEQ ID NO: 70) 
Tm5=58.33 

3 'primer^GCAAACGCTTTGAGAGATTCATT (residues 850 to 872 of SEQ ID NO: 70) 
Tm3=58.56 

probe 1 =TAGGTATGG AGCATCTTG (residues 810 to 827 of SEQ ID NO: 70) 

probelstart=810 

probelStop=827 

directionl=Forward 

Tral=68-95 

scorel=l . 95 

length=98 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



Q26938 (Q26938) KINETOPLAST-ASSOCIATED PROTEIN (KAP) 
Q9ZIU2 (Q9ZIU2) VIRULENT STRAIN ASSOCIATED LIPOPROTEIN 
050870 (050870) HYPOTHETICAL 54.3 KD PROTEIN 

Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 
023230 (023230) TRICHOHYALIN LIKE PROTEIN 
MNN4 YEAST (P36044) MNN4 PROTEIN 



74 3e-12 

73 5e-12 

73 5e-12 

68 2e-10 

62 le-08 

62 le-08 
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CT1064 

Nucleotide 

Genomic coordinates : 

Start: 238658 

Stop: 239438 (SEQ ID NO: 72) 



Amino Acid 

MSTCSNLLSVFGGGDWTTTFPFDLVHTRQECDKKREQDYSFFITETCKGENIGIHSYEHT 
SKIIDTGNNDSTSIEELEVLNiyKAINHLENILKLNKGEKIILMDVETMILETHKILMKG 
ILPKGKNGSFSTCVRFAVNKNNERHYYPVFETEKEAFNSIQNLVDYYNEIVAHTNDQIKI 
IKACAYFMYNFLTLHPFNDGNGRTARLLYSFLLKGNGIVPHFSPITHPRDQFVDTLVYFR 
EHG DGRPLL Y VLLES I KNK 
(SEQ ID NO: 73) 



Top Blast Hits 

Sequences producing significant alignments: 
Q23544 (Q23544) 2K593.8 PROTEIN 

Q9ZHQ9 (Q9ZHQ9) HYPOTHETICAL 26.1 KD PROTEIN (FRAGMENT) 

068899 (068899) HYPOTHETICAL 29.5 KD PROTEIN 

Q48249 (Q48249) PLASMID PHPM180, COMPLETE SEQUENCE 

075406 (O75406) HUNTINGTIN INTERACTING PROTEIN HYPE (FRAGME 

AAC96089 (AAC96089) HYPOTHETICAL 23.9 KD PROTEIN (FRAGMENT) 



Score 
(bits) 

50 
44 
43 
43 
42 
42 



E 

Value 

2e-05 
0.001 
0.002 
0.002 
0.004 
0.004 



Comments : 

EST confirmation of the predicted transcript: 

Nucleotides 1 to 677 of CT1064 : this corresponds to nucleotides 238717 to 239393 
of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 

5'start=397 

5'stop=419 

3'start=450 

3'stop=472 

5 * pr imer=TGCGT ACGCTTTGCTGTAAATAA (residues 397 to 419 of SEQ ID NO: 72) 
Tm5=5B. 67 

3 ' primer=TGAACGCTTCTTTCTCTGTTTCA (residues 450 to 472 of SEQ ID NO: 72) 
Tm3=57.92 

probe 1 =AATGAACGGCATTACTAC (residues 424 to 441 of SEQ ID NO: 72) 

probelstart=424 

pr obe 1 S t op=4 4 1 

directionl=Forward 

Tml=68.96 

scorel=l . 88 

length=76 
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CT1065 
Nucleotide 

Genomic coordinates: 
Start: 240712 

Stop: 241192 (SEQ ID NO: 74) 



Amino Acid 

MEDLKSTIERVYEERVENLEQWTNTVEEEERTVSAIDSVLEEQKRALDAWEAAIKEREND 
LAVKEGISALVFNAADAKTRKELINTWIAERETSEKRRKEATSTNNQLKNQMSSLVNTTK 
T LKEKYNKYYRRS AI LNMQY I NNKRDYE ASQ FW V YTNN A 
(SEQ ID NO: 75) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



RADI_HUMAN (P35241) RADIXIN 
RADI_PIG (P26044) RADIXIN (MOESIN B) 
040947 (040947) ORF 73 

SSP5_STRGN (P16952) AGGLUTININ RECEPTOR PRECURSOR 
Q9Y489 (Q9Y4 89) CENTRIOLE ASSOCIATED PROTEIN CEP110 
Q21952 (Q21952) SIMILAR TO MYOSIN HEAVY CHAIN 



43 


0. 


.001 


43 


0. 


.001 


42 


0. 


.002 


41 


0. 


.005 


41 


0. 


.005 


41 


0. 


.005 



Comments : 



TaqMan Primer/ Probe Sets: 

5'start=228 

5'stop=24 9 

3'start=297 

3'stop=320 

5'priroer=CGCCAAAACACGTAAAGAATTG (residues 228 to 249 of SEQ ID NO: 74) 
Tm5=S8.34 

3 * primer ^TGATTATTGGTAGAGGTTGCTTCC (residues 2 97 to 320 of SEQ ID NO: 74) 
Tm3=57 . 45 

probe 1=AATACGTGGATAGCCGAA (residues 253 to 270 of SEQ ID NO: 74) 

probe Is tar t=2 53 

probelStop=270 

directionl-Reverse 

Tml=68 . 84 

scorel*=l .84 

length=93 
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CT1066 

Nucleotide 

Genomic coordinates: 

Start: 241184 

Stop: 241505(SEQ ID NO: 76) 
Amino Acid 

MH KFSNKFYFI I KGVLI 1 1 FVPDWFS I FLLPPLGVRHKNGGGGNEEQKSGPSQKHHI PG 
PVLIFVLIIVIVGSVVIIIGVLISVRIAVLLWSHPYIHDGQDEDTN 
(SEQ ID NO: 77) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q19507 (Q19507) F17A2.11 PROTEIN 33 0.78 

Q37367 (Q37367) NADH DEHYDROGENASE SUBUNIT 1 (FRAGMENT) 29 8.9 

PSC_DROME (P35820) POSTERIOR SEX COMBS PROTEIN 29 8.9 

YG35_YEAST (P53273) HYPOTHETICAL 117.0 KD PROTEIN IN ASN2-P 29 8.9 

AAF05147 (AAF05147) ORF33 29 8.9 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 452 to 1 of CT1066: this 
corresponds to nucleotides 241246 to 241697 of the genomic reference sequence. 
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CT1067 
Nucleotide 

Genomic coordinates: 
Start: 241774 

Stop: 243409<SEQ ID NO: 78) 
Amino Acid 

MFRQFCSLYLLQRRVNDNLRSTASASAAASLKGDGTEFITGEPPSHKMRGPSYSVLGPDP 
CEDPERVYVDI WS I LQTNN IQVTKEWELFSDKLRKLGPWI DRSGIENNGEGEEDGDENE 
DGGGNGGRIEDREAHRRKMMKKLSFVGREDPVAVDLPTWRENSTEFARRLTLKELCDLIV 
ECGCIKSKEELFDFIFEEPWEIKEAADVRGMANRSKFTKESLIDWFFEFDTYSKCVVFFE 
AVNWYLKSQASPISLVLDDIYCCVFSYIRRQTFLTRAKNPSLTVASSFSPTPDTKLLAID 
ECVQHFI^SDINISQMALTERDCFFPLLTEMPRQQKKVNTFiDTMKRPTLSLLPSTSSSS 
SSNNKRKRNTAAANILLPVYRSNFSTASNNKRLKTDDGENASACILIEGYANGKISPIRI 
MVRKSTIIPEVFNHLLFPVFASKDTGANILFFIKMKSFASASLLLPGLFRHPKQFLNGPC 
KWMTLAENNINDNNINSSTMWSYTLADYCPLGYYTQESPQPYQTCGNFTSTTNKRLQNVQ 
PLYF 

(SEQ ID NO: 79) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB38842 (CAB38842) HYPOTHETICAL 35.2 KD PROTEIN 39 0.061 

CAB52581 (CAB52581) CONSERVED HYPOTHETICAL TBC DOMAIN PROTE 39 0.10 

0844 02 (084402) RI BONUCLE AS E FAMILY 37 0.31 

Q59085 (Q59085) RNA POLYMERASE SIGMA-54 FACTOR 37 0.31 

080815 (080815) T8F5.22 PROTEIN 35 1.2 

PI4K_DICDI (P54 677) PHOSP HAT IDYL INOSITOL 4-KINASE (EC 2.7.1 35 1 . 6 

Comments : 

TaqMan Primer/Probe Sets: 

5*start=717 

5'stop=74 0 

3*start=791 

3 , stop=814 

5 * prime r=TGAAGCAGTCAACTGGTACTTGAA (residues 717 to 74 0 of SEQ ID NO: 78) 
Tm5=57.74 

3 , primer=TTTGGCGTCTTATGTAGGAAAAGA (residues 791 to 814 of SEQ ID NO: 78) 
Tm3=58.08 

probe 1 =ATCTCAAGCGTCTCCAAT (residues 741 to 758 of SEQ ID NO: 78) 

probelstart=74 1 

probelStop=758 

directionl=Forward 

Tml=68. 99 

scorel=l .99 

length=98 
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CT1068 

Nucleotide 

Genomic coordinates : 

Start: 243216 

Stop: 243798 (SEQ ID NO: 80) 



Amino Acid 

MDDSSRKQHQRQQHKLFHDVELHASRLLSSGLLHPREPSTLSDMRQFYFDYKQETTKRAA 
I ILLNTLLEYYRTPSEEWE I PFNLLLNVMNNKWSTLI PGVKISAGIISKLPWTMKTMYEI 
VSSPNNNNNNGDYYSTCRRMVMEYPIGGLLHTPAITNKYPRSRMVTCTKGKDHQKLYDIS 
RQMFDI IEANGQL 
(SEQ ID NO: 81) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P3K3_DICDI (P54675) PHOSPH AT IDYL INOSITOL 3-KINASE 3 (EC 2.7 32 3 . 0 

001590 (001590) K09H11.1 PROTEIN 31 6.7 

RRPL_DUGBV (Q66431) RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.4 31 8.8 

Comments : 



TaqMan Primer/Probe Sets: 

S'start-234 

5'stop=257 

3'start=308 

3'stop=331 

5 'primer-GGAAATTCCGTTTAATCTCTTGCT (residues 234 to 257 of SEQ ID NO: 80) 
Tm5=57 . 99 

3 ' primer^GGAGTTTCGATATGATACCTGCAC (residues 308 to 331 of SEQ ID NO: 80) 
Tm3=57.75 

probe 1=GAGTACACTCATTCCAGG (residues 279 to 296 of SEQ ID NO: 80) 

probelstart«279 

probe 1 Stop 5 ^ 96 

direct ionl=Reverse 

Tml=68.82 

scorel~1.82 

length«98 



FIGURE 3, Sheet 46 of 160 



WO 01/38351 



88/201 



PCT/US00/28888 



CT1069 

Nucleotide 

Genomic coordinates: 

Start: 244241 

Stop: 244856<SEQ ID NO: 82) 
Amino Acid 

MDLSFTLSWSAILAITAVIAVFIVIFRYHNTVTKTIETHTDNIETNMDENLRIPVTAEV 

G5GYFKMTDVSFDSDTLGKIKIRNGKSDAQMKEEDADLVITPVEGRALEVTVGQNLTFEG 

TFKVWNNTSRKINITGMQMVPKINPSKAFVGSSNTSSFTPVSIDEDEVGTFVCGTTFGAP 

IAATAGGNLFDMYVHVTYSGTETE 

(SEQ ID NO: 83) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

OB5179 (085179) FLAGELLIN A 34 0.60 

FLAA_CAMJE (P22251) FLAGELLIN A 34 0.60 

BAA83944 (BAA83944) UNKNOWN 34 1.0 

FLAB_CAMJE (P22252) FLAGELLIN B 33 1.8 

AAC25644 (AAC2564 4) FLAGELLIN A 33 1 . 8 

AAC25648 (AAC2564 8) FLAGELLIN A 33 1.8 

Comments : 

EST confirmation of the predicted transcript and hits to pblic SBV sequences: 
Nucleotides 1 to 803 of CT1069: this corresponds to nucleotides 244132 to 244934 
of the genomic reference sequence. 

Hit to public sequence gi | 68561 62 I gb | AF173993 . 1 : CT nucleotides 1 to 615 match 
nucleotides 323 to 937 of the public sequence with a 100% homology, a score of 
1219 and an Evalue of 0. 
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CT500 

Nucleotide 

Genomic coordinates; 

Start: 2425 

Stop: 1537 (SEQ ID NO: 84) 
Amino Acid 

MKNSRQRSGVWRGNSCLYKSFYFSGAI IECKKIRI IMMFLLLSLILFVCFVGVWGVI FM 
SRPNKTTTTSNKKTKKDKEKEKEDDTEGAVLGRREPENRPIGRDEEGAVEDGKEEEEVFE 
FEQPSVNTGSNTGGGGTGTVPGEGLLPPPPPTPTPTPPPTPTPTPPPPPTRTPSPSSSLG 
EDDDDDIDIDFDDNDIDEFLDSGEEMEEDEEEEDLDTLLSRLETGMSGEEVDFDASSAYI 
QPDPWVKNIERSDYTLDPMESWKVLNRSEGDIRFFVDRGITNKIKAMTEDLKEL 
(SEQ ID NO: 85) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q48373 (Q48373) CHITINASE PRECURSOR 64 2e-09 

086476 (086476) CLUMPING FACTOR B PRECURSOR 59 4e-08 

092451 (092451) ACMNPV 0RF91 58 9e-08 

Y091_NPVOP (010341) HYPOTHETICAL 29.3 KD PROTEIN (0RF92) 58 9e-08 

Q69023 (Q69023) (B95-8 ISOLATE) U2-IR2 DOMAIN ENCODING NUCL 57 2e-07 

Q42421 (Q42421) CHITINASE PRECURSOR 57 2e-07 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 24 4 to 513 of CT500: this 
corresponds to nucleotides 1979 to 22 48 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=419 

5 , stop*=4 4 0 

3 , start=505 

3*stop=523 , 
5 • primer=TGCCTGGAGAAGGTTTGTTACC (residues 419 to 440 of SEQ ID NO: 84/ 
Tm5=59.11 

3 'primer =GAGATGGGGTTCGTGTCGG (residues 505 to 523 of SEQ ID NO: 84) 
Tm3=59.88 

probe 1=CCTCCTACTCCTACTCCT (residues 448 to 465 of SEQ ID NO: 84) 

probelstart=448 

probelStop s =4 65 

di rec t i on 1 ^Forward 

Tml=69.04 

scorel=l . 95 

length=105 



4 
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CT501 

Nucleotide 
Genomic coordinates: 
Start: 7645 

Stop: 7042 (SEQ ID NO: 86) 
Amino Acid 

MTMWNKIVITTKEIMNWPMVVGVFFILAITALAVALYIRHASKQEKYSTSHINEQFTAKQL 
PVTYLSKTGKLKDMHLTHSDFMAyVDVHNRTKTLKHPMCTDEAGWAHFCLLASAEAYRRI 
RYGRGEFGPEKHSLAETIQSTVQDMSEPYITHIFKKNTDVDGHGMQSVLEKNRNKIRMGD 
GKTSSETYNLSDKSISIVGV 
(SEQ ID NO: 87) 

Top Blast Hits 

Sequences producing significant alignments: 

Q21859 (Q21859) R09D1.3 PROTEIN 
Q9Y3S0 (Q9Y3S0) EMDC II PROTEIN 

AAD25099 (AAD25099) METALLO PROTEASE DISINTEGRIN CYSTEINE-RI 
031548 (031548) YFJL PROTEIN (RIBOSOMAL PROTEIN L6-LIKE PRO 
Q4 4 602 (Q4 4 602) PHOSPHORIBOSYL ANTHRAN I LATE TRANSFERASE 
CAB52230 (CAB52230) HYPOTHETICAL 33.8 KD PROTEIN 

Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 587 to 1 of CT501: this 
corresponds to nucleotides 6996 to 7582 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=24S 

5'stop=266 

3'start=315 

3 , stop=332 

5 'primer— TGGCATATGTTGATGTGCACAA (residues 245 to 266 of SEQ ID NO: 86) 
Tm5=59.44 

3 1 prime r=AGCAGGCAAAAGTGGGCC (residues 315 to 332 of SEQ ID NO: 86) 
Tm3=59.78 

probe 1 = ATG TGTACT GACGAGGCT (residues 292 to 309 of SEQ ID NO: 86) 

probel start =292 

probelStop=309 

direct ionl=Reverse 

Tml=68.96 

scorel=l . 96 

length=88 



Score E 

(bits) Value 

34 1.0 

32 4.0 

32 4.0 

31 5.3 

31 6.9 

31 6.9 
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CT502 

Nucleotide 

Genomic coordinates: 

Start: 8502 

Stop: 7641 (SEQ ID NO: 88) 



Amino Acid 

MSSGSINNHPSSNMDTNKMEEGEEQDFDVLELDYSKIIHDITAMLSVAAPPPNSILDASD 

GLIATASATAPAAETGNSNRMRLDKDVCQLIERDIELVKSDTIEVDSIIRQLLYFGESAS 

EKNIKTNSTEKEPVYFPKEPKGEAVKLAKNTPVLDTITKLDWMANICQSNKIGVENLASA 

LQSGQLIWTTFPAAVYASLDSFYHIAIMWKLLGSFINIEALSKGSKDNLLPRDDIQVVHA 

KQEIAAMLQSRQNILGRGPSEYPPVPITAILSRTIIPLLRNFSEKL 

(SEQ ID NO: 89) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

043631 (043631) SPINDLE POLE BODY PROTEIN SPC98 HOMOLOG 38 0.063 

060853 (060853) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 38 0.063 

060852 (060852) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 38 0.063 

AMPR_HUMAN (P15514) AMPHIREGULIN PRECURSOR (AR) (COLORECTUM 34 1.2 

BYN_DROME (P55965) T -RELATED PROTEIN (TRP) ( BRACH YENTERON P 33 2.1 

AAB32396 (AAB32396) T-RELATED PROTEIN 33 2.1 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 753 of CT502 : this 
corresponds to nucleotides 7704 to 8456 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=315 

5 'stop=336 

3'start=392 

3'stop=412 

5 f primer =TGACTCC ATT AT TCGCCAACTG (residues 315 to 336 of SEQ ID NO: 88) 
Tm5=58.70 

3 "primer— TGGGGAAGTAAACTGGCTCCT (residues 392 to 412 of SEQ ID NO: 88) 
Tm3=58.70 

probe 1=TGGAGAATCTGC AT CAGA (residues 34 5 to 362 of SEQ ID NO: 88) 

probelstart=345 

probe lStop«3 62 

direct ionl-Forward 

Tml=68-85 

scorel=l . 85 

length=98 
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CT503 

Nucleotide 

Genomic coordinates: 

Start: 924 8 

Stop: 8552 (SEQ ID NO: 90) 
Amino Acid 

MDPGASAASRRALWSSTVTNTRHYQQQLNRALNKIEEEDDVEEEHGQVTTTNKEMASTST 
SSSSSSSSSPTSSAIPSSDEEEEEEEEYDSESDTNVDSLLGEEEEEDS DTESTSADANFL 
RSSSRNSTTRNRLIKKYVDRFIKYEKDILLADRNKRKKRHRNRQPQIHKLNNKRLKKPTD 
KKQKTNKKKTWRRLPKFIKKMSPASRLKFFSACIISGIKITSIIVLSIMAL 
(SEQ ID NO: 91) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

NSR1JYEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 56 2e-07 

FKB4_YEAST (Q06205) FK506-BINDING PROTEIN 4 (PEPTIDYL-PROLY 55 4e-07 

GAR2_SCHPO (P41891) GAR2 PROTEIN 52 5e-06 

095367 (095367) CBFl INTERACTING COREPRESSOR CIR 52 5e-06 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 51 6e-06 

008904 (008904) BRAIN X-LINKED PROTEIN (BRX PROTEIN) (FRAGM 50 le-05 
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CT504 

Nucleotide 

Genomic coordinates: 

Start: 13936 

Stop: 9328 (SEQ ID NO: 92) 



Amino Acid 

MAHKLLFLEEEDAKEIGTLSHPEPSFALYFISETFRSVGFCKNVTDAYPKFLPRPMDINSV 

QAIAVRIALIQFYKGRGWKKNMSIIDLVKDKVERNFKVDKKTSGGFIIGDGTGVGKTREL 

AAFVMSVILQEKALLDVQKHVGPSIFGODSDKVITAINSGVWKRHPFFIWLTCSKPLFNS 

CQQGMREVVTNSRGLRDPKFSWRKLQVPCANKPTSFKSDGKSGSMTVDVENSVSSAKDSV 

DIRFFTLRDVKEFHSKRSSRSIGDFLTETPTILFMTYSDLRTNLEFVLKFITGGTDLDSN 

KVM P I DN FVT ALLC DE FHKTQN I S DS FRKELAKTWEEE DTRVLRN IQKRANPS V S DL I N R 

FKSAMSDDRNFKVKRMKSSNNKGRVTMSNYLKLLSQADAFRIFLEILKYDTFTVMASATP 

FQSNADLHMIDHILRKSAPAYTSIQAFKEVSSATPDAMAEHSEYVTVFLEQVIKLLRNRG 

QLVSRSISMAGVDCSTTNCKASPLQKYAI DELASYCLNARQVLIDSEKVGGHVRRAFTKI 

I REH QEGGI LEEEDVEKLVAE I NSPSRKRKRAANDDDLYE VMEN I DRRFKVWVRDRDV A 

HDGKTTLRSIVQDAIKTYSQKKDALSNGGGGIITSPEVDISSIDMVAQDLYDAIKKKEKP 

SKGKTDFNEDYDDGANEEDGWGEVFDDECFEKLRRQYFINTASTSVAACKGALLNIKATS 

VTDAVKRLRTTNESKKMVMSLEQTGDSFLKNLTTRILQTIAKDESDAKYGIVDVGTFDSS 

PVANTIFSGYRLLCRAVMMASAFTISLKNKTNRRTSPAHVMLVPSVPDTEPLMALAGNPI 

DSITQSIGEDSNAEITNRKLCSRITNRGLFLVKNNTKTANTNKCISAFNNTKEVDVIMLG 

PKGNTGLSLHDSSNNSMYAKRYHCVLDVPYNAIAFLQTIGRTHRNGQLSVPQFLIFSTDS 

PAERRFFDSLDKRIKDSKAGTYADRYSNNSIDIAAAVMREQFIDQGLVLKTMGNIVQIVT 

ASMTKVHLMEHFSKMMMRTNRGGVAFVEGLTLENGIFTEVIVLAMHIALWIGAQNKITS 

SDDLGHALSFTSVLPHNQILSIVKSASQFVFSNLCLHLVHFKSDCDNLLPREKRVRDAAS 

ALIDTLNTKNNEVTSKTNKIESDAPSLTALMLPSGPRNRKMDVFSNIMAYNNNNGMDFDE 

DVPDN DEDEGCLPLQEENATTLALSN FPHDY DRAI KDAHQLVT VR I VGQGE KEGVIPISE 

CLDVPELDMTNLI PVVTATNV IQSIjAKENPGLLFT IHNAALAHSHREGYGGSHLLGLAKK 

LSRGFINFRQFQNQLFSPKKESKIMYDIFLSVKAIMARDDRYDGLCDMRMNSMMDASFLK 

VRKKPECVFITKL1DKNFRRHIINDEEEETRERFGGEEEEEDDDEEFEDEEEEQAEREWG 

EEEGESAYDISVINDKNNTIGHDVDI ILCNRKKLTLTKENSVFVNEHI DSFMVGNLIGAE 

GSLIQICF DNCTGE FEGLPKFCL YDS S S KDKDT I P 

(SEQ IQ NO: 93) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



001940 (O01940) STRAWBERRY NOTCH (SNO) 

Q9Y2G9 (Q9Y2G9) KIAA0963 PROTEIN 

075257 (075257) R31180_l 

001737 (001737) F20H11.2 PROTEIN 

085862 (085862) PROBABLY METHYLASE/HELICASE 

064516 (064 516) YUP8H12R.3 PROTEIN 



72 
62 
62 
62 
59 
53 



3e-ll 
3e-08 
3e-08 
4e-08 
2e-07 
le-05 



TaqMan Primer/Probe Sets: 

5'start=2131 

5'stop«2154 

3'start=2210 

3'stop=2233 

5 1 primer =GGTGCACTATTGAATATCAAGGCA (residues 2123 to 2154 of SEQ ID NO: 92) 
Tm5=59.03 

3 * primer =CAGTTTGCTCTAGTGACATGACCA (residues 2210 to 2233 of SEQ ID NO: 92) 
Tm3=58.06 

probe 1=TTCTGTCACCGATGCAGT (residues 2157 to 2174 of SEQ ID NO: 92) 

probelstart=2157 

probe lStop-2174 

direction l=Forward 

Tml=69.00 

scorel*1.99 

length=103 
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CT505 

Nucleotide 

Genomic coordinates: 

Start: 16983 

Stop: 14064 (SEQ ID NO: 94) 



Amino Acid 

MDERRRDPLLYPTNRSMRFTAQITLFVTVFVLGCIFALVCAAMAYNVAKmSVNFQAIHE 
LGMKSKLKAVOGANPEKTLEEYLEARGRHDGVEDASNYPPHPALLDMMNLTVKGNKWNVP 
SETKERNSRFESHDLAANRSSSLLVPEHHIDRLSEATIEKSNKYLDAVSGKKFRQRMVNL 
KDNIEKDDTELYDSLFGVHDIHHHSASGVSGDAPPPPPSTSEGHDEDVDILAYNTGGYCS 
NPVPLKEGQTCTSVCYTSRAVRVMTPFVAGGTFITHKSGEDPKPYCWSGNVPGDHIETSP 
TTGERVVKECSVHTSIVVLTDDGGWQCRPKYPTYFGGSGGTSMTACAFNPSTHKGPPPPS 
SSTPIYYDVLKKQQIRNHTEFRNSSYISKLRQSSSLAEFKIKCNDPEFLYKNPITCFCNN 
KKDVLNNDLI.SQDVTKDMKFRGMYECMENPCVMMPNIDPSFVTFDVSTMKCVPGVNNPQD 
SNRHAIIGDDRTPLVGTVPAMGrFLADQSKRGDQIHQQRPKSSIDETTAKKIALAQAPII 
TPLNLDATNTSKNVLFVPIPSTVLPPLENIPHVIIRPSSLLHRSCLAPVLNKPSSGQHRP 
FCTAPFYIEPAANVLAGMIPQKPYEHSMLATECLRNSRMVSGSVHGGSELLFSTLLSQNK 
PSSYIRTPPGGTPAPEYNSTGDQRLEEIRDFFERNFNDERRLSQTEYVIKKHARGMRTSE 
IYLKSSSWDSLMKRKEFLRHIIKKSEDTFVLKEGLLMRSYGPYAATVLARDMFDLDYLKG 
KPASKTSSTLKVSNPLQYAFPTSYSVLPEEGATDDIFSVDHNRIFDSETIPSYFDCSNVT 
PGSEKLFGTSSSSSEYRVDIDDDAWGLQSFRLDHNPKSGPWQSDPRLAFDASNISSTPE 
GATITPLSLFKKSLVEWGHKKADVQETSWFRDGVDTSEAYRRLLVETSMAVRNSWFSIAW 
ENKNYYFAKNSS 
(SEQ ID NO; 95) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUP5GALME (Q24998) PUPAL CUTICLE PROTEIN PCP52 PRECURSOR ( 46 9e-04 

CFHDHUMAN (Q02985) COMPLEMENT FACTOR H-LIKE PROTEIN D0WN16 38 0.33 

P89911 (P89911) PROTEIN KINASE 37 0.57 

CAB39619 (CAB39619) AIG1-LIKE PROTEIN 37 0.57 

CAB53064 (CAB53064) DJ15D12.2 (FHR-3 (FACTOR H-RELATED PROT 37 0.75 

Q24552 (Q24552) TF125 PROTEIN 36 0.98 

TaqMan Primer/Probe Sets: 

5*start=1446 

5*stop~14 66 

3*start=1511 

3'stop=1532 

5 'primer ^CAGGCATGCAATAATAGGCGA (residues 1446 to 1466 of SEQ ID NO: 94) 
Tm5=59. 66 

3 *primer=CTTTTAGATTGGTCGGCCAAGA (residues 1511 to 1532 of SEQ ID NO: 94) 
Tm3=59.21 

p robe 1 «GAC AGGACACCGTTAGTG (residues 1468 to 1485 of SEQ ID NO: 94) 

probelstar t=14 68 

probelStop=1485 

direct ionl=Reverse 

Tml=69.00 

scorel=1.99 

length=87 
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CT506 

Nucleotide 

Genomic coordinates: 

Start: 25878 

Stop: 25197 (SEQ ID NO: 96) 
Amino Acid 

MASVFEDPADLFANMDLTGKVPTRPNILFFEGLLPNSGKEIMENRLIHKGKCGAFEADTQ 
LAYFFPSNNEENTKKLNIGFQIKSNCLSFFIRDFLNDWLEEIKDCGPYCTFSQYMDGDKE 
IFGNSVFGQDFTIVAMDWIDKGVTF^IFVDGSDSMENMASLWMCDKIJCRMNANVVKVFVD 
NASKPKFSVCKTCRWEFPGPVSYVIEGHGMGHSDLSCDEISEFLVQ 
(SEQ ID NO: 97) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AMY_BACCI (P08137) ALPHA— AMYLASE PRECURSOR (EC 3.2.1.1) (1, 35 0.44 

Q85449 (Q85449) PROTEIN 8 32 3 . 8 

Q85439 (Q85439) COAT PROTEIN 32 3.8 

VP8_RDV (P17379) OUTER CAPSID PROTEIN P8 (STRUCTURAL PROTEI 32 3.8 

Q85451 (Q85451) OUTER CAPSID PROTEIN 32 3.8 

Q24284 (Q24284) PLC-GAMMA D 31 6.6 

Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 652 of CT506: this 
corresponds to nucleotides 25216 to 25867 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 , start=287 

5 * stop=307 

3»start=351 

3'stop=373 

5 , primer=ACGACTGGTTGGAGGAGATCA (residues 287 to 307 of SEQ ID NO: 96) 
Tm5=58.18 

3 ' prime r=TGTTTCCGAAGATTTCTTTGTCC (residues 351 to 373 of SEQ ID NO: 96) 
Tm3=58.28 

probe 1 =AGGACTGTGGACCAT ACT (residues 308 to 325 of SEQ ID NO: 96) 

probelstart=308 

probelStop=325 

directionl=Forward 

Tml«68.99 

scorel=l . 99 

length=87 
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CT507 

Nucleotide 

Genomic coordinates: 

Start: 29077 

Stop: 28330 (SEQ ID NO: 98) 
Amino Acid 

MIAIANHKHDVSDALVGAHGAKINMLYGKSSTLSVTEAALLMFNDTALTQFAQRGYEPSI 
PTIUCAALDFSLQEEEPLVAATGLDVNKAPRSWPILNCRLGYIASSNYPWAEHIISGDKE 
EIKRALEEHEKNANVRFDSDNCPVCLEDFSSTNI IRTTRCGHCI DEKCWDRLVLSTQRGE 
ITRCPVCRERTSLRPDADQVKEMLVEPIVSCKRMAVPDEQVSCKRRRIGYNRYQFLINDV 
WTDESETV 
(SEQ ID NO: 99) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q92T52 (Q9ZT52) RING-H2 FINGER PROTEIN RHA1A 45 7e-04 

CABS 1420 (CAB51420) RING-H2 FINGER PROTEIN RHA1B 44 0.001 

Q92T51 (Q9ZT51) RING-H2 FINGER PROTEIN RHA1B 4 4 0.001 

CAB51421 (CAB51421) RING-H2 FINGER PROTEIN RHA1A-LIKE PROTE 44 0.001 

017099 (017099) F42G2 . 5 PROTEIN 42 0 . 00S 

016682 (016682) 2K1240.2 PROTEIN 41 0.008 

Comments: 

TaqMan Primer/Probe Sets: 

S'start-257 

5'stop=279 

3'start=308 

3'stop=330 

5'primer^TCAATAAAGCACCTCGTTCTTGG (residues 257 to 279 of SEQ ID NO:98) 
Tm5=59.26 

3 * pr imer— CCAAGGATAATTTGAGGATGCAA (residues 308 to 330 of SEQ ID NO: 98) 
Tm3=58.96 

probe 1 =CAAT ACTGAATTGTCGCT (residues 281 to 298 of SEQ ID NO: 98) 

probelstart*=281 

probelStop=298 

direct ionl -Reverse 

Tml=68.81 

scorel^l . 81 

length=74 
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CT508 

Nucleotide 

Genomic coordinates: 

Start: 30861 

Stop: 29073 <SEQ ID NO: 100) 
Amino Acid 

MAEAAPRYRQVLEEVLENIEPYMSFLDVFTERELALLNDIITSRNSPPVPSSSFKKLDNK 

EEFRDIIYFFINNNTKSDSSPICEGMTFINALTTVCKTFRGLYENIHDDFLFVKYSLLVS 

MDNGFLRRETHGIKFGTGDDSRTGFKFTSKEQAEEEREKVMRRIKKLDGVLASLKKSTSS 

ARSGIVFYVEKCSSVIRFRLFSRIVNITSDYVAEMKKSAPLEPFDISFGYKYFVDESPCV 

TKAKRLI SNGNFI IGRPFSCLETSPSS VSTDFREEMNMDARS I ARLNWTNEERAS AYRS V 

1 1 KSFLSSIEEEMVEEYCETTTKTVAEMAVE FVDVFI EKAET IQHFQT LYS I FDTMPKFS 

AEMMDNILKNVAINEAVGSGLCGAILLWMINSRPFEEIDYNYFKICLREIMVRKKTDKLC 

DNLIVKRIVSHKNWITDPHEVKGYVRLCTKVSCFMEDLEAFLTKNPWLKHTYFDEKGNT 

LLCYCIINKYSHTSKLVKQEKLNILKPSAKGMSPLMVCAAISSPFTTRVGIEILTTNSLA 

FSFIN ENNEN VFHAAAVATS CN FLDALAKKYKN 1 1 Y DFDRS I VNARRRAM VQRP 

(SEQ ID NO: 101) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUL1_CAEEL (Q17389) CUL-1 PROTEIN (LIN-19 PROTEIN) 36 0.99 

023296 (023296) HYPOTHETICAL 74.2 KD PROTEIN 35 1.7 

057577 (Q57577) CARBON MONOXIDE DEHYDROGENASE CORRINOID/IRO 35 1.7 

GIDA_COXBU (P94613) GLUCOSE INHIBITED DIVISION PROTEIN A 34 3 . 8 

Q47427 (Q47427) PLASMID P15B GENES R, S, SC, SVM1 , SVMR, SV 34 3.8 

Q39068 (Q39068) CYCLIN 2A PROTEIN 33 5.0 

Comments : 

TaqMan Primer /Probe Sets: 

5 , start=742 

5'stop=764 

3'start=822 

3 f stop=845 

5'primer^TCCAATGGCAATTTTATTATCGG (residues 742 to 764 of SEQ ID NO: 100) 
Tm5=58.75 

3 * primer=GATCTAGCGTCCATGTTCATTTCT (residues 822 to 845 of SEQ ID NO: 100) 
Tm3=57.51 

probel=CATCCGTGTCAACTGACT (residues 797 to 814 of SEQ ID NO: 100) 

probe 1 s tar t=*7 97 

probelStop=814 

directionl=Reverse 

Tml=68. 91 

scorel=l .91 

length=*104 
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CTS09 

Nucleotide 

Genomic coordinates: 

Start: 37245 

Stop: 36048 (SEQ ID NO: 102) 
Amino Acid 

MAGTDIISSSSSGSSSSKKGGCIVSKKGKTIKGKNIVFKTSIKTSSSSEMMKKHKKRMEI 
KDMVKKCASCKKVDYSSSTLENDALRASIESTCSALNRFPEIKYGEGEIGDVLSAIRLMA 
GCLLAKNEKS FYKFFLRGFQFDKNGFMMLSEGMKRI EKMHTKIAKKVFGGCKAAPLKE DR 
EGKIPCQEFHKPSSYKGEYTTPLPPTPAPVKVLPPLLPYKNVKNKPVFVPDLAVGEAKKP 
CWVHKLFS DDPEERKRLFERHQAGRRDALMEDYGVI PNNDNEAEDTERFVSNALEYQAQM 
LELLDTANMPPPASTPVRRGRTRIVRDYDASPVPSPYSSPLHTPFDAPNVNLNPGSGRMV 
DRVRDGRRNTSRRTSAVMARRINQLQHQFLYYSSDSDF 
(SEQ ID NO: 103) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



001693 (001693) COSMID T08B2 39 0.073 

CAB52863 (CAB52863) PUTATIVE MEMBRANE PROTEIN 36 0.37 

001348 (001348) ZINC-FINGER PROTEIN KL0 (KLUMPFUSS PROTEIN) 36 0.48 

Q9ZDN6 (Q9ZDN6) VIRB10 PROTEIN (VIRB10) 35 0.83 

Q9Y2W9 (Q9Y2W9) ENDOCRINE REGULATOR 35 1.1 

GAR 2 SCHPO (P41891) GAR2 PROTEIN 35 1 . 1 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 973 of CT509: this 
corresponds to nucleotides 36148 to 37119 of the genomic reference sequence. 
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CT1070 

Nucleotide 

Genomic coordinates: 

Start: 249425 

Stop: 253221 (SEQ ID NO: 104) 



Amino Acid 

MASSGGFFTGIDDLFKTVIQQEKQEKNKPTQAPETEPKPGPSQAPDPVPDPVPKTPTNFC 
PPPPNPLPPPPPPPPKPSREERLKTSKIRLNKALSDIVEATNERVDALKENQALNTEYDK 
KDNYFQVLKCS I TPS V PTAI I GAHVKQVAKSSE I ELAVNELDIKNKCS LV YNENESLKFF 
RDHENLILQIAVQLFSRHDNTKCVGAEICVKGNEKNKFVNKLVVKKLPNAPSSSSTVLEI 
RGATRNLI^NNFNKGENNTVNENKDIPPSERANLDTTKAEISHVFSTLHRLDTKRKLFFK 
GNTFYQRKPTFDNKFRWTEVIGWTESEASKQTTKSLDKPTDDNIiEVLPHSFNNIiADHLRL 
KFKNVLYKNSTAHPGKRNYYKTQETLINPQIDSAKEYKMVFAEIDKCLDVLLAIGKNDKY 
TKSTVIQYRGKFRRYLIFCYAFYALNKAKHSRAVSPLPFNFFNLFSFMYCHGPFLHSASF 
LSTLTFVYQHMFFPMGTAAPSVSAKRLMDIDSALMKGGKGVGVRDFGSPSKTSLHTRTLV 
SFLGFAEMAMGTMTALLSGVEVRVSPALQQRISKSLERWCDSVIFIYFTFVLFHRFSGAK 
KVSLESALRLIMGQTHAHTNKVRAAKRCRIEAAEMEGVEEEEAGLTLSYAHLLGLPYSIQ 
KALGLPVPKINPLMTASSSQYNLGDFVGVEQLLKAKREFPAEGETAGFLGMFDNLVKDSI 
DKYYGEGAFSDVVENVKQ6MEQNTPYDTSSALMTPIPKAFYEEEKDVPQQEENSTQQRYR 
LNRDVEEYLMAS PMKMV FVS I LDKTNQKERFMSVGDI ALLAWCKRNVLKKDWNEYAIAK 
GNYEWLGAKMCNHLLLADLVN FGI LGDLKITNKLDTNTDTFHRDSDRLPSVADQKKFI KN 
TSLSDRKQLALVHSCVNVSTRTHVGRVTATSWAVDALRTYTRGDKDMFAALSSSLDMYHL 
GHTNSANFVPYFSRNYLCNEQENGLWGYTRRTSEKLAKEELGRGRLGGLNKVGVAKTELA 
AAAIAISSALDMGEVEAVMDDSSKVRKIASTCLNVNAAKVSAAREKAREASIKRLLLATN 
APAAGSSRNSNRFLLKDLWGFFSDPDKRQKLIKGEAVSVLCPNTGFLHAAVPDFVIEYSF 
ESETSIVRLRLRLIKPEKQDEMVCPSTAPEANKKRKLVRNNQDAVLTLDDEDNIVKYNKY 
DMVEDEEARERLRHQDKQSVIAARISKVCERKNPKKKRRLEDPELQSVDEQLIRELAAIA 
Y 

(SEQ ID NO: 105) 
Top Blast Hits 

Score E 

Sequences producing significant alignments; (bits) Value 

SSGP_VOLCA (P21997) SULFATED SURFACE GLYCOPROTEIN 185 (SSG 
YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAG 
013305 (013305) PROTEASE 1 
P93797 (P93797) PHEROPHORIN-S PRECURSOR 
AAD37432 (AAD37432) M-LIKE PROTEIN PRECURSOR 
Q54071 (Q54071) M PROTEIN PRECURSOR, MSZW60 



64 


8e- 


-09 


62 


2e- 


-08 


61 


5e 


-08 


61 


5e 


-08 


59 


2e 


-07 


59 


2e 


-07 



Comments : 



Tacplan Primer/Probe Sets: 

5 , start=1884 

5'stop~1904 

3*start«1931 

3'stop=1952 

5 • prime r=CCGAATAGAAGCAGCGGAAAT (residues 1884 to 1904 of SEQ ID NO: 104) 
Tm5=59.32 

3 ■ pr imer=TGGGCATAAGAGAGTGTCAGGC (residues 1931 to 1952 of SEQ ID NO: 104) 
Tm3=60.21 

probe 1=GAAGGTGTGGAAGAAGAA (residues 1906 to 1923 of SEQ ID NO: 104) 

probelstart=1906 

probelStop=1923 

direct ionl=Reverse 

Tml=68-98 

scorel=1.98 

length=69 
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CT1071 

Nucleotide 

Genomic coordinates: 

Start: 253296 

Stop: 255120 (SEQ ID NO: 106) 
Amino Acid 

MRDDTFNQETAVKLVRWYTEYDCCCPLVNRVERLLGSFGGGVDATSVRSRPALYEEDKKG 
DKCIPFRITSLIEGILLERALTKPDLAAAAFDVSEKLVYCSCNNTQGNFDVSSMTIWIDG 
NNSKKYEVTCPSCTVEKISGGAESIHKKPMSLLAFFNNLVEKEAFAERIELKKLYLSLLT 
GSAAGGGGMYKDSSQQSSFNGSWTSLLFHTSKKDKTRLEAEVLVSNKIKHTSRLQPRCVC 
S DLL YALCST TNNSAS YAYKARNLCV I EGGE FLY FKYT I FEENGPFDSKT DLQSLVNNEP 
VSETNSSALAASSSSLEDDDDCCDDDDDDDDDEDEKTKKKQPKKQTKKQKTTTSTLPPIS 
KTNHDNMLMNVLKKGAVNGKRKMMDSLSGKKGQHSKKLKTSAAAGGGASSDVVAGENEEE 
NNPSSVSPTNNRDRKDYVLPCPQIEEVTIFSQHRMNNNKLAESVVKHSVVINGNCLNLFV 
TQHRKKYILPHENILFCPPLVQHVGFNKFRILTGVSCFFDRIEIVFSDQSDSWLSNNAA 
HSAILRLLSYIRENSLKRSVRTASVKGIDFWKSQDTNIGIPLSNKEIRERQLCSASTLS 
MLAGLGK 

(SEQ ID NO: 107) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q20497 (Q20497) F47A4.2 PROTEIN 52 2e-05 

KS1_HYDAT (P38978) KS1 PROTEIN PRECURSOR (HEAD- SPECIFIC PRO 50 6e-05 

YIL2JYEAST (P40480) HYPOTHETICAL 123.6 KD PROTEIN IN POR2-C 48 2e-04 

FKB4_SPOFR (Q26486) 46 KD FK506-BINDING NUCLEAR PROTEIN (PE 47 3e-04 

Q06459 (Q06459) NUCLEOLIN 4 6 9e-04 

NPM XENLA (P07222) NUCLEOPHOSMIN (NPM) (NUCLEOLAR PHOSPHOPR 4 6 9e-04 



Comments : 

TaqMan Primer/Probe Sets: 

S'start^lOO? 

5 f stop~1032 

3»start«1084 

3 , stop=1105 

5 f primer=AAACTAAGAAGAAACAACCCAAGAAA (residues 1007 to 1032 of SEQ ID NO: 106) 
Tm5=57 . 02 

3 * prime r=TCAACATGTTGTCGTGATTGGT (residues 1084 to 1105 of SEQ ID NO: 106) 
Tm3=57.4 4 

probe l^CACTTCCACCTATCAGCA (residues 1064 to 1081 of SEQ ID NO: 106) 

probelstart=1064 

probe lStop=1081 

direction l=Reverse 

Tml=68.99 

scorel=1.99 

length=99 
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CT1072 

Nucleotide 

Genomic coordinates: 

Start: 255074 

Stop: 257477 (SEQ ID NO: 108) 



Amino Acid 

MLSFNPEYASW FGKMITDPGVILPVSKDWLFGSRGQSDVGIMTLDPHDLDIKITSKRIG 
VEERLAQYNTLPMDFTRAMEKELNNSRNMKESIFTGIFLDTGSAIFEDNMFNGGGSALRL 
IRSPALNSAVFSSKNYIIKQLPTITKSLRRSQARDKQVDKTREKIVVDSFSILSAIAAQV 
MHLTDGEMTYVPDGHCVNWMSETNASSIYLIINDPTGSGWKIMPNNFNKTLEMRDGVID 
RVETLVEFACKCVASSLIKRGMDLVDMQRTIRSMDFLPPASSTSNNTPRVAIMTSGSSTT 
TGIGSLSIIAEDGSTHHQIKLSEYRTGLSITENNREVSFTVEPSIDGVQAEHPLSPSILQ 
WLPPLVKRPEVVAAAAAAWEEENGDNKPSDKDNEDKYSDTDFWSNVPVTPLITPKKWRA 
CKINDRAMISSWKNNLVKLHKYDWTNKTTKVDYFDKMAAFVALMTFRKFQDILADNYVPP 
QTPSQG SEYAVTMSN V ATLFT DV YGFESNGNKPL FALEQLEN ETG I ES I YVLN 1 1 GNS P D 
GNSVRVVRLEKEMSFLLKAKQYFTEMAIPPINEKCKWTDKAPSSVKEYKYFCDLTAFISK 
RPRKDNNDGGVEHSALTYTPRCIYHTERCLVHLYSEPEKITEHVSFNKDLNILEIGKNIT 
NQYQTNYKSIFEIVDVPIIVASMSSTKTMTVNNYIISTPSATTKFVQDPPKTGKQLLAVE 
EVRNFKLKSVLVPPPYFRDNKRNTTLCSQITEQNCPSSSEGGRFSCPSESLILKYSNLSK 
KRALEE I APETE TS I LSLAM 
(SEQ ID NO: 109) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TFC3_YEAST (P34111) TRANSCRIPTION FACTOR TAO 138 KD SUBUNIT 39 0.12 

Q9XIC1 (Q9XIC1) F13F21.2 PROTEIN 38 0.20 

P95629 (P95629) PUTA GENE 37 0.60 

Q9XGN1 (Q9XGN1) TTG1 PROTEIN 35 2.3 

085475 (085475) CELL DIVISION PROTEIN 34 3.0 

Q00741 (Q00741) TAMA 34 4.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 932 of CT1072: this 
corresponds to nucleotides 256520 to 257451 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1048 

5'stop=1068 

3'start=1099 

3 , stop=1120 

5 , primer=GCAGAGCATCCTCTATCCCCT (residues 1048 tol068 of SEQ ID NO: 108) 
Tm5=S8.38 

3 'primer— CTGCTACCACTTCTGGCCTTTT (residues 1099 to 1120 of SEQ ID NO: 108) 
Tm3=58.57 

probe l^TTCAGTGGTTACCTCCTC (residues 1076 to 1093 of SEQ ID NO: 108) 

probel s tart=107 6 

p robe 1 S top*= 1093 

direct ionl=Forward 

Tml=69.00 

scorel=1.99 

length~73 
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CT1073 

Nucleotide 

Genomic coordinates: 

Start: 257551 

Stop: 259132(SEQ ID NO: 110) 



Amino Acid 

MGVQKNILVGGGGGVSLLLGWTLLGTVTEGAPAVPPFSSSSYSFTPESSVFWVEGNRVL 

SGTKKDTLINVLGKKIPYYANSIFRHDCSETRSIQWPETSPLGLNLIFCSCASHEHQHRT 

HETTEPDDLLWDGSRKTTTIILPKKWWSDVVWTSLWRDNDQKCGCGQAFVSSFTSTQKEV 

QGEWLAAHTNGKTSEGDTNSAYLFISLQRTTLKPIITDVTEDNMMMGRMSGTPMNPKDMT 

YFVN DFSDD I GS TPQCLVSNS DI LNKREEW I AVWGVADSKDLLTKHQLGERE YGS EGRRR 

NPGVEEEEEERVEEEEEVEVALPYIKKSGKLIGPRRRPLTTTTTTTTTTTTNPIVREVVE 

DFDYESFNEPEIFGSNSKLPFIRFLDQKNWRLGIMSRVSSSIANFKIEQESSKALFCLAV 

WVGDEHTPKFRLSVWKNWKPFTSAPIIVQNVGYSSDVFWHETLRSKIVDRSRDLIETKVT 

KKIGEDWANKKQTVVAMFISGIVCITVTVISIFSIVIYYKIKMPKF 

(SEQ ID NO: 111) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

G1184543 (G1184543) NAD+ : PROTEIN (ADP-RIBOSYL) -TRANSFERASE, 44 0.003 

017412 (017412) CHITINASE 41 0.015 

AAF00095 (AAF00095) HI STONE ACETYL/TRANSFERASE MORF 39 0.10 

AAF00100 (AAF00100) HISTONE ACETYLTRANSFERASE MORF BETA 39 0.1O 

015087 (015087) KIAA0383 (FRAGMENT) 39 0.10 

AAF00099 (AAF00099) HISTONE ACETYLTRANSFERASE MORF ALPHA 39 0.10 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 511 to 925 of CT1073: this 
corresponds to nucleotides 258 607 to 259021 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=698 

S , stop=721 

3'start=823 

3' stop=84 6 

5 • pr imer=CCATGAACCCTAAGGATATGACAT (residues 698 to 721 of SEQ ID NO: 110) 
Tm5=57 .04 

3 1 pr inter -GAGGTCTTTAGAGTCTGCAACACC (residues 823 to 846 of SEQ ID NO: 110) 
Tm3=57 .29 

probe 1=TTCGGACATCCTGAACAA (residues 780 to 797 of SEQ ID NO: 110) 

probe 1 star t=780 

probelStop=797 

di rectionl=Forward 

Tml=69.00 

scorel=l . 99 

length=149 
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CT1074 

Nucleotide 

Genomic coordinates: 

Start: 274526 

Stop: 275153 {SEQ ID NO: 112) 
Amino Acid 

MYIFVEGSPLTGKSSWMSICLIDTGSCGMSFLNFLRMNTSDYYNWPAEIGTEHLQLGFRET 
RWDGMFEPVLKTFVDSWKKEQGKESLKEYLDYNGQVMEIYIAEWLRQRPLAFHVFTYTD 
EAVKSGFLNEE DLDMDTATKWM AE 1 1 REKRGNIQE I KVTPRVV FNGNVCS ACFSNTKRNL 
YNFGTNYNNWHCDLLCPFARHRIVHFL 
(SEQ ID NO: 113) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9Y7X6 (Q9Y7X6) HYPOTHETICAL 75.7 KD PROTEIN 38 0.079 

Q9YBC0 (Q9YBC0) 431AA LONG HYPOTHETICAL PYRUVATE DEHYDROGEN 35 0.52 

067329 (067329) DIHYDROOROTATE DEHYDROGENASE ELECTRON TRANS 33 1 . 5 

DP01_BORBU (051498) DNA POLYMERASE I (EC 2.7.7.7} (POL I) 32 4.5 

YHC3_YEAST (P38742) HYPOTHETICAL 130.0 KD PROTEIN IN SNF6-S 32 4.5 

051342 (OS1342) ATP- DE PENDENT CLP PROTEASE, SUBONIT A (CLPA 31 5.9 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 563 of CT1074 : this 
corresponds to nucleotides 274555 to 275117 of the genomic reference sequence- 
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CT1075 

Nucleotide 

Genomic coordinates: 

Start: 277704 

Stop: 278079(SEQ ID NO; 114) 



Amino Acid 

MWRSCI SNIREMGDNKDYETRLIQRI NDLESE IENKTELCEKINEQMKNTQLKYDKCFVE 
EETEKFRKMEERVKYLKEQGI PLDPEERRTMLAEIDKSNKELDALLEENERI IKLIDEEL 
ESMK 

(SEQ ID NO: 115) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

044929 (044 929) MICROTUBULE BINDING PROTEIN D-CLIP-190 47 5e-05 

013439 (Q13439) TRANS -GOLG I P230 (256 KD GOLGIN) (GOLGIN-24 43 7e-04 

Q59037 (Q59037) HYPOTHETICAL PROTEIN MJ1643 43 0.001 

Q25662 (Q25662) REPEAT ORGANELLAR PROTEIN 43 0.001 

KINH_DROME (P17210) KINESIN HEAVY CHAIN 42 0.002 

AAD29948 (AAD29948) MYOSIN HEAVY CHAIN 41 0.002 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 388 to 1 of CT1075: this 
corresponds to nucleotides 277689 to 278076 of the genomic reference sequence. 
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CT1076 

Nucleotide 

Genomic coordinates: 

Start: 278221 

Stop: 278473(SEQ ID NO: 116) 



Amino Acid 

MQKKYDKLFEDDKRFREIEERILQQKEKGNPLDPEERLVLSADIDRSMKEIDDCLEEINH 
IELSI DTLLDECENLH YGLQTTK 
(SEQ ID NO: 117} 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

018164 (018164) Y6B3B.1 PROTEIN 35 0.076 

P70388 (P70388) RAD50 H0MOLOG (S. CEREVISIAE) 35 0.10 

Q06268 (Q06268) INTERMEDIATE FILAMENT PROTEIN 33 0.30 

USOl_YEAST (P25386) INTRACELLULAR PROTEIN TRANSPORT PROTEIN 33 0.39 

Q07380 (Q07380) HYPOTHETICAL 206.5 KD PROTEIN YDL058W 33 0.39 

096275 (096275) RESA-H3 ANTIGEN 32 0.51 



Comments : 

Hit to public SBV sequence: 

gil 6007410 igb|AFl78573. l : CT nucleotides 13 to 252 match nucleotides 2050 to 2289 of the 
public sequence with a 100% homology, a score of 476 and an Evalue of ie-137 

TaqMan Primer/Probe Sets: 

5*start=88 

5 f stop-Ill 

3'start=147 

3 , stop=170 

5 , primer=AACCCTCTAGACCCAGAAGAAAGA (residues 88 to 111 of SEQ ID NO: 116) 
Tm5=57 . 69 

3 *primer=TCCTCGAGACAATCATCAATCTCT (residues 147 to 170 of SEQ ID NO: 116) 
Tm3=57.97 

probe 1=TATTGTCGGCTGATATTG (residues 116 to 133 of SEQ ID NO: 116) 

probel start—11 6 

probelStop=133 

direct ionl=Forward 

Tml=68.98 

scorel=l. 8 8 

length=83 
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CT1077 
Nucleotide 

Genomic coordinates: 
Start: 278723 

Stop: 280976CSEQ ID NO: 118) 



Amino Acid 

MEKKTETAATTEKDPEPSVSKRSRNKEPKTTSTVYTSVKCYLSSIIKSESSRSNVTSTKE 

RFEERCKS VS KMMVKGSLFLRLWDECLRRYNHLEDE I DKWP DMTKDN FYVQLLRKGLDK 

KKLKEGSTHPWEDVWNSPIVQETFLSQQGEGNNPIKRHLMDFNTITYAAKQLKTCFETN 

LRTHFRTRQQRAISGWLAENGFDKKYTKLVQHWIIGCTYKSDWVDSGDLERVKEGTKNFV 

TLHRKHLCVISDKBCNGTISYSPEEKYPIPSILNYYKFLQTEYPQNKKIQKMIVVPKHKLK 

I HYCTFDQTT IQGICKD1»GVWKIDMEER11KQSEDI LYKQGWYLLFDVKKI KKLRPNWNFHS 

IQTDGEGVSVLFSREVEEVETVSKKSKKNKKPRGDEDRRNYPPTNAKYVVGVDPGRTNVV 

SCSVFDTRQKRVVRKHRMTAKQYYQESWMTDRRKANETYKKNNKEYKEALEEITRYDNGE 

EIINDGNGDTSTPTKKFEAYLKWNEHYRLLWNEKGKKKYRKNAMKVYSRKQKCISNFID 

ELIPKRDKIEDYHIAFGDAKFACTGRGEQYASPARIFAKKIKERVGGDKRFTFVDEKYTS 

KVCHRCNQPLNMLEKDCFSPNKKRKPPTIVTTTTTTTTEEDEENGKWKKATPLRENRDTR 

RCSSEKTQFGYSSNRKVSTGDISMETPVPSSTSSSFCTPTSITCVLGGKFVDRDFNASTN 

IVHKFLGFWDKKMEKKDKMPLKYHFIRVA 

(SEQ ID NO: 119) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9ZAY5 (Q92AY5) SURFACE PROTEIN C 46 7e-04 

CYL1_HUMAN (P35663) CYLICIN I (MULTIPLE -BAND POLYPEPTIDE I) 41 0.022 

062231 (062231) F35E2.9 PROTEIN 39 0.086 

Q1208O (Q12080) P2610 39 0.11 

MYS2DICDI (P08799J MYOSIN II HEAVY CHAIN, NON MUSCLE 39 0.15 

069188 (069188) C3 -BINDING PROTEIN 38 0.33 



Comments : 

Hit to public SBV sequence: 

gi I60074i0igbi AF178573.1 ; CT nucleotides 12 to 297 match nucleotides 2549 to 2833 of the 
public sequence with a 99% homology, a score of 551 and an Evalue of ie-159 
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CT1078 

Nucleotide 

Genomic coordinates : 

Start: 282175 

Stop: 282586 (SEQ ID NO: 120) 



Amino Acid 

MGNSESRSSGIEIVHKNGAPKRSHKTLYLSNRTERHAQIQKQIEELHHKTNKQFEQAOKV 
LDKNEERKKHQQQQQ 1 1 1 PL DPEERRAILAEI DKHMKEI DG FI EESERLGLLV DAE INN L 
EEKEVEEEHLLKQKED 
(SEQ ID NO: 121) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

ARP4_STRPY (P13050) IGA RECEPTOR PRECURSOR 45 2e-04 

AAF05247 (AAF05247) ORF133 43 6e-04 

MYSG_CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC 43 8e-04 

AAD4 6501 (AAD4 6501) LATENT NUCLEAR ANTIGEN 43 0.001 

040947 (040947) ORF 73 43 0.001 

Q9ZGM5 (Q9ZGM5) M-LIKE PROTEIN (FRAGMENT) 42 0.001 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 5 to 34 4 of CT1078: this 
corresponds to nucleotides 282190 to 282529 of the genomic reference sequence. 
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CT107 9 

Nucleotide 

Genomic coordinates: 

Start: 286863 

Stop: 289635 (SEQ ID NO: 122) 



Amino Acid 

MSSSSSSSFSFRISTYQTFLKALAHPDLVDKITQKCDETGRNQKCPIQFLADISHLIQGE 

RNGGNLFPLHPFKNQPHLEPRIVGSLHGRTLDNDIEESYCYFVKDLYNGVFSYVNGVKEL 

QGVLDKKISGSGSGESSSSRAPLIPITDVDLLYIFGTLVVLPPRSKAYRVITEAVLALPF 

NEFSNNWPPTNIKGAYVSRDFRMFNLLAGLDHIEGEVGGESEWESIHASWKRMVTIMRN 

KAEKKPPSTSRIFRVYVAEPVNDAVTKIPIRVLSKLFGSRIAGILQKVYSYSMLNLPYLL 

SSNSIDIKQGVKGITLSIPSARKLGFYLLQKDTTLQSSLSQDVADCIVSINAGIIGDDFS 

EKIRQCIEEKNKPENCCMCFCEIDKTPDFSYSEHVARHNFFPVHAFSSSHDDKCCGAKIC 

SECI FPYI I SLYEKMTGVAGVKVVDLFQCPGCKSGMLNLKGRCYEFSNLCKRMI LPYTST 

HCSSLFDATINRAEACFYSLEFLQYDFETARRIAHGAKDIPHVYNKVVKNVKDLDRLCAL 

YCYKCVSPVVCDEPNESTDYEMVDVTPPLINLTEIVDSEEYDDGPGNHMWPAKFTCNFIA 

GSSGETPTISTCRDAVTFLGRAPRKKMAGWDDQSAVGQAIIALANWRKSGELPKNMFDLL 

EGVNAVLYRGDSFLLRAINYPCVIGRSMSPSLELVKRKVNKIALIKAFFHEKRVRPDASK 

KLLEWAELLVKS YLMEVLLQT PECV I HRAHS FVGKTLLI TDELVHMRPDDATRN A YIQNL 

NAARQNAAAAASFSGSLPKPEFVPCKERTIEWMYEKDNDDVRVVNCPSCKKAIQKYGGCV 

NVFCECGTNMCWICEEKVSPADSNHCVEKHRIVYSNCVRVKYALESMYGFEICTMKNVEE 

GVKNYYVMENGFFFDVQEMVAKK 

(SEQ ID NO: 123) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



AAF04 637 (AAF04 637) HYPOTHETICAL 84.4 KD PROTEIN 213 3e-54 

097226 (097226) PFC0175W PROTEIN 49 le-04 

Q94981 (Q94 981) ARI PROTEIN 48 4e-04 

CAB45785 (CAB45785) HYPOTHETICAL 262.6 KD PROTEIN 46 0.001 

CAB36714 (CAB36714) HYPOTHETICAL 68.5 KD PROTEIN 45 0.002 

Q9XII0 (Q9XII0) F7H1.11 PROTEIN 45 0.003 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 910 of CT1079: this 
corresponds to nucleotides 2 88641 to 28 9550 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 , start«114B 

5 , stop=1167 

3'start=1192 

3 , stop=1211 

5 ' pr imer=TTGACAAGACGCCCGATTTT (residues 1148 to 1167 of SEQ ID NO: 122) 
Tm5=59. 14 

3 "primer=TGGACGGGGAAGAAATTGTG (residues 1192 to 1211 of SEQ ID NO: 122) 
Tm3=59.35 

probe 1 = AGTGAACATGTGGC AAGG (residues 1174 to 1191 of SEQ ID NO: 122) 

probelstart=117 4 

probe 1 S top= 1191 

direct ionl=Forward 

Tml=69.00 

scorel«l .99 

length=64 
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CT510 

Nucleotide 

Genomic coordinates : 

Start: 38917 

Stop: 37381 (SEQ ID NO: 124) 



Amino Acid 

MAETVAVDEVPTCPICMGDYDS DTDCYNWSNGGMPCCRKSVHLECLFTWRFEEHMVNENH 

LLCPMCRAY I PPVWFFRKVYEEV YKYAS FHS FLLSADY VNDEGVKDTLNKMSTI LAPT FF 

VPNAKGVNENEDVYMERAYTKLSFWLETLSRQEMHAFSEETFEDNHEAALMGKFKDIPPY 

EYEGEWLKYVAPNTIDITOCLSNDDDDDEGDNNVSPSLLSGVTSFNFIEDDEDTWFVPP 

EVDDNDDSESLPDLTVPPRSNNITFDTISGISSSLYDVNDDDDDDDTMSLPDLNMPSAST 

SSAPTSSAFTSTSLNINVNLCFNVDSDSDDEEVIPSSSSVNQPSTSSGSSSSSSNSRKRP 

RYGRDEDRMSNISSESKRLCVDVKRYMCRLDNIDEEYNEIANRYLAELSALRERRQETEN 

KLGDCISRGNLFHTTVNDVIGKSLCSKKLKVKRKYASKWSANKQLIGSCLIKSASNNARL 

DDEIAHVHSSLLNGFDTDPSEADQISSLPNL 

(SEQ ID NO: 125) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



O70567 (070567) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07 

P97399 (P97399) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 53 4e-06 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 53 6e-06 

P87736 (P87736) RING-FINGER PROTEIN (FRAGMENT) 51 2e-05 

Q53653 (Q53653) CLUMPING FACTOR 51 2e-05 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 444 of CT510: this 
corresponds to nucleotides 37411 to 37854 of the genomic reference sequence. 
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CT511 

Nucleotide 

Genomic coordinates: 

Start: 40718 

Stop: 38972 {SEQ ID NO: 126) 



Amino Acid 

MGGPTVITTTINTGGDHHHQQYVYHQGNKKRPVEEYNNNNYASGSTSEATTVPAYNNNNN 

NITIKTWDDVINLSITPPPPKRFKKSEVAPSPPTTRTFSNVCASKVIRQCKRQYNEWIER 

DS P YY FKG I EKSCSLE DN YDTCQQLR I G HRS I VXS SK YVH DTCFYGKDP KVGF YW PTS S C 

DEEMRFFDTRHILKELSSRNIPSSQIMDIMYMAVEVFQLPSSACERIRQKTSTLIKEVSD 

QCENWENFRKTARGCLSDLVEVPEDVKDFNTFICPWETFFEIKYGVYYIVNRGTVVKFMK 

DMNYEE F V FECVNGLS VYRKN I KG V VGVTGVC PQGLCLEM P FAG ISIDDVT RCVKDSLDG 

GEYYESRDARLLYGVVMLQRMGRLPEVKGVDTVAPITDSFIARKVVRSMFEKLKVNMPFV 

LAETCNVITRVANEGIINVDIKADNFVIDSISGQPKMIDLGLSYPLGYCYNDEYFRNTEE 

LIRQYIHTPPEFFRGHCLGAYSMTYSFSVMASSILEDVVACSNMEGPAFNLMSNMHFLML 

LQSGTDTDFYQNRPSITEYALAMKHIFPFKGTVMNLFKVKK 

(SEQ ID NO: 127) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q15208 (Q15208) NDR PROTEIN KINASE 41 0.017 

Q56921 (Q56921) PROTEIN KINASE A 41 0.017 

085239 (085239) PROTEIN KINASE YOPO 41 0.017 

068717 (068717) PROTEIN KINASE A HOMO LOG 41 0.022 

CAB54949 (CAB54949) PUTATIVE TARGETED EFFECTOR PROTEIN KINA 41 0.022 

YPKA_YERPS (Q05608) PROTEIN KINASE YPKA PRECURSOR (EC 2.7.1 41 0.022 

Comments : 



TaqMan Primer/Probe Sets: 

5'start=878 

5*stop=902 

3*start=981 

3'stop=1001 

5 • primer =GGACTGTTGTCAAGTTTATGAAGGA (residues 878 to 902 of SEQ ID NO: 126) 
Tm5=57 . 47 

3 • primer=TGAGGACACACACCAGTCACC (residues 981 to 1001 of SEQ ID NO: 126) 
Tm3=57 . 98 

probe l^TGAGTGTGTTAATGGCCT (residues 927 to 944 of SEQ ID NO: 126) 

probelstart=927 

probelStop=944 

directionl=Reverse 

Tml=68.98 

scorel=l-98 

length=124 
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CT512 

Nucleotide 

Genomic coordinates: 

Start: 61872 

Stop: 60672 (SEQ ID NO: 128) 



Amino Acid 

MLSTCDLKHPSSTDGNVLKNIHFSESIPAND2ISFPSSDTEELNKDLLDSVRNQIKFGFD 

PI TETLKNC I TTQTLLHSFLKSS LLTLQEKFNEWGS IQLEKGGQEMALCASLKI MGQI SA 

LIETAKEASMDNKKKNNNACANCRDSKCSASLVTLFNKTIDEKYVKQNSSSASALLANTF 

TAGANKPPKEFITKDNAHGNSDTNYTAMSDNLICPGKYYSSDITYEVTKQAKERIKNNNK 

KMRLATGVEMVMKELEAENNKEGGRVEVEVEGVEQQQPSTSGEEMQMEIMLFTPPPPDLE 

SLVTEGVDDYPVFSPLPSLLSPMPASPLPSNGNSALEDGGPFAPSADIWDKTSEIMGRT 

PGSEWVHQRDRNSKMEIRNYGARGSGINTGRYRRNNTVL 

(SEQ ID NO: 129) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 38 0.12 

YMEI_YEAST (Q03433) HYPOTHETICAL 32.0 KD PROTEIN IN CAT2-AM 36 0.36 

Q9WTU0 (Q9WTU0) PHD-FINGER PROTEIN 36 0.36 

015029 (015029) KIAA0312 (UPSTREAM REGULATORY ELEMENT BINDI 36 0.36 

MML3_MYCLE (006081) PUTATIVE MEMBRANE PROTEIN MMPL3 36 0.4 7 

Q06166 (Q06166) MATURE PARASITE -INFECT ED ERYTHROCYTE SURFAC 36 0.61 



Comments : 



TaqMan Primer/ Probe Sets: 

5'start=549 

5 , stop-570 

3'start=626 

3 ' stop=649 

5 ' prime r=TGCAAATAAACCACCCAAAGAG (residues 549 to 570 of SEQ ID NO: 128) 
TmS=S7 .79 

3'primer=TGCCTGGACAAATAAGGTTATCAC (residues 626 to 649 of SEQ ID NO: 128) 
Tm3=58.13 

probel =GC AC ATGGCAATTCT GAT (residues 589 to 606 of SEQ ID NO: 128) 

probelstart=589 

probel Stop=606 

directionl=Reverse 

Tml=68 . 99 

scorel=l . 99 

length=101 
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CT513 

Nucleotide 

Genomic coordinates: 

Start: 77506 

Stop: 76273 (SEQ ID NO: 130) 
Amino Acid 

MEEHLSFTJKPSPENGVVFFDFSDNTSMSNMVDNIRHRLPMDKKFSSKALLLASTPIPSDE 
QLSTKVNKAIFSHRETIVLSKALKIVVTGLYVDGEYVDDVICLYPEKHTLNGILRYVVHL 
NMMLMDKAEDADEI RCGLI PLGRGFNREAFKFVDPVI PCAGYN ILNGYH PDNGHQIS PSS 
TQPQVQRRCAVKC^IYKQINGMFEWKQFSIKHNNRIFTINQVDFKGEEMKMFFALYSEEL 
LPFYSETGKLLSEKHVSKSFSQLPPHVTISVFYLRNMEEYNTLMKTDFGSCFAPAIKIDT 
GE»JFELFGMNNNILVSKVCVGDDALDLRRRIMEHISDAIGRNVEIADNRLNPHITHGKIN 
EGWGEWVSRFAPCN FLCKPREEI VFGGTKFI FGRVSNGNYV I KQPVDYV 
(SEO ID NO: 131) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 982 of CT513: this 
corresponds to nucleotides 76314 to 77295 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5'start=502 

5'stop=522 

3'start=559 

3*stop=578 

5'primer=TACCACCCAGATAATGGCCAC (residues 502 to 522 of SEQ ID NO: 130) 
Tm5=58.25 

3'primer=TGCTTGACTGCGCATCTTCT (residues 559 to 578 of SEQ ID NO: 130) 
Tm3=58 . 35 

probe 1 =ATCTACTCAACC ACAGGT (residues 537 to 554 of SEQ ID NO: 130) 

probelstart=537 

probelStop=554 

direction l=Reverse 

Tml=69. 03 

scorel=l . 96 

length=77 



Sequences producing significant alignments: 



Score E 
(bits) Value 



032743 (032743) RECOMBINASE 

SYA_RHILV (P24075) ALANYL-TRNA SYNTHETASE (EC 6.1.1.7) (ALA 

003942 (003942) LACTOBACILLUS BACTERIOPHAGE PHIG1E COMPLETE 

Q9X257 (Q9X257) CONSERVED HYPOTHETICAL PROTEIN 

022993 (022993) CELL DIVISION PROTEIN ISOLOG 

YK05 MYCTU (Q10851) HYPOTHETICAL 30.9 KD PROTEIN RV2005C 



37 
34 
34 
32 
32 
32 



0.23 

1.5 

2.6 

5.8 

9.9 

9.9 
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CT514 

Nucleotide 

Genomic coordinates: 

Start: 102885 

Stop: 100046(SEQ ID NO: 132) 



Amino Acid 

MDSTSTTTIEAEKALLKE YVNENLTWE FVDRV I RHEKLMQRT DMRLKTSSRRLFS FI S I Y 
SFLQDFFTARDGVNSDEWCTQSALYHMLDGVASIISCFRKRIDYYNKKMERLACTSIREG 
YFLVDVKTIESRHVELLDPDKKIWQRLYAEKIAPEKWDAYNEVSKLLPDEAMANYNYRT 
GLVHLSDTLKNAKKPPTDLTMTDFDFYEKYIRSDIVLGKSNKLSGMFSENFEILPDINIK 
VPRRLERYFNVETN YSLEHNFRFPSNH I RGLI FAYFI GNI FGGAFSCVQLYLLGFTLSAA 
SACRENVLDTPFSKLKQYIKNDNKTKNSSSNEDNDGEEYYPCELQYARINSNDKNACRKS 
IVKAVKFVADRVEKASVTMMRTPIAEHESDGYMADWLSLQISKLLGRKVSASYALLFIVN 
WVAHKYKQSFTNDVNGSEKYEILLKKLTVACGLTYNHKCGMWPVIGFGSGMTNRKLRQY 
AVHCIENVIGSFISSGKRKKDIHEDPKKLEEMSLMQLSARLFKNNDVMKRGQDGKVTFAN 
EDNVQDFLEELKTKEFVPNERRRKIHEEEYTKSLHTNLKMTFRFGVCGFQHFLPASSDKP 
TQVSLQLLKQRQTFVQRETAAAVNWTRLLQFLFPSDERDNKRHQNSLPWNRLGSNLNRHF 
ISIiASKFIKRSVHCERVVNDIISKFNADILPLGKDPDHFLMTKAGLVIEDHARENIDNAM 
YSLCGGFNNQTTEQKLNSIRLRISAEALKNARNCVLATTFSKSYNEDRPFLPRTDEAKFV 
PIPLFGVEPLHPLLNSFrDNTANKCNDSVSDFWLEESDDIFKEALVSHTILTDSSVYSTL 
VGEDEDYCDNNKSGKRIGNTLVCTLYDMMGRANYNGLHSDKPRKHDPTPWSSKNTGQSGR 
STTDFSPNSVIVLLDTENVADDYEDEEEDYEALKQSERDNVITLNNX 
[SEQ ID NO: 133) 



Top Blast Hits 

Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



Q12537 (Q12537) GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 



AM YG_AS PAK (P23176) 

AMYG_ASPSH (P22832) 

MSH6JYEAST (Q03834) 

YMM1_CAEEL (P34 489) 

AMYG AS PNG (P04 064) 



GLUCOAMYLASE I PRECURSOR (EC 3.2.1.3) ( 
GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GL 
MUTS PROTEIN HOMOLOG 6 

HYPOTHETICAL 81.8 KD PROTEIN K01B6.1 IN 
GLUCOAMYLASE Gl AND G2 PRECURSOR (EC 3, 



39 
38 
38 
36 
36 
35 



0. 11 

0.32 

0.42 

1.2 

1.6 

2.1 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 3 to 841 of CT514 : this 
corresponds to nucleotides 99939 to 100777 of the genomic reference sequence. 



TaqMan Proe Sets: 
5'start=1126 
5'stop=114 7 
3'start=1173 
3 , stop=*1194 

5 'primer- TCCGTCACAATGATGAGGACAC (residues 1126 to 1147 of SEQ ID NO: 132) 
Tra5=59.41 

3'primer=GGAGAGCCAGTCTGCCATGTAG (residues 1173 to 1194 of SEQ ID NO: 132) 
Tm3=59. 86 

probe 1=CGAACACGAATCAGATGG (residues 1155 to 1172 of SEQ ID NO: 132) 

probe 1 start=l 1 55 

probe 1 S t op= 1 1 7 2 

direct ion l=Forward 

Tml=69.00 

scorel^l . 99 

length=69 
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CT515 

Nucleotide 

Genomic coordinates: 

Start: 114953 

Stop: 110132 (SEQ ID NO: 134) 



Amino Acid 

MSRNSLRVKGLKENGGIIPNPFDPLYVDTDAPFGMAGVKSDIIGKGFVESLLPGEISSHY 

NTFDCFKTPKKCRVGGNDFECISCRSLGGGTCVKSSRELKTEYGIEDDDEYDGVCVPLAD 

TI FS AS SAFDKHDDDVATDAAYRNVN P FTTVEE AYLH Y ESGGV I TGGGKKGTT YI TKKRG 

CVDSSVVRKDPSLLNKDPRLEPILGCTDIVLCGGKGVGRPIHPTTFSIIDDVDDIDFDIS 

SMTSTMCCLCEPGYSQQRDPATNAPKCEKKEGGIQEKEQGLGCPVMFRYGVVGDTGTKGC 

LCDESTQIRLEEVAGIDLPDAAKTDYAQPFVEGAKLLLQITERYETLGGSTKDACLPRPG 

NDTRMSALGYSYAASLFGRAPEITAFNGGHLITGGLLRESAMDAAGNWHSRIEDSDEQGK 

LTVSESVGGWPYSGTGSVAAHIWNGDALNDNGLVGAGGGNFTEHPNASLRWPLPHSNI 

PGLG I DSI DHAVGI I ASQGKI FPET VHMRAGDPSGVKT DRRDAHNDTT I ETS FLKDS DKA 

GYDSYKDNPLQKLRKSHDSGICATAYWPSLHRVIKEKPSAKNDKTVNKILPLVHYRPTA 

KRMAHTPIETIFKHSLLTAQERDQSFANSTLNSMMVTNSSNSFDDVTNLLLDYFFPNLNG 

EGKERSGLPINTRSIYNEPNNAKFKEIGGIILQPVTAQGAKKSSTFARFSEKILSTNSPK 

IIDHYKAGSSAVFKKVGEKEAYEMFAHPPTAWRIASNEGTFFSGRGLNNGIEGTGMREAE 

RVAKTLSKKPDIFAGAILTGDGVLMNGASSPLVRPMEIPASSLPEHTWFERRSPVNARGD 

FGSADNLTAINNTYDRVTKGDIRAILNSTTDIKTSFNSYAPARPFSKPLAPPAGVSAAAQ 

ATSFLGVLGGFPLPIPSSFLIQKSVQESVSNGTVGSMHGIVPLKFHEGDELWQQCEVKET 

EGALNFIPPPMALFESLLRVRTLSSETFIRPELIPNRFRADWGLSPHTAGHYLNGVYSPP 

CVREETGQSFGYPCSGALSQYTTMMVPKPLGPQSHSSLSKFSIKSYVEEQTRLLPANIGE 

KSIFEMQDPNSKNIFDKIGELGEKENCNCTNGLFCPKVNGGGRNKTDPIAATPSRGNRHS 

RFPLMTTLPKNDVHLSAALLRAQSGDARILNTIGETKTNGRKINLKAATENIWDISSNVM 

LAPNKFCAMRRSTAYTPYSTRQEKVPAAVLDERKGTFDRNAELLGDVGMTDIVSNDILME 

DYERLPGVPPAEAEIFHIIRDAAKTGQEGAKARRIVDFFESSHGVTASTFNVGTFSPYVE 

GVKDIVSLYATPCFTDIDSPTISADSATINEGASIEPTDGSEVVVEVVNSNMEMLGGSTA 

GSTKKRRLSISDYVDLEEDAEVFTINKQGKATENLRVRTSSSSKYVEGGQKDMVGFYEAS 

KRVPRVMRRVHVLPVLTPYHGGFESCAPTAAQSACTRGVEITYADFMRPSDLSGTKTTLE 

GVRVKGPEPFDDLSTLYFRSVGGPNLRKFAHHHHFGYEGLMSRYYYTREKTVSVSEGDLK 

DRFPFVCQS DRGPFPPKRDGT IQPLALVDMGVLPEGALTRRTI SME 

(SEQ ID NO: 135) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q24341 (Q24341) LARGE FORKED PROTEIN 37 0.96 

Q2434 0 (Q24340) SMALL FORKED PROTEIN 37 0.96 

013788 (013788) HYPOTHETICAL 59.6 KD PROTEIN C17G6.10 IN CH 37 0.96 

CHOD_STRSQ (P12676) CHOLESTEROL OXIDASE PRECURSOR (EC 1.1.3 37 1.3 

HAGA_PORGI (Q51845) HEMAGGLUTININ A PRECURSOR 36 1.6 

P94986 (P94986) HYPOTHETICAL 88.5 KD PROTEIN 35 3.7 



Comments : 
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CT516 

Nucleotide 

Genomic coordinates : 

Start: 118885 

Stop: 115402 (SEQ ID NO: 136) 



Amino Acid 

MKIVQNNFTPDERSGVIHIRKPAKIEKAVFGNIAAAIDDSAAVRKDPKKKRNLKNGLEPA 

SKKLAKNIERISSEELKRVTDVQDPKLLHSIMKRTARQIGYDIGDDISPQSAPDRDGSSS 

SSLLPIRMINIRTEELLEKGGKDTIVRIHILDGILPDNVPLPFKAEIKVDLVDEKYEGED 

GGGS S DSGPSLFETFPEFV PAGWPPI TN DPN AFSRNNGN KQQAV FKH VE VNSLA DGI TLS 

TKGS I FNTGNRLKI S I VTEDKNKT VLFDSQVT I S SP I PKITEV FACRN V ALMRL DMPKA I 

NYDNVEYTPDTLNEKYVSDYPANFPRLSRQAEIASNIAAKLPRENQLSDINKPSVSFVYS 

KTNTVNTPVLNKVLYNETLKNMEGNESEGYKILNATEITHLRNPSNPARTFICVSVPESE 

IEAQWKMLGWIVGFKTSSDVLTTSSGYNIVFPASKVTQSDKLFSVISTDVNANTNKWVH 

NTPSRVGCFGSSVNFRVDAATAPDWPGPTNGPDFFSYQLRPCI ILKTDNDNREPRITAVL 

SSPATEYAGERTTSLLPRALNVSVGPLTEVRGGDIITPVQTALLGGEQPTFKAPAEPTKL 

YAVFPVLDSHNGLVKASDNPFQPIHSITSRNKTTVLTVSDVIVNDDDDDVVLEDKSYHIT 

VSDPVSGSILAKENVLSSRITSRPIFIDGARDDRVFSVKMEVFGGDDKGIQMPFTMDGHF 

EGQFSDMSVPSNELAIWNDPSTFTAPVRDTPATDITNKGIVYCRTTLPPISNRGIRDPFM 

KQTSLVPLPTSIPEWAFADYGGEIKYPRHIFISSIRTNDTTNI VNTDTQTEFSIENWLRE 

QIDKEQERHRQLLPAPSEAYTQGEKVYAKMYMGDGVSEETLDQIVHTSNTTYVVDESGTK 

KENLLVN KE DKKLAAI LGKWG IWFGAN KYPDE PADRYTNWRNTGRLRA VGS YSQLRQP V 

APLQTRLATWPSGDPVTRLADGQFLVRLDPRCGGIGSANGFYNNNGANNEFTSSLLFAIV 

GNQDKVVSYAERVRETMKIVARNEGKKHLKNDDGLVLVDRNSALHRRLWNRTTFDHDDIV 

LCVKI PQNVMSKIEPGTSSGVLVDPLVFANVASSTDREEFYKKFIDTSSGPWI DRASVT 

SSYNISV PLN FYTTCGFI VG 

{ SEQ ID NO: 137) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

DNA2_YEAST (P38859) DNA REPLICATION HELICASE DNA2 37 0.68 

094534 (094534) PUTATIVE SPINDLE POLE BODY -ASSOCIATING PROT 37 0.89 

BAA84527 (BAA84527) ALP 14 37 0.89 

YGS4_YEAST (P4 6947) HYPOTHETICAL 30.5 KD PROTEIN IN SAE2-KE 36 1.5 

028907 (028907) GTP— BINDING PROTEIN 36 2.0 

Q49547 (Q49547) LMP3 PROTEIN 35 2.6 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 888 of CT516: this 
corresponds to nucleotides 115494 to 116381 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5 , start=1869 

5*stop-1892 

3'start=1984 

3*stop=2001 

5 ' prime r=CATTCACAGTATCACGTCAAGGAA (residues 1869 to 1892 of SEQ ID NO: 136) 
TmS-57 . 80 

3 ' pr imer=ACCAGACACGGGATCGGA (residues 1984 to 2001 of SEQ ID NO: 136) 
Tm3=58 . 39 

probe 1=AGCTACCACATCACAGTC (residues 1966 to 1983 of SEQ ID NO: 136) 

probelstart~1966 

probelStop«1983 

d i r ec t ion 1 - Forwa r d 

Tml=69 . 00 

scorel^l . 99 

length=133 
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CT517 

Nucleotide 

Genomic coordinates : 

Start: 180036 

Stop: 179421 (SEQ ID NO: 138) 



Amino Acid 

MEFGNLTNLDVAIIAILSIAIIALIVIMVIMIVFNTRVGRSVVANYDQMMRVPIQRRAKV 

MSIRGERSYNTPI^KVAMKNGLSDKDMKDVSADLVISTVTAPRTDPAGTGAENSNMTLKI 

LNNTGVDLLINDITVRPTVIAGNIKGNTMSNTYFSSKDIKSSSSKITLIDVCSKFEDGAA 

FEATMNIGFTSKNVIDIKDEIKKK 

(SEQ ID NO: 139) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SCH9_YEAST (P11792) CAMP- DEPENDENT PROTEIN KINASE SCH9 (EC 35 0.45 

039292 (039292) COUNTERPART OF HSV-1 GENE UL10 AND VZV GENE 33 1.7 

HS78_YEAST (P32589) HEAT SHOCK PROTEIN HOMOLOG SSE1 32 2.3 

G435756 (G435756) LYSOSOMAL MEMBRANE GLYCOPROTEIN LAMP-2 HO 32 3 . 0 

LMP2_HUMAN (P13473) LYSOSOME- ASSOCIATED MEMBRANE GLY COP ROTE 31 5.1 

AXOl CHICK (P28685) AXONIN-1 PRECURSOR 31 6.7 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 711 to 1 of CT517 : this 
corresponds to nucleotides 179367 to 180077 of the genomic reference sequence. 

Hit to public sequence : 

gi 1 6856160igb| AF173992 .1 : CT nucleotides l to €15 match nucleotides so to 664 of the 
public sequence with a 100% homology, a score of 1219 and an Evalue of o.o 

TaqMan Primer/Probe Sets: 

5'start=227 

S , stop=24 7 

3' start=2 94 

3'stop=311 

5 ' prime r=TGGCCATGAAGAATGGTCTCT (residues 227 to 247 of SEQ ID NO: 138) 
Tm5«58.19 

3 1 pr imer-GTCCTTGGGGCTGTGACG (residues 294 to 311 of SEQ ID NO: 138) 
Tm3=58 . 60 

probe 1 =TGCTGATCTTGTCATCTC ( residues 273 to 290 of SEQ ID NO: 138) 

probe Is tart =27 3 

probelStop=290 

direc t i on 1= Forward • 

Tml=69. 00 

scorel=l . 99 

length=85 
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CT518 

Nucleotide 
Genomic coordinates: 
Start: 190743 

Stop: 188172 (SEQ ID NO: 140) 
Amino Acid 

MTESKDYVIALVAETKTDEKRLNYVSEGLVAAISNLQNTPEKQRKVVISSDVFGPTWFNK 
TTEFFNSGLRLAKGHLSKDAVMRSVYRDIEGVREHIIDPSWRLTETAAEELCDFTFLKQA 
PLLNLLNAFENIMDGVFRSAANLVLYSTRGDTNEPSWVIDSEMLANRNNSTVADLAMGRA 
KRAIALFLGYTLCDILRWKQSIASRMKERGLDPFAAMPPHLEYGRAADMIEKRIKDFIEG 
SFSDGVTVSEEDGQSYWPTISTVLTNMVSVIQEGFYPPKVGSFHEALLGREIMVLLSAA 
IDAEYRAVLSRTRNAKPNPLTTKLDKYVNNPHLQMPSESVTEREKEWVERERERIKTTDM 
TAENLFRDHPYLPKAIDGILGPKRTPTALQALQREYKRCNKFNDIVSPETLEYFLVNNRQ 
VMFSNYSVTRVLDPDSAARFSMYVLWNALFLCSGGLTQKTNSSAVKSRLILQVFLKDMHS 
LFVCQRCESGFITKSLDTFTISLKEQSKPSMGEQELETYWKAVLDALGGGGGNNKGAENV 
NGLGELMVEILSADSGLLRGGGLGGDIGFEGKMKQKREDEEVRNMHLVDKKGYVFEAAKY 
VHVSKGFAALSFYLLYAAAATSNPSITNNFDRAVYLLLARWGDLKFPTHNLWGNVPTDEN 
TSSLLSFASFWALRNAVRARRNVIDASNTSFVPGRPLPLLSAFSSKMLVDNMLKNNYVKV 
ENVNREKLIWKAFREMQTESEIWKTSGSKAASDRNVKKAKQDLIRNASIGRLIVEPVGKT 
PISSIALFRSMKRSRSEDLKMGSNNKYRLARDTKTATPRNPLSYTGKIVFSLDDLKNFSK 
DSYTTMKVFPLTPLDG 
(SEQ ID NO: 141) 

Top Blast Hits 

Sequences producing significant alignments: 

CAA22155 (CAA22155) HYPOTHETICAL 46.4 KD PROTEIN 
095517 (095517) DJ1170K4.1 (NOVEL PROTEIN SIMILAR TO KIAA01 
Q43688 (Q43688) GLYCIN— RICH PROTEIN (FRAGMENT) 

YHO0_YEAST (P38800) HYPOTHETICAL 149.7 KD PROTEIN IN IRE1-K 
P97572 (P97572) CALPAIN SMALL SUBUNIT (EC 3.4.22.17) (FRAGM 
048591 (048591) GTL2 GENE 

Comments : 

EST confirmation of the predicted transcript: 
Nucleotides 515 to 1 of CT518: this corresponds to nucleotides 18814 6 to 188660 
of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1308 

5'stop=1327 

3'start=1383 

3*stop=1404 

5 * primer =AGCTGCCCGATTTTCAATGT (residues 1308 to 1327 of SEQ ID NO: 140) 
TmS=58.26 

3 •primer— CCTGCTTTTCACTGCAGAGCTA (residues 1383 to 1404 of SEQ ID NO: 140) 
Tm3=58.54 

probe 1=TGTGCTATGGAATGC ATT (residues 1329 to 1346 of SEQ ID NO: 140) 

probelstart=1329 

probelStop s =134 6 

direct ionl=Forward 

Tml=68.99 

scorel=1.88 

length=97 



Score E 
(bits) Value 

37 0.4 9 

35 1.9 

34 4 .3 

34 5.6 

34 5.6 

33 7.3 



FIGURE 3, Sheet 76 of 160 



WO 01/38351 



118/201 



PCT/USOO/28888 



CT1003 
Nucleotide 

Genomic coordinates: 
Start: 23709 

Stop: 24300 (SEQ ID NO: 142) 



Amino Acid 

MDVSSYKSTIDYHNIEDMDDLQRATYKDRMETELVLEMAKKEGRYVRSLATMDELEVPEE 
PATCYTCGYTFIRRRAPPPKRKSIFREPCAYPELLPDAPSPVRLEELVDVPEGASFFTYP 
PYDDGSSTSSSQAECEDDYPPPYDPSENPQRSQVCDYCTTRQVLSSMTDHARANLIKNLK 
REKKALGLGRRNNFSY 
(SEQ ID NO: 143) 

Top Blast Hits 

Sequences producing significant alignments: 

VP40_EBV (P03234) CAPSID PROTEIN P40 (VIRION STRUCTURAL PRO 
Q9Y4G2 (Q9Y4G2) KIAA0356 PROTEIN 

088508 (O88508) DNA CYTOSINE-5 METHYLTRANS FERASE 3A 
Q9Y6K1 (Q9Y6K1) DNA CYTOSINE METHYLTRANS FERASE 3 ALPHA 
Q23804 (Q23804) SPID PRECURSOR ( FRAGMENT) 
088799 (088799) ZONADHESIN 



Score E 
(bits) Value 



35 
34 
33 
33 
33 
32 



0.46 

1.0 

1.8 

1.8 

1.8 

3.9 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 736 of CT1003: this 
corresponds to nucleotides 23688 to 24423 of the genomic reference sequence. 
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CT1004 

Nucleotide 

Genomic coordinates: 

Start: 26630 

Stop: 27257 (SEQ ID NO: 144) 



Amino Acid 

MAPNSFQKFAPVIKTEKKEEERDEHDDPLRQIDFRDRKTLICLTANCVSRKRKAGSAHDR 
VYKVLRYGNPYKYRRPNRTHRGLALSMDQGEVGTCLPLRPMEETEENPIDKCGVAFLYSN 
YNEGDGMTHL YN DEE YI KKCKT I EGGTRTW VKKNRQE Y FRQALETLMMS HS I KQYSNFI F 
FKEDMEEGFVHKLHTFINMVHPKKVSVL 
(SEQ ID NO: 145) 



Top Blast Hits 

Sequences producing significant alignments; 
Q9XX10 (Q9XX10) Y39A1A.22 PROTEIN 

P70392 (P70392) RAS PROTEIN -SPECIFIC GUANINE NUCLEOTIDE-REL 
075381 (075381) PEROXISOMAL MEMBRANE ANCHOR PROTEIN HSPEX14 
Q40112 (Q40112) HYPOTHETICAL 28,4 KD PROTEIN 

ECEl_CAVPO (P97739) ENDOTHELIN -CONVERTING ENZYME 1 {EC 3.4. 
O70651 (070651) GAG POLYPROTEIN 



Score E 
(bits) Value 



34 
32 
32 
32 
31 
31 



1.1 
2.5 
4.4 
4.4 

5.7 
5.7 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 654 of CT1004 : this 
corresponds to nucleotides 26601 to 27254 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 

5*start=248 

5 # stop=266 

3'start=291 

3*stop=309 

5'primer=TGGCCCTCTCAATGGATCA (residues 248 to 266 of SEQ ID NO:144) 
Tm5-58.53 

3 * pr imer=CTCTTCCATGGGTCGGAGA (residues 291 to 309 of SEQ ID NO: 144) 
Tm3-58.32 

probe l-AAGTAGGAACATGCCTCC (residues 272 to 289 of SEQ ID NO: 144) 

probelstart=272 

probelStop=289 

direct ionl=Forward 

Tml-68.97 

scorel=l . 97 

length=62 
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CT1005 
Nucleotide 
Genomic coordinates: 
Start: 31091 

Stop: 31961 (SEQ ID NO: 146) 
Amino Acid 

MEGEHQYLNLVREILERGVKKDDRTGTGTLSIFGPQMRFSLRDDTIPVLTTECKIFWRGVV 
EELLWFIRGNTDAKELAKKKIHIWNANGSREFLDSRGLYDRAEGDLGPVYGFQWRHFGAE 
YDTCSSDYTGKGIDQLANILKTLRENPDDRRMIMTAWNPMDLHLMALPPCHMTAQFYVAN 
GELSCQLYQRSGDVGLGVPFNIASYSLLTHLMASMVGLKPGEFILTLGDAHIYNTHIEVL 
KKQLCRVPRPFPKLRILMAPEKIEDFTIDMFYLEGYQPHSGNLQMKMAV 
(SEQ ID NO: 147) 

Top Blast Hits 

Sequences producing significant alignments: 

TYSY_HUMAN (P04818) THYMIDYIATE SYNTHASE (EC 2.1.1.45) (TS) 
TYSY_MOUSE (P07607) THYMIDYIATE SYNTHASE (EC 2.1.1.45) (TS) 
TYSY_RAT (P4 5352) THYMIDYIATE SYNTHASE (EC 2.1.1.4 5) (TS) 
Q89940 (Q89940) THYMIDYIATE SYNTHASE 
P90463 (P90463) ORF 70 

DRTSTRYBB (Q27783) B I FUNCT I ONAL DIHYDROFOLATE REDUCTASE-TH 
Comments : 

TaqMan Primer/Probe Sets: 
S'start=424 
5*stop=446 
3 , start-479 
3 f stop=501 

5 * prime r-ACCCTGAGAGAAAATCCAGATGA (residues 424 to 446 of SEQ ID NO- 146) 
Tm5=57.82 

3 1 p rimer =AAGAGCCATAAGGTGAAGATCCA (residues 479 to 501 of SEQ ID NO: 146) 
Tm3=57.91 

probe 1=ATGACGGCATGGAATCCT (residues 460 to 477 of SEQ ID NO: 146) 

probelstart-460 

probe lStop=4 77 

direct ionl=Reverse 

Tml=68 . 98 

scorel=l . 98 

length=78 



Score E 
(bits) Value 



410 


e- 


•114 


4 07 


e- 


113 


406 


e- 


112 


390 


e- 


108 


387 


e- 


107 


381 


e- 


105 
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CT1006 

Nucleotide 

Genomic coordinates: 

Start: 32124 

Stop: 32802 (SEQ ID NO: 148) 
Amino Acid 

MAFNFEDSTNLFANMDLTAGTTTDPTRPNIIFFESLLPNSGIEVMKRRLVRQGKCGNFEA 
SGGAMSYFWLEDNAEDMENLNSGSHVKTNCLALFLQEFISNWIEETDRHGQYCTFPQYMD 
GGDGSRGGYFTSLAMKWMARDVTFFVFVDRNNWENAASIWMYQKLIAIGAKWKVIVDN 
ASNPMFSVCNACRCKYPGPVSYVIEGHGVGHSDLTCDEISGFFV 
(SEQ ID NO: 149) 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 624 of CT1006: this 
corresponds to nucleotides 32182 to 32805 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5' starts 68 

5'stop=289 

3 , start=340 

S'stop^Oei 

5 , primer=TGCTTGGCATTATTCCTTCAAG (residues 268 to 289 of SEQ ID NO: 148) 
Tm5=58.01 

3 "primer=CGTCCATGTATTGGGGAAAAGT (residues 340 to 361 of SEQ ID NO: 148) 
Tm3=58 . 72 

probe 1 — CG ACATGGACAGT ACTGT (residues 322 to 339 of SEQ ID NO: 148) 

probelstart=322 

probelStop-339 

direct ionl=Forward 

Tml=69.04 

scorel=l . 95 

length=94 
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CT1007 

Nucleotide 

Genomic coordinates : 

Start: 32947 

Stop: 34216(SEQ ID NO: 150) 



Amino Acid 

MDSNTSILPPSKRPGLNLLQVLGIIITVALlASVSSFirYRVGKRKYYPSSSSSSELSDV 
DNGVEGGGGTTTTPTQPSPDGGDGYVDLSPQKKAELRTRVANVIFQEVSKDQGVAFRRAM 
NDSTDKIMEETEARINNFSEPFREATVEREVFKDDTDKNFILSTLDLTEEQFKDIVMAEV 
KNQLENFDYEDMTRLIFDNIPETDYLWTTHFDPKKYDTYSEKVLGFSDINSIERISSTFY 
KGKKYEVTTGNVAVLVDFESETIKEKAGNSLIRNVEFIWDEQTYKSFFPAFNQVFFSFK 
VNKEKREVTVSINNGCVGIVANITPLTTPVGAASGHYIYGTSTAKEKTYLFVIDKYDTTE 
FVCGLSNKS T PLMALN I LFM S DTVFPS FDEAE RPLT DAKAVE I LGKRLG VGRYTNAN I RN 
TQ 

(SEQ ID NO: 151) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q56711 (Q56711) HOOK-ASSOCIATED PROTEIN TYPE 3 39 0.045 

077363 (077363) MAL3P4.5B PROTEIN 39 0.059 

YLM5_CAEEL (P34379) HYPOTHETICAL 49.8 KD PROTEIN D2007.5 IN 37 0.23 

GBA2_CAEEL (P22 454) GUANINE NUCLEOTI DE-BINDING PROTEIN ALPH 36 0.51 

Q9YW03 (Q9YW03) ORF MSV089 PUTATIVE NTPASE , RABBIT FIBROMA 36 0.51 

P87199 (P87199) KINESIN MOTOR PROTEIN 36 0.51 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 907 of CT1007: this 
corresponds to nucleotides 33238 to 34144 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=64 8 

5'stop=674 

3'start*773 

3 , stop=*799 

5 , primer=TGACACGTACTCTGAAAAGGTATTAGG (residues 648 to 674 of SEQ ID NO: 150) 
Tm5=58.05 

3 * pr imer=CCTTCTCTTTTATTGTTTCAGATTCAA (residues 773 to 799 of SEQ ID NO: 150) 
Tm3=57 .41 

probe 1-ATGTAGCTGTCCTCGTTG (residues 752 to 769 of SEQ ID NO: 150) 

probelstart=752 

probelStop=769 

directionl=Forward 

Tml-68.89 

scorel^l .89 

length=152 
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CT1008 
Nucleotide 

Genomic coordinates: 
Start: 34217 

Stop: 35048 <SEQ ID NO: 152) 



Amino Acid 

MEGVI LDKIET I AKRAS PSYGS I DVGTA ILRRQFMEKIRGKINEETTMEKIMGTKEERED 
T I RS I V ANVI KENTVKENVTEKI RAMT DKELN DNRE FM H DFGKI STGDGGTFHLFEDTPG 
FESALKAEYKNVPGATTPKYVSMNSLRIDAINGKIEEVYNPSPIMGIREYGTIRRGRYEE 
NAGSKELVFMTKIEKRPNNVAENLIIRVANQQYNVMRMVFFIDYETKKGVSKEEMFIPYN 
VQKTKALKGRSTYFSFVRKI PDEPEGSI I IHALGFY 
(SEQ ID NO: 153) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

BAG_STRAG (P27951) IGA FC RECEPTOR PRECURSOR (BETA ANTIGEN) 41 0.00*7 

Q99051 (Q99051) IMMUNOGLOBULIN ALPHA FC RECEPTOR PRECURSOR 41 0.007 

YIBAJSCOLI (P24172) HYPOTHETICAL 31.8 KD PROTEIN IN RHSA-MT 38 0.079 

Q46749 (Q46749) ORF-A1 38 0.079 

Q25920 (Q25920) MATURE-PARASITE- INFECTED ERYTHROCYTE SURFAC 35 0.53 

Q06166 (Q06166) MATURE PARASITE- INFECTED ERYTHROCYTE SURFAC 35 0.53 



Comments : 



TaqMan Primer/Probe Sets: 

5*start=361 

5*stop=384 

3'start=429 

3 , stop=451 

5'priioer-TTTGAAAGTGCTTTAAAGGCAGAA (residues 361 to 384 of SEQ ID NO: 152) 
Tra5=58.35 

3'primer=TCGCATCGATACGTAAACTGTTC (residues 429 to 451 of SEQ ID NO: 152) 
Tm3=57_ 96 

probe 1=CCAGGAGCAACTACTCCA (residues 397 to 414 of SEQ ID NO: 152) 

probe Is tar t=3 97 

probe lStop=4 14 

direct ionl=Reverse 

Tml=69. OS 

scorel=l . 94 

length-91 
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CT1080 

Nucleotide 

Genomic coordinates : 

Start: 291719 

Stop: 292205 (SEQ ID NO: 154) 



Amino Acid 

MTSPAPSPSSTPKSSCTTIVNRCGFLLDNNKEWIYDTNSKFKCEPKNLELIGVLSGVSD 
NWTQ I S PDQI FVGT YM VKYNWSKSGHERFS DMSNNCLDN I TRPSEV I ES V IKKTS SDFK 
MKYTRSLMDHTEKYYFSGDQKLSKISSWCTTPIRQWVCNSV 
(SEQ ID NO: 155) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB45519 (CAB45S19) GIBBERELLIN 20-OXIDASE-ARABIDOPSIS THAI* 32 3.0 

Q52743 (Q52743) CELB 32 3.0 

Q39110 (Q39110) GIBBERELLIN 20-OXIDASE 32 3.0 

060963 (060963) L549.2 31 3.9 

FLPA_ARCFU (028192) FIBRILLARIN-LIKE PRE-RRNA PROCESSING PR 31 5 . 1 

DPOE_YEAST (P21951) DNA POLYMERASE EPSILON, CATALYTIC SUBUN 31 6.7 



Comments : 



TaqMan Primer/Probe Sets: 

5 , start=201 

5'stop=220 

3'start=290 

3'stop=312 

5 1 prime r^CCCCGACCAGATATTTGTGG (residues 201 to 220 of SEQ ID NO: 154) 
Tm5=58.70 

3 1 prime r=AGGGCGTGTAATATTGTCCAGAC (residues 290 to 312 of SEQ ID NO: 154) 
Tm3=58. 09 

probe 1=CGCTTCAGTGACATGAGT (residues 265 to 282 of SEQ ID NO: 154) 

probelstart=265 

probe lStop=2 82 

direct ionl=Reverse 

Tml=68.90 

scorel=l .90 

length*=112 
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CT1009 

Nucleotide 

Genomic coordinates: 

Start: 35073 

Stop: 35967 (SEQ ID NO: 156) 
Amino Acid 

MALQEKDITIGNVSAALRELMYSPTHMQHHDKLNTFLDRNVESSSEEKIRQIVDKIRSQT 
TSDI SETVNNVTTNGTAFSLFEDTLEGMVKKN I GDNLQSGDFI DGRKKLNDMKSLATGAI 
LSRQRDFVAESITGTKDWLKAIMGCGIIRYTVFVNNLARSTLDNDDDKAATYYNTPIYGG 
YCKM A I KDYE I PDS Y SKVE AEHTVEGRKMTFN I KWRGDT INN LI T 1 1 PS VTG YLAS I SED 
ADVQAPLLLNCNNCFIEADMSSLYMDEKKTEASFTLNLPEIEGADANAVYEICIVW 
(SEQ ID NO: 157} 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

YB75_YEAST (P38321) HYPOTHETICAL 101.2 KD PROTEIN IN FAT2-P 34 1.3 

Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 33 2.9 

092271 (Q92271) 12-8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 33 2.9 

Q26032 (Q26032) VARIANT-SPECIFIC SURFACE PROTEIN 32 3.8 

O02179 (002179) CALYX PROTEIN 31 8.6 

O01394 (001394) POLYHEDRAL CALYX PROTEIN 31 8.6 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 780 of CT1009: this 
corresponds to nucleotides 35166 to 35945 of the genomic reference sequence. 
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CT1081 

Nucleotide 

Genomic coordinates: 

Start: 292189 

Stop: 298777(SEQ ID NO: 158) 



Amino Acid 

MQ1J<I£NE^KEEHETWVHNPS<^GFNIFNSSPVYFEVHNEMDALIFMAAFLKHNSLWG 
EINANMDLYTFDYAGAFLDERWCHHEKSFSVVRAQLINSYYKCRRKIMQALDNNYNNKNK 
KRKNVGGAPAFTFMSGDGEGGKEALEASFDVIGGTRGGRFGVDSTPCPHSSAMQLKLDNE 
GNYGCIACFASMFFVLENPGDESSFISTDASKIGQAQAWIDERLRNNENGGEENNVFKKT 
FHMLADITQKAHETAYSNTIPLGPNGRQWNWPTHTVEPIAHEFVTHSLVNTLKNLGDRKL 
PRFNFDILYNLLNPFGKMLLVFIQNCHILTGHKNNENWPRGSASGKWWTINFVGVNMWT 
FQVTKCKVEKDRKISDLACMETLPRLPNPGSTTVDDRIVFKGFCRGENLGSVGEWSDIT 
QSVKNFCLMVENRKFSVDKETGFISSESIVSDPFFSLEVTGCRSNRAQDTINNGRVSARV 
MRILKSREGARVWLAKDENAIIFENVNHDTAISTDAMERAIGQHKILYYDIETTDKDFTD 
KKSVITSIGFCLCTGGDMTHGGERGVFGLVAPGSDVEKVKETIINSYDPEEKEDIMKQCP 
QVIE I FTNE FEMLLG FGKY I DKVKPH V I SGWNN VAFDDPFVFTRIVKHLSDHTKDMS YCV 
ADASTAESVLPRATEGGGGGETPYRLSTPQERIQLASTGIFNKLGKFVDKKTGMLKPEMT 
ADLLAGAESQANTKFKERNKLSSSNKGSAGWFQKIIGGMCSAIRLDLMKVCEKAYKESLS 
EFNI^AVLAKVSSVGDKVKNVKDEIVDLHFHLLGFLKLKKAQDQAKVHVYCCKDAYLTGIV 
STSINKEGEIFRI/^VIDSALTEAVVTANLATPLCIGEGAICRNMGEERADRRGVGVFaiHSI 
ATDTKGGMVSQPIVNHVPYQTIDMTSLYPMTMCQNNLCTTTFVTHRQIMQLRDRLVLEKM 
KNKTTDSLLLLDVIDECNQIVLSEYRPIDIAVASWKNSNSNRQTPITRIEESLGLRFIEN 
LDAEKTNNKTWCTNTSPNMNVTAAGM DYFPE I VCDINMQFAAKVNDDMHIAPASLEYMIiQ 
VLPIMLIDRPYIGAHITAGKCRTLEDILSELEKDFSVEKDEEIIRTHWTFKGQKQYDFCH 
SPVTQMARHIIESTGRNIRDYEGNEKFERLVSLSDRIYRRVGAFDSANDPAVRLWSSRLI 
N VGML VRTWN VKT D I LKG 1 1 PQMQAT Y RADR WMQNKAKE F AKMGDMKRAGLN KVGQN IM 
KLCT4NSMYGHLALRARSSRKEFASGSANTASS I SNMSATGGIGGGTRHSVTANQITENAR 
CVFGNIGCGLQMALPGTKQTYGDTDSVFCVHNIVGDGGMIPEYDEQTGKYYYVMDIALKN 
KMAA 1 1 PI LVN S LT KG I Q FV E R R DAG VGMMN I AH ERLA V AGLL F AKKT Y HMLH FNEN S AA 
FNDMIKLKSTDNNNKFASFIKRPSHADGYWPHNPSLILRAAEGPAGKKLKSFLEEEGIH 
DEKSMEEWFTSSPTWMAMDASVINNLYASQIVGVEKGNWIDAMTSRPIEAGTEMMEAVTQ 
AKAAFTPYKKGAFVKKGITPTTKLKGLQSLIARFLPKIEEKKSCYLDVMKNHVENFASHI 
TNPAMMITSSRVNKFDTSKEQSRPNPLALAINNHIJSIPSSEISLGQKFKTVTSVSSWSLSA 
EEGEVPAGYFNAGSVRWDATNMKGSVPAFSVKNLSWPNAITSVYKMVESDKTAIKSMIA 
KNVEVLCSTSANTGFSLRRGALSFNTGVIVTKDVAMACIRSLNNKQMLLFVGGGKDYGED 
DDDDDEEAEEEDEENGENEENKGDCVTEKKIPGRSTNKDVGEETKTSEKTEGERKGSKTA 
KGKTEEIASSLSKCGKKDARDVILDRLLKATHSSCTNNEERTRVMQYSNCTLSSYITSV 
MKLDQRVADQMENLISQLDQIRNLSNKKRQEKGGPFKSELDAMVAAVKVKFFPVLDASRK 
LTQDHWKKCP VS I PETREEKPLMGV P FEVALN SLIGKHKCTDTCDMACCQSLY FVLLYTL 
ALKFENERLARQIGLDDSVDLMAEMLFGGDKLLAQEVLKRVKDAQDRKLVKSLLPLNYNH 
DTNTIIFLFESLRFAQKPVAGMSVSEIKDAVRGLAFSTTTGTVWNYTDERFFGPLYNMDE 
LCNERVNGNCKLSFITGIYHTAAVELAAACLSCVL 
(SEQ ID NO: 159) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) value 

DPOD_YEAST (P15436) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 52 5e-05 

DP0D_SOYBN (048901) DNA POLYMERASE DELTA CATALYTIC CHAIN (E 51 8e-05 

DP0D__SCHPO (P30316) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 50 2e-04 

CAB58156 (CAB58156) DNA POLYMERASE DELTA LARGE CHAIN 50 2e-04 

0BF1_RAT (P25977) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTREA 50 2e-04 

UBF1 HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 49 4e-04 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 4 to 1205 of CT1081: this 
corresponds to nucleotides 297 608 to 2 98809 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=2738 

5 , stop=2760 

3*start=2817 

3'stop~2838 

5 1 prime r^TCGTCAATCATGTTCCCTATCAA (residues 2738 to 2760 of SEQ ID NO: 158) 
Tm5=58.05 

3 ' prime r=TCGATGGGTCACAAAGGTAGTG (residues 2817 to 2838 of SEQ ID NO:158> 
Tm3=58.56 

probe 1-GATGACCATGTGTCAGAA {residues 2787 to 2B04 of SEQ ID NO: 158) 

probe 1 s ta r t=27 8 7 

probelStop=2804 

direct ionl=Reverse 

Tml=69.00 

scorel=1.99 

length=101 
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CT1082 

Nucleotide 

Genomic coordinates: 

Start: 300935 

Stop: 305108 (SEQ ID NO: 160) 



Amino Acid 

MTEQGDQGIKVRKLHGPRGERGETGPAGAVGPAGPQGERGAIGPAGKDGAVGPAGPQGER 

GAIGPAGKDGAVGPQGPPGERGENGRPGRDGAVGPQGERGAIGPAGKDGAVGPQGERGAI 

GPAGKDGAVGPAGPQGERGENGRPGRDGAVGPAGPPGERGAIGPAGRDGAVGPAGPPGER 

GATGIPGRDGVDGSVGPQGERGEIGRPGRDGAVGPAGPQGRRGATGRAGKDGAVGPAGPQ 

GEKGEAGKDGSIGPQGIQGPRGETGPPGRDGTAAERGERGFPGPPGETGPPGKDGVDGSE 

GPQGKRGETGPVGPRGEPGLAGLPGRDGAIGPAGPPGERGATGLPGRNGVDGSIGPQGRR 

GATGRAGKDGAVGPAGPPGERGATGIPGRDGVDGSVGPPGERGETGPAGRDGSVGPAGPH 

GERGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPRGETGAMGKNGVDGSM 

GPQGRRGATGRAGKDGAVGPAGPPGERGETGPAGRDGSVGPAGPQGETGLTGSPGRDGAT 

GPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGET 

GLTGRPGRDGATGPIGPRGETGAMGKNGVDGSTGPQGRRGATGRAGKDGAVGPAGPPGER 

GENGRPGRDGATGPIGPAGPQGETGLAGLPGRDGAIGPQGEKGENGRPGKDGATGPMGPP 

GERGETGPIGPAGPQGATGLPGRDGVDGSVGPQGKRGLIGRTGRDGAIGPVGPAGPKGET 

GLAGLPGIDGKDGSVGPQGAIGPIGPRGERGETGRPGRDGEDGSTGPMGPQGLRGATGAP 

GPQGERGLKGRPGKDGETGPPGRQGRDGIMGPRGLRGEKGAPGNDGLEGPEGRDGAPGPA 

GPIGPQGIRGLKGIQGRPGRDGEMGPAGKDGIEGPRGQDGTTGAKGPRGLRGFQGRTGET 

GAQGSRGEKGDRGLTGPQGRDGPPGEEGPQGLRGERGAPGPRGPRGIRGRSGPQGSNGVQ 

GPRGPRGTKGRTGIQGLTGIEGPRGPRGIQGKEGRMGKIGHRGEKGDKGDRGEQGIAGAD 

GEKGPRGLRGIRGPIGAPGKPGTEGVRGPRGVRGVPGYPGAQGELGPQGPTGPQGPAGPQ 

GPMGRTGDTGPMGPPGAVGPRGEKGGRGRKGKNGPKGADGKDAVNIIQKYSITHARAEIM 

WEGNEIGEAYIGRSYGTDTIPVMIENRIGMTNEDKKNEYCIQVMTMHSITTRGRTSGVFV 

VSNKTDYILLVTLLMPESVSCRTDVSTNARSERVNAVRERESKSYRFIRPSDQSIGTHSR 

SKIAVVMYPDASMSYSVDTLDADVARR£TTSVLLLAETIHGEKDRGFYADRGTVGRI*T4VP 

PTEEELLVLQX 

(SEQ ID NO: 161) 

Top Blast Hits 

Score & 

Sequences producing significant alignments: (bits) Value 

Q14054 (Q14054) COLLAGEN TYPE VII PRECURSOR 940 0.0 

CA11_CHICK (P02457) COLLAGEN ALPHA 1(1) CHAIN PRECURSOR 935 0.0 

CA17_HUMAN (Q02388) COLLAGEN ALPHA l(VII) CHAIN PRECURSOR ( 933 0.0 

Q63S70 (Q63870) TYPE VII COLLAGEN 928 0.0 

060444 (Q60444) TYPE VII COLLAGEN (FRAGMENT) 925 0.0 

CA11HUMAN (P02452) COLLAGEN ALPHA 1(1) CHAIN PRECURSOR 923 0.0 

Comments: 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 1 to 557 of CT1082: this 
corresponds to nucleotides 304552 to 305108 of the genomic reference sequence. 
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CT1083 

Nucleotide 

Genomic coordinates: 

Start: 50094 

Stop: 50295 (SEQ ID NO: 162) 



Amino Acid 

MAVTEIPCGTRNIAEEDVELELILVTAEAEVREAMAAALAAAIIGAVWQIGRVLDEWA 
AEVELM 

(SEQ ID NO: 163) 
Top Blast Hits 

Sequences producing significant alignments: 

LIPB_MYCTU (Q10404) PROBABLE LI POATE- PROTEIN LIGASE B (EC 6 
Q64033 (Q64033) ANTIGEN LEC-A 

OL56STRAT (Q07017) OLEANDOMYCIN POLYKETIDE SYNTHASE, MODUL 
RPSD_PSEFL (P5232 6) RNA POLYMERASE SIGMA FACTOR RPOD (SIC34A 
05454 0 (054 54 0) RNA POLYMERASE SIGMA FACTOR 
Q9YAS2 (Q9YAS2) 136AA LONG HYPOTHETICAL PROTEIN 



Score 
(bits) 

30 
29 
29 
23 
28 
28 



E 

Value 

1.8 
3.1 
5.2 
6.9 
6.9 
9.0 



Comments : 



TaqMan Primer /Probe Sets: 

5 f start=24 

5'stop=4 5 

3 f start=95 

3 , stop~112 

5 f prime r=TGGTACTCGGAACATTGCAGAA (residues 24 to 45 of SEQ ID NO: 162) 
Tm5-59.21 

3 ' prime r=CTGCTGCCATCGCCTCTC (residues 95 to 112 of SEQ ID NO: 162) 
Tm3=59.19 

probe l^TTGTAACAGCAGAAGCAG (residues 71 to 88 of SEQ ID NO: 162) 

probe Is tar t=71 

probe 1 Stop=88 

direct ionl-Reverse 

Tml=68. 98 

scorel=l .92 

length=89 
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CT600 

Nucleotide 

Genomic coordinates: 
Start: 1118 

Stop: 491 <SEQ ID NO; 164) 



Amino Acid 

MHMWGVYAAILAGLTLI LW IS I WTN I ELNKKLDKKDKDAYPVESEI INLTINGVARGN 
HFNFVNGTLQTRNYGKVYVAGQGTSDSELVKKKGDIILTSl^GTCDHTLNWKAESKELE 
LYARVYNNTKRDITVDSVSLSPGLNATGREFSANKFVLYFKPTVLKKNRINTLVFGATFD 
EDI DDTNRH YLLSMRFS PGN DLFKVGEK 
(SEQ ID NO: 165) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

051578 (051578) EXODEOXYR I BONUCLEASE V, BETA CHAIN (RECB) 39 0.033 

068195 (068195) DIOL DEHYDRATASE -RE ACTIVATING FACTOR LARGE 32 3.2 

BACCJ3ACLI (068008) BACITRACIN SYNTHETASE 3 (BA3) [INCLUDES 31 5.5 

TKTJMYCPN (P75611) TRANSKETOLASE (EC 2.2.1.1) (TK) 31 5.5 

Q9ZER8 (Q9ZER8) REPAC1 PROTEIN 31 7.2 

CAA10001 (CAA10001) REP AC PROTEIN (FRAGMENT) 31 7.2 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 569 of CT600: this 
corresponds to nucleotides 513 to 1081 of the genomic reference sequence. 
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CT520 

Nucleotide 

Genomic coordinates: 

Start: 209342 

Stop: 196799 (SEQ ID NO: 166) 
Amino Acid 

MFKANVLNLGGGKFLESDVRDHLIKCANQMKEEPTTLRICLSNKLPEYDNRRLPLLLLNE 
GEQILVTDNLTKNGNPLVKQMGHLAVQDRVGGDGSVKPNNLLYAGCNVVEyDTVNRGNDG 
KLIMYSQPATLKDVAKSKKKGMYKVKKVPEITGDQFLDKLNERSCQNENRRMDEEGPHVG 
TGKLLRELI IMMRLYEEETS S AEKLCVT PAFRE FLGCGRTATDVPVFKVAFI TNASLMGL 
KVIFYPTILEEERIAAVSDTENVVLLKSILKVQLELLSECMPRIVERVESMIKKTVACFK 
I DIGGS DNWNLPGHCKVSDTAFP YHHAQLVGEKKNI LS I SNENMVTSLGWKADRAEEWM 
CKTLESFEKKCLYLENLMGSMANTDDWRRKILFSELGPEMPYRNKSLIMDQDFCTIC3^CY 
KFLAEGGGLLLTKTNATLLKEKMACKGLDDSGDGDDEEEDNEEGGSGGKSGGGSGDENNI 
NKPPPAPKQIPPLAANVYNSIINDDDKLDQIVCFFKRKHGFFLSDIDNSPLI*AMEET«LPQ 
KAMS KKNCVER VKPETKNI I RNLTG VN TIKFDTIMFFAI LQ I V VR YENRNLKLPRDT DI L 
QQRLKNNTWDALSKGKFAEMWQFTHKESLKPPTIEELESIPPPPTQSEEEAAAAAAASTS 
STTPDMVSSLEEGATSTSSSDENQIASLENIKKLLSIITSTFATGADKNDTIFAWTWTL 
AERFCALYNITSHPEEYYQQIIREDFEFEGGFEKFRHMCDAINRELSIYVPKSVLEKQSV 
CRMGVAAYENSMERIKNKTNSKLCKIKYDESTMVYELNNDTFKTFDYDESDKSFGmYEC 
APMETFQRLFASVKSDKEAVLADKKSEKREKLYQQKQEYLRKCDNDDVSARQILNNVASN 
ESDEESDEESDDEENYGAAKGGATGDYYGGDDEDDCYGFLGEFGSSDDENVPSDNASSIN 
NVQDDVFRDVNFIKTFNFRSSLCHRQKYVSTVIVEEMEKNLCDVLTLDNSAAESGDILKE 
INRRSLRMRNWVVPFTMPVREIVKPNVNSEDGTANSNNNIPPFCSCASLNNFKSDSPLSS 
NNTMSNEKCIKLLPIPSSKHLKDLTVALRFNTMACERRYFSDVTAALGFVKKDKVNGNIR 
SILDNKRWDAIKQCKLAGKCLSSALPLGIYENVISEDNKLINTFRPRSLARLACSSGGDG 
VSDKSVNNGFFSGIWALCANQDLESVVLGSTVVDPLKPTKVFNQSLSEKELKEKRQQMCL 
DAANYFKDHNVSKLNIYECFKMMEECIMRTALNGKTSNDSEFFSNLITRYGSGTNSPASR 
LWTILETVRECFNNSLPIDWGSLVKDWDGSDMLNLKAGVSNVDESGAVFELSEFLGVSAR 
AFFGKDLDTNLDADTWECLLNDDNKDWKAQVAKAYEFALKDNDIRSVENFINSSNLLTNN 
NVIKKLKIKPTPSNDVRHQIWVEDEYYPRNKSTIJiSRAEWMAATEEVLKTEMSLSCVLAM 
VAMYRIMMQGES VREI ATAPLRLSVDKMVPL I RCFKI TSKWCSCTGKGDSPKKADAS IKE 
GRFYDIEEDPLHFYRFAAYVIGQVASNDIVIEEMTRKILMSFDFNGFDTSNWLQFITYRF 
SHVI^GRRSRLLSRPLSLVKNLVSVSSLADKNSEKSNDMYEKRVGKVMKRIARLVLVKAA 
DSVRASSNDLLDCCILDVKDVSVKSLDEFRAKTRQELQETRIDTNYNLVSNSCTTAQLAA 
VEKS SRI INTN I S FHNI PAGQAKVMDANEE AFI DPS LEE I NKEDNSGAKQMTGKGGSNRG 
RSKKSGGGGFNNAGGFYNDDSSRGSSSWDEDSRSRTGFSQIHMDARNEEDRESGLFSYD 
GYVLNRIKNMITQNQINNDIVKVISDIENFFKICVPFSKECEYALYGVTETALSAGMDAIE 
RWNKAVEEETNKIRKECRDLTDTG5VYDMNI ICPGDYMSSVGEGGNGGCGGGSSSSGHLL 
SNNNNEANQTNEISEDQLKHEGSCJCSFWFNFYKKVVNSSEKKQGKGSVLANTGHEGRIVG 
RPI4RTFIQYKGKGFAETKVLTRYFSNHISDSYWSQVMPICYIKNMALGDEDKSKKKFGKR 
PWKNFNNNSNSSSNSSVKYVSIQDLEKKDSLKNVPMGYDEDLLSLYDDSLTTSTEKLENI 
KIVNDSKDAYVILGSSNQSSFDQTFSQQYFTHQKISNINTYKSIjGKMWNCNNGMSPKNQI 
VLLKKLLFKNLNILWIKLYERHISVLCNWGCIHPNSSKNSHFEMTECNNAPCGVTDSNPPL 
SVYHSGFLSVEDYGQLLKDTFPLMNLHRTFSAKSKDNNSSDPSPEKISAASLAKAVYARE 
VLSSCLDPEGNFCTSWITNSCSVLFTPGTNIRRGGDFFNKSCYRQQDNDYCFIGKEETKK 
CPNFVSSEIEIVSILKTAVFLSTNSDGHKRVLRVINYNKDHSGLYAGIDTGCADDEDDDD 
DQGGTDKTCLLQEDSMDAKRMLI SMRS V I NGKSLDE SS LA I KKDNFN FLAGT DKGF YLDN 
SFFNSPVQGKFVAPRGTKIFKKCCDFLLNKGTGGVFARIFFTDWACIVSSSKGKNNKKAI 
ESTLQIRNGGCFSERLTPSMFDNESEQGELFHDRYCPDFLSDYNKQNIFSEQAYKCSFLA 
NPVCPAKNMLKRAKNIRLCITNAGTALISKIMAEVEKMGNARTFISNGTAIPFRLAENTA 
CISVDNNRYFLIDGTYLLGGRLEGINLVTDMYTRCKLKAEKHVILNSLFSTEFISAALAS 
SMEGTTMGRGLCLIEHVSYMKNTDSVSNMNKNFWSMAEDQEETDENEDDDDENEEDEDEN 
EENTENTSVVKYEPVSKTAFSSSLKPPSIFIADEDYIFLSILYELAKATSDCETASSSSS 
SSSSSSSSSKHSSSSSSSNKKRKQKDDVNSTTTALHALRKCYISCVDQKTGMPRMDWYL 
LRGLMNFGGMCTAIASGDGEKAHHMVQTLCSVALNIATKTAVVFVGTKGNNLKTTLVDLC 
KRTWFERFTNINVTALNNAGDSSSSTQANLASFAGKKGIVIIDEVGHQGSFGSKKSSSED 
DKDESA5RSGNVDFGGSGGEMNSVDINEARNAYGDGGNSKIVFSNINRLMTESKLKVCDQ 
EYDFISELKHEKNRKNACNDTKKRKRGGEIEDEGVECEEIERNDGKNDENGVRIKDPINI 
S FFARKAHWWNCSSG VVSTTFKEKN I V YNMLHRGAM P FS IKDCT DSPWLNETDAVYRHCK 
KPIEYEGKFSKSEVKTALKCILGKFGSKICDNESFESIIDENCQVNNLHSWNDCKEDIDE 
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WNEKraSKNKKNKQNMKIEDKVDAIMNIIQKNNGLLKWNTSFDRDGSPVLVCNPATERFS 

EMITSSI^AQDMLEIKKYLGDNCLSTNGGVKKSVIDGNTSAPGVLIAYHCVYTGKISDDL 

SKTN Y PVLLPPPKKQHFVAVDDAAEKALLGPTLSNINI DSIRNIKTI SRKLSS I IKDPEA 

AKLLVDRDLDFMNMYERYDASLFDVVKKPSKYSFPGFTSDGSVVLSTSTSDCENVLSCLK 

KRIEKDKMSAKNSGSFIRMCMDKNLLSDEKDDSSSNSSKNTSSLPKTDDNSSDIANFLSV 

FGENRQQSSQFSFASNSSGGGDSNKEACFNVDTPKRRQLVSALQKHNSDGSSSIITEIAK 

AIPQKNDVSSSITKHMLPGQFPSSLLKNMTSPQNSVMIRGIFQQGAKSSITVSPIMMSNS 

YIFSFFVDEAMSKRLIVFPCDTTEVFENECNEDVKKIIGia.DRGMKYIHSSLMMERCIKFG 

KHGIKQRQHEFNHHKKAWNDFSGHSSDNKKKDRISDVSSVLPSVLMKNLIRNKVLELRDV 

KSVSRLEENTNTFFHLYTSMSLCAKAATNYGESSSSSATITEVEEDNSCDAEEQQLRRKK 

PANYESMCNKLPSPLQMCQINPKSLNTMAMNIARSRQGAWAQLNSMLNSVLFVEMPFVKT 

TRFFGRDFNIKMHSPATKNRPAINFDNCIGMSLPNPDMDVVGYDKEGELIGVGSSLTKHL 

CDAWGSMDVRDLMYSCHHLHMLFEMA1QYTECKRRLSSLKTLKSDKTGVDYVAVMLACMV 

YQIJ4VSNLKYPVFLSSSSHKRANTEDIADENQVSSLSVPMFLAMVVNKPLHALRHSTNLA 

LPNASQKS DH S DIVKY I VMNQWGLRLN PDYLC PNCVKH VL 

(SEQ ID NO: 167) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 54 2e-05 

092 1T1 (Q9Z1T1) AP-3 COMPLEX BETA3A SUBUNIT 54 2e-05 

AAD56 625 (AAD56625) NUCLEOL IN-RELATED PROTEIN NRP 52 7e-05 

P90493 (P90493) HERPES SIMPLEX VIRUS TYPE 2 (STRAIN HG52), 52 9e-05 

P70475 (P70475) NEURAL ZINC FINGER TRANSCRIPTION FACTOR 1 ( 50 3e-04 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 50 3e-04 



Comments ; 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1064 to 1 of CT520: this 
corresponds to nucleotides 196778 to 197841 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5 , start=7375 

5'stop=7397 

3'start~7450 

3*stop=7472 

5'primer=GATGACGATCAAGGTGGTACAGA (residues 7375 to 7397 of SEQ ID NO: 166) 
Tm5=57.66 

3 ' pr imer— CCATTAATGACAGACCGCATAGA (residues 7450 to 7472 of SEQ ID NO: 166) 
Tm3-57.78 

probe 1=CAATGGATGCTAAGAGGA (residues 7424 to 7441 of SEQ ID NO: 166) 

probelstart=7424 

probe 1 S top=7 441 

directionl=Reverse 

Tml=68. 98 

scorel=l .98 

length=98 
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CT601 

Nucleotide 

Genomic coordinates: 

Start: 1511 

Stop: 1196{SEQ ID NO: 168) 



Amino Acid 

MYLSHIRQTPLVEERRALTFKMYHHNNNNQHSFVNCQCRRTSSSINCSSCSRETFNSVKA 
I QYFN KTSRNNT AHH FKMPAS KDRN YSS FE YAETAVAAHN I SQW 
(SEQ ID NO: 169) 

Top Blast Hits 

Sequences producing significant alignments: 

YT66_CAEEL (Q11082) PROBABLE G PROTEIN-COUPLED RECEPTOR B05 

000885 (000885) MAP KINASE KINASE PROTEIN DDMEKl 

CHD1_HUMAN (014 64 6) CHROMODOMAIN-HELI CASE- DNA-B IN DING PROTE 

097292 (097292) PFC0965W PROTEIN 

096226 (096226) SER/THR PROTEIN KINASE 

096563 (096563) C-13 ANTIGEN (FRAGMENT) 



Score E 
(bits) Value 



32 
30 
30 
30 
30 
30 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 353 to 3 of CT601; this 
corresponds to nucleotides 1184 to 1534 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=96 

5'stop=116 

3'start=142 

3'stop=162 

5 * pr imer^CTTTGTGAATTGCCAGTGCAG (residues 96 to 116 of SEQ ID NO: 168) 
Tm5=58.03 

3 ' pr inter =AGTTTCACGGGAACAGCTTGA (residues 142 to 162 of SEQ ID NO: 168) 
Tm3=58.43 

probe l^TCTTCCTCCATCAACTGT (residues 124 to 141 of SEQ ID NO: 168) 

probe lstart=l 24 

probelStop=141 

direct ion l=Reverse 

Tml=68.98 

scorel=l. 98 

length=67 
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CT521 

Nucleotide 

Genomic coordinates: 
Start: 272387 

Stop: 268691 (SEQ ID NO: 170) 
Amino Acid 

MSKSSSSTVKSASFFNSLMENAPSSKIELLEDGWTKKAAAADTDTPTAKPTGLSISLMDI 
SGSMGSVKSAVADSCSGIMATLNVIAPGIQNAIVYYNDFDKHSIESGPVVRAPDCSEWEG 
GDFVKHMRKTE VCGGGGGGS EALHS SLM Y VFNNM I PAFKKMHG I TRDEKFP I LIFVFTDE 
DVRIANSDTGKLCANSYDSETAPEEEFIMKTWGQKPLTILDMRKALVENDCWLRILNFSR 
CSGSNQSELCQEDVINFSGYDNNRWQLFESFDRRSCNVRKNIATFIMRQSISLFKNLNDQ 
FSAFPILREINQEELNVFIESEGRSEPAGFEKYGDAQRESFKSRVLNMAPLDFGRWQGG 
GRYNNHKRSVFLNCAYDSAFCCSKQTFNPQQQQQQQQSSSGGGGISKLAWTQRAQSITG 
GGNAASTLALHMNACFQSLDDFGIDHTNLCDCKGCTKLMASVEATSDQGRKTKLSRKYA^ 
VHWAKMFAEKLFKMMIKEQSMMYACSAVPDEIGAIYAFVTGNNAGVCSRVSTILSDLGTE 
CGNKAEYAFLKEGKHMKSASYDALQVINNTDLTPEQSSMFMWFYVPNDALEEAGKIFHQS 
FSFSNSYTGGGLLSLDEYKRFEFGQCFDFIKKLVSCLKITRNVEDVLLETSKTSNRYFAI 
PVFCGSDDQKEVLREELASDLFGGREDVAEMMFIDLETVIQKLGTLYDVRLSLPEGGYAA 
IKSVCAAASWAASCEVPSNTSNMILSIAKMAFTKYYQEQNSSSETDLDIILPSILEGTAD 
GEIENNLSGVVFLRCLITWANKIGVDKNFTNKLEHFLALRILTKAGDSKIGEKYETFPVR 
RLDL S EKDLKY I CKRCG V KS LKM E Y DN DEKLCLRCKGN YRMGKPMV Y HW DNKLTRDPRAK 
TASPTTLNLLNAKKI DDKVKEMASDI IGALNLPPTDKDNEIAVSAAAKAVGILYGKTCLL 
YKLLNEGNIDIPVAVCVECDCCKSKYMMSTLGPDKPQNRKCPWCRYANKLVAMGRGGKKL 
LMDLI ECGAPSLAMVEEAI RTSGDVMYEELGEGEEFY 1 1 DYFLKLKNTAI AEGNKLQQNN 
NKRPAPLQVTSPSSPPKKMRSDLPDSLLAAIGECAIETKEKTTVNLIGLGEVKVVENVGP 
NDLDGKDPFISLQEYCSWDKFNSLFVNPWLGYRLDEQWDDWNTFLIHVKKNDVWKFLCNK 
TSPFSVVVMNDGSGLLNVDNVNVLVRQKICV 
(SEQ ID NO: 171) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04 634 (AAF04 634) HYPOTHETICAL 45.2 KD PROTEIN 76 le-12 

Q9Y1T1 (Q9Y1T1) DNA POLYMERASE ALPHA CATALYTIC SUBUNIT (EC 36 1.3 

P91805 (P91805) ARYLPHORIN GENE-SPECIFIC BINDING PROTEIN-2 34 6.4 

045322 (045322) DY3.5 PROTEIN 34 6.4 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 895 of CT521: this 
corresponds to nucleotides 2687 36 to 269630 of the genomic reference sequence. 
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CT602 

Nucleotide 

Genomic coordinates: 
Start: 2996 

Stop: 2702 (SEQ ID NO: 172) 



Amino Acid 

MDILEDIYKSAITLVLQSPEFVNDVKQEASQVVEGLIPSIREAVFRRLLEEERKKHEDEV 
GDVE DKRQAV I DKANTM I TTM AAE YLE S V D I LEE FG F 
{SEQ ID NO: 173) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CRCM_HUMAN (P23508) COLORECTAL MUTANT CANCER PROTEIN (MCC P 36 0.055 

ACVS_PENCH (P19787) DELTA- ( L-ALPHA- AMI NOADIPYL) -L-CYSTEINYL 33 0.48 

ACVT_PENCH (P26046) DELTA- ( L-ALPHA-AM I NOADIPYL) -L-CYSTEINYL 33 0.48 

Q9XJC5 (Q9XJC5) HYPOTHETICAL 26.4 KD PROTEIN 32 0.62 

NOSZ_ACHCY (P94127) NITROUS-OXIDE REDUCTASE PRECURSOR (EC 1 32 0.82 

EZRI BOVIN (P31976) EZRIN (P81> (CYTOVILLIN) (VILLIN-2) 32 1.1 



Comments : 



TaqMan Primer/Probe Sets : 

5"start=121 

S'stop=144 

3'start=200 

3*stop«222 

5 •primer-AGAGAAGCTGTCTTTAGACGGCTT (residues 121 to 14 4 of SEQ ID NO: 172) 
Tm5-58.14 

3 f primer=TGCCTTGTCTATCACTGCTTGTC (residues 200 to 222 of SEQ ID NO: 172) 
Tm3=58.14 

probel-AACACGAAGACGAGGTGG (residues 164 to 181 of SEQ ID NO: 172) 

probe Is tart =164 

probe lStop=l 81 

directionl=Forward 

Tml-69.14 

scorel=1.85 

length=102 
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CT522 

Nucleotide 

Genomic coordinates : 

Start: 276736 

Stop: 275206 {SEQ ID NO: 174) 
Amino Acid 

MASGFAIKGIVKNYRRIPSIIESIKSIRRSELAEGVYIVSLHKNTPKHEVDEIVNKIRLS 

AGNPCLEKTSLFLQHHSQMRNFYTRKGAESESDWLKRLPEDLRNINNIVKREALPHDKSF 

TFSPLYRILTDRLFNAAIHNCKYIIVTADLLMGCGITNNKVEKKLLSMGSILGGESMVPL 

HDIAHRLSYKGLRIENPIVGSCH DQCLVVPVSMLGKI FSSNMYPTFKNFDQCMALFLNAV 

VTHSAEKMDGKHERNKVIHMPNEVYLDAARRKYLEEKLEETNKLDAIDEEAREEYGNEIG 

RIGDKSTCLVFALSARDFFLTNRFNEDTPLYSGTERGIRFMCSNYCTMRDEGGFRPRLIM 

SAYGPTSYPI I FNTLYDQFNVQYYPCVSGWLS FI GDDQLAPEPESLVDI VVRS IKNPS I 

RIFSGDGETVYQDGRRVDVGGEGKNQKFNREERTILNVLRIIKAYNEERTKEDEDEEEEE 

EEEE EQQT AAT VT VE S DW DLSLERGENW V 

(SEQ ID NO: 175) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

061310 (061310) TSJ5 44 0.003 

CINA_ELEEL (P02719) SODIUM CHANNEL PROTEIN {NA+ CHANNEL) 42 0.009 

MYC_BRARE (P52160) MYC PROTEIN (C-MYC) 41 0.019 

Q9Y0C6 (Q9Y0C6) J5 PROTEIN (FRAGMENT) 41 0.019 

035788 <035788) CYCLIC NUCLEOTIDE -GAT ED CHANNEL BETA SUBUNI 41 0.025 

MYC CARAU (P49709) MYC PROTEIN (C-MYC) 40 0.033 



Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 374 to 1 of CT522: this 
corresponds to nucleotides 275340 to 275713 of the genomic reference sequence. 

Hit to public SBV sequence: 

gi|60074l0|gb|AFn8573.l: CT nucleotides l to 552 match nucleotides 555 to 6 of the 
public sequence with a 99% homology, a score of 1063 and an Evalue of o.o 

TaqMan Primer /Probe Sets: 

5'start«682 

5'stop=703 

3'start^767 

3»stop=787 

5'primer-AATTTTGATCAATGCATGGCAT (residues 682 to 703 of SEQ ID NO: 174) 
Tm5=57 . 93 

3 1 prime r=CGTTTGGCATATGGATGACCT (residues 767 to 787 of SEQ ID NO: 174) 
Tm3=58 . 39 

probe 1=CAGTTGTTACACATTCGG (residues 716 to 733 of SEQ ID NO: 174) 

probelstart=716 

probelStop=733 

direct ionl=Forward 

Tml=68. 99 

scorel^l . 99 

length=106 
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CT603 

Nucleotide 

Genomic coordinates: 

Start: 24906 

Stop: 24660 (SEQ ID NO: 176) 



Amino Acid 

MTCPE I SKH I SGTDRRFWNTADPGGLS Y PFNPLFTLHLHLKNFSKI FS AHSS LGGGPLTR 
P Y VKFEGWTAGSTQRQ I TERS 
(SEQ ID NO: 177) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB4 9803 (CAB49803) HYPOTHETICAL 69.0 KD PROTEIN 31 1.8 

074312 (074312) PUTATIVE TRANSMEMBRANE 29 5.2 



Comments: 

Hit to public SBV sequence: 

gi (6165655 igbi AF099142. 1 : CT nucleotides 118 to 231 match nucleotides 5652 to 5539 of 
the public sequence with a 90% homology, a score of 139 and an Evalue of 5e-36 
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CT523 

Nucleotide 

Genomic coordinates: 

Start: 281865 

Stop: 281127 (SEQ ID NO: 178) 



Amino Acid 

MVSTRSMEAKAAAAAKAKEVSPTTSKRKAEDLTEGTEEEEESVETHPPSKLPRVDEDEVY 
I DENVDGDVQ I LASS I E V ARMERERLAEAMVRDI KIEEEKAATE ARKE I ASRL I YKEMV Y 
LLPQLENMTNRLRPRSLIJ*HNEMTITDRTFSDLQIFNKVTFEFPILTDIAFLAREKSRVE 
GSRFYNDMKIGPITAYKLNLMCNKFIESWQKVKAEISPFVEVSVSSELEGSPFWDFKQR 
IVKHT 

(SEQ ID NO: 179) 



Top Blast Hits 

Sequences producing significant alignments: 

Q23915 (Q23915) PROTEIN KINASE 

077819 (077819) CORNEAL EPITHELIAL RHO-ASSOCIATED-SER/THR K 

Q134 64 (Q134 64) RHO-ASSOCIATED, COILED-COIL CONTAINING PROT 

P70336 (P70336) RHO-ASSOCIATED COILED-COIL FORMING KINAGE 2 

P70335 (P7 0335) RHO-ASSOCIATED COILED-COIL FORMING KINAGE 1 

Q63644 (Q63644) RHO-ASSOCIATED KINASE BETA 



Score 
(bits) 

43 
43 
43 
40 
39 
39 



E 

Value 

0.001 
0.002 
0.002 
0. 017 
0.029 
0.029 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 712 of CT523: this 
corresponds to nucleotides 281143 to 281854 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=404 

5'stop~425 

3'start=502 

3'stop=522 

5'primer=GATCACTTCTCAGGCACAACGA (residues 404 to 425 of SEQ ID NO: 178) 
Tm5=58.93 

3 , primer=ACGGGCAAGGAAAGCAATATC (residues 502 to 522 of SEQ ID NO: 178) 
Tm3=59.06 

probe 1 =AATG ACCATT ACAG ACCG (residues 426 to 443 of SEQ ID NO: 178) 

probelstart=426 

probe 1 S top= 443 

direct ionl=Reverse 

Tml-68. 98 

scorel=1.98 

length=ll 9 



FIGURE 3, Sheet 97 of 160 



WO 01/38351 



139/201 



PCT/US00/28888 



CT604 

Nucleotide 

Genomic coordinates: 

Start: 50300 

Stop: 49079 (SEQ ID NO: 180) 



Amino Acid 

MSHINSTSAATTSSNTLPICTTTAPMIAAARAAAIASRTSASAVTSINSNSTSSSAMFRV 

PQGI S VTAMPPVPALTSLTE STGTRMS STPNVDV I PV PGPKNKSKSKKKDSKRKKNQNGN 

RSSDEDEPSLVIDDGSGRQSKNKKYSWVTSLATTTAERNNDTLAPPRPFLPTPEEGNMSE 

IDAGLSNPVTRQITGEVYSAALTSGVGDNGLYPSHFTVADTSYGDCETPIPGPAFVLDDG 

TVSRGTSLLHREEAEFLNDGSKVIHTVKPRNSKYSNIQRAASCMAYAVDLLNNHNITSDQ 

FDFMAMTAWAARQRCGEMAKFFEKRDKDrGEYRNKVVQYNRGIFTRTTEMNKRAKIILEQ 

QQRREAAAAAAATGATAPIPTTSAAGVGATSSATTNSLEYQEIRYQ 

(SEQ ID NO: 181) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AM YH_ Y E A S T (P08640) GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1. 55 8e-07 

AAC49609 (AAC49609) GLUCOAMYLASE 55 8e-07 

039781 (039781) MEMBRANE GLYCOPROTEIN 53 4e-06 

039782 (039782) MEMBRANE GLYCOPROTEIN 50 3e-05 
Q9Y075 (Q9Y075) PROTEOPHOS PHOGLYCAN (FRAGMENT) 48 8e-05 
Q14888 (Q14888) MUCIN (FRAGMENT) 48 8e-05 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 666 of CT604 : this 
corresponds to nucleotides 4 9128 to 4 9792 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=572 

5'stop=592 

3*start=640 

3»stop=658 

5 • primer=GCCAAATCACCGGAGAAGTTT (residues 572 to 592 of SEQ ID NO: 180) 
Tm5=59.06 

3'primer=CAGCAACCGTGAAGTGGGA (residues 640 to 658 of SEQ ID NO: 180) 
Tm3«59.24 

probe 1=CTCACTTCTGGAGTTGGA (residues 604 to 621 of SEQ ID NO: 180) 

probelstart=604 

p robe lStop= 621 

direc t ion 1= For ward 

Tml-69.03 

scorel=l - 96 

length=87 
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CT524 

Nucleotide 

Genomic coordinates: 

Start: 291298 

Stop: 289681 (SEQ ID NO: 182) 
Amino Acid 

MEDFKQLKVKNGICLSGENTENYERVLLTFKSVKSVRRSELKEGHFIVRLRDKEVLHIKN 
GNERLRQLTGDPTLQIGLKYTSSLPKQGSFLEDEDPNYGKKWNESLPSPFQEMNKIVEEK 
ALVNDKNFKFSPLYRI IHERLSNAAVKKCDYMI ITTDFLVGCGFSPRNCTRTLKNMEQVL 
VQHGGTSSRVSVYDICDRLTYNGLSIANPIVGSFSNMCLIVPMDKLGLLFYNSTHPSAKS 
IGNYMSCLFNAAWYTLEKSNQKLDNFEKEIRFAKNEVNLLVSERSVLEEKLKESKKLYA 
ASEEQRISLRDVHKKSSIASSRYDGGACLVFAFSDRDFSLLCRTNGNGSFYSATEEGIRY 
VSS DDYRKRDV DERRPRLVM S I TGS DAP I CI RDS I RNHFNNHFIASGKGNEISFIDPPNE 
RLLMEMVREVTGSDIKIFMDNGKVYQDGVEI KVI DPSSKEGKDI IKKEETLPEEERKRLR 
RERRMIFNTVKAIETYNEERGEEEEVATSSGGTKRKREEKEGDYVALLNKACKEIKVC 
(SEQ ID NO: 183) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q57105 (Q57105) FESMID CLONE 31, COMPLETE SEQUENCE 39 0.10 

094986 (094986) KIAA0912 PROTEIN (FRAGMENT) 39 0.10 

094944 (094944) KIAA0866 PROTEIN 38 0.23 

MYST_HUMAN (P35749) MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFO 38 0.23 

YDH6_SCHPO (Q92351) HYPOTHETICAL 140.8 KD PROTEIN C6G9 . 06C 38 0.23 

Q22705 (Q22705) T23G7.3 PROTEIN 38 0.23 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1642 to 1 of CT524: this 
corresponds to nucleotides 289650 to 2 91291 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=723 

5 f stop«74 6 

3'start=828 

3'stop=852 

5 • primer =TGGAAATTACATGTCATGCCTTTT (residues 723 to 746 of SEQ ID NO: 182) 
Tm5=58.47 

3 " primer -TTCGCTAACTAGAAGGTTGACTTCA (residues 828 to 852 of SEQ ID NO: 182) 
Tm3=57.59 

probe 1=TGCTGCAGTTGT AT ACAC (residues 750 to 767 of SEQ ID NO: 182) 

probe 1 s ta r t=7 5 0 

probelStop=767 

directionl=Forward 

Tml=68.96 

scorel=l . 96 

length=130 
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CT605 

Nucleotide 

Genomic coordinates: 

Start: 51809 

Stop: 50423<SEQ ID NO: 184) 



Amino Acid 

MDSSASVVFMRFAPPGEETALPPRRATPGSVAYDLFPSEEMDIEPMGLAKISTGYGIDKF 

PDGCYGQIVSRSQ4TWKNNTSVPTGTIDVDYRGELKVILRNHSAEKSVPIRKGTSIAQLI 

FLRYCDVEEEQIVYINETTGERTIIDSSSKKDNKNQARSVRGTGGFGSTDNPNFTETTVS 

RNQQEENKKEELEEGEIVEMEGFIDIPFLEGFENILAEQSNETGVTYPNTNQDVEEKDTK 

NIDWRELEAEFSSGIGSGSMDSSDSSDSSSSSSDSSDSSDSSDSESSDDSEGGDNKVRR 

IRRHQYHRRQLSYSDDVNGGGRNSEKMEMDRVTHIKTEHIKREDEPRYEERERYIHPRRM 

QVPKDYYCEQYEHYDAPAAAHHHRHHQHRHQHQRHFNQPRSNNSSDVTAYVNENSPTRPC 

RDRNS RFSER PNNGG YNRIN SR YTTFDP YRYGARRGRGGVY 

(SEQ ID NO: 185) 

Top Blast Hits 

Sequences producing significant alignments: 

Q89662 (Q8 9662) COMPLETE GENOME 
Q86612 (Q86612) 0RF2 

DUTCHVPl (041033) DEOXYURIDINE 5 • -TRIPHOSPHATE NUCLEOT I DOH 

DUT_ YEAST (P33317) DEOXYURIDINE 5 ' -TRIPHOSPHATE NUCLEOT I DOH 

DUT_LYCES (P32518) DEOXYURIDINE 5 * —TRIPHOSPHATE NUCLEOT I DOH 

DUT HUMAN (P33316) DEOXYURIDINE 5 • -TRI PHOSPHATE NUCLEOT I DOH 



Score 


E 


(bits) 


Value 


90 


2e-17 


89 


8e-17 


88 


le-16 


87 


2e-16 


83 


3e-15 


83 


5e-15 



Comments : 
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CT606 

Nucleotide 

Genomic coordinates: 

Start: 65024 

Stop: 64010 (SEQ ID NO: 186) 



Amino Acid 

MSSSQGLNNNMCTTEILLPKCTSSSLSLEESVDYLEKDFEELGIPLVEGKEVLLEFAYKI 

LKKRDTIRVIGDEQGDVCSVFFLRFGKKKTFNPQTKMWLVKLANAIALSMGWPEPACTC 

SRMMTTAKKIPVPESYKNVNRNIQKFEDVHYIDINFQSFVREQIGLSVLGKNDVQKKKKE 

ETPFFAPFNKSKIGGECIEDLKYDSESVSIIRDVFNLLGEMPTEDVKTSRSCINPSHNDT 

N PSMRLV FR PM YWRNSKL VM DKLSKEQDS ALIEKYMGGEHQHC 1 1 GGRNV LL YC I TALC F 

S S DCG FKKM LTN DE I KQLIW YLVLLFFH I ICP 1 1 QSK 

(SEQ ID NO: 188) 



Top Blast Hits 

Sequences producing significant alignments: 

YNHE_ECOLI (P77S22) HYPOTHETICAL 56.3 KD PROTEIN IN LPP-ARO 
TTK_MOUSE (P357 61) DUAL SPECIFICITY PROTEIN KINASE TTK (EC 
Q56333 (Q56333) FLIL 

CAB49508 (CAB49508) HYPOTHETICAL 32.8 KD PROTEIN 
BAA85006 (BAA85006) ORF1P 
BAA85071 (BAA85071) ORF1S 



Score 
(bits) 

35 
34 
34 
33 
32 
32 



E 

Value 

69 

2 

2 

5 

0 

0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 531 to 1 of CT606: this 
corresponds to nucleotides 64063 to 64593 of the genomic reference sequence. 



TaqMan Proe Sets: 
5 , start=546 
5*stop=567 
3'start=655 
3'stop=677 

5 •primer-CCCTTTCTTTGCACCCTTTAAT (residues 546 to 567 of SEQ ID NO: 186) 
Tm5-57.43 

3 " prime r=ACATCCTCAGTAGGCATTTCACC (residues 655 to 677 of SEQ ID NO: 186) 
Tm3=58.14 

probe 1=GTATGATTCTGAGTCTGT (residues 606 to 623 of SEQ ID NO: 186) 

probe Is tar t= 606 

probelStop=623 

direct ion l=Reverse 

Tml=68.79 

scorel=l -79 

length=132 
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CT607 

Nucleotide 

Genomic coordinates: 

Start: 68659 

Stop: 65032 (SEQ ID NO: 188} 



Amino Acid 

MGVPEAKKVYENAYGAQNGRVIKEKTGYEDCYDDEDDEDYCSGEEDCTTSSLLKATSLAN 

INSKNFLDFGRGKKSSSSSPTCDYTLDMVDLPTYNVSDLVMLGRQIATTMLKGQKNMGQM 

IliFINTTNQQIIDVLHDGFKVIRE^DTMHSRMQNKKHIYENFYCRDEKKVISEFFSRKYK 

HEKIKARIERVPIIIPSSQEEVDWLTEPPIEDMMMAPPVSNHKMDDYEGLDYHINKHTDV 

MKKRKFLTNS FL FRN V PTTS FNSS PTAVLKS RFKDAFFASQMEGVI LYYAFRM I RVMKNL 

LKSKNI^GRYTVLFTDGKAPAIKMMTRAKRQIRQERSKEKAKSRNENCLNRKTNDLLFYS 

CERMMMRLPQGLMASALLDIMRIPVLKTTGSKCMYLSNASFTEAEDDIVRLTSCLLNLET 

PGKHFSLLEKRKILEYDSYNMSGNRKESKRWEDLLNVLKQHTNDENQTLSMKLFSHDSDV 

LVKWNLMVGHHKNVCRLTGTOFKDSETFLKIGHVKFFRCMNSNSSGENQANELGGFAAKR 

RTKPNTIYNLAESPLMLSPESTLLIMLTKGSDYNSAIVSNCEYDTWVRKEVAVFENTYCT 

CVGGWEIFLSEQEARKNNKDCDDSVGNISMGNLSKSNCRKCDKKLVLPFWTIKFFYLSQA 

I DFVRDPLQLCFPPTH LIDLETDVS LKHALH RALAVN AAANVMS YLTMG S FNQRVFGT I T 

TLSDISIhXSGANNNESKNTGSDVESDTEDLIPFSNNKRKSGNDPQKSTRKKSKVNATRK 

SAPVTKKLSSSVFESIRGFFESHTEGGIINDRGILTKERIDVFGNNLDTNPEALGEENGG 

GGGIVSSIPGLSTEQTSILKTEQNNSTSDFLDFFKKFNEMDDVEEEEEKMEEGEKEEEEA 

DLETDDWLDEARKAFEYKDSDFLEAVTAATNEMTSSLAKNNIEEDEHSRCSVSSKLNNKQ 

PVMDEEKWAEIVNEFDKCISLDNITYNDNSLLSRLSGVLMDANKREDGNNSNWLYEPVQ 

GI DDERFSG V P YS VKTMNLLV I V YMNMCGLE DNT I V YQQLMP IIHSE FCGKTEE DKICT D 

RTNFMSAALEYTMLQYMPELKKTPRIKQIKRKNWERIPKVLDDFKDKVSTCTDNYNKLLA 

TLNKEGKIPSENTKWLPSQGQFMPVLGVAISKPWSPLTLWSSFYLQHQQRQDVSLTNITP 

PNSPRPEQ 

(SEQ ID NO: 189) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C 

Q9XZI5 (Q9XZI5) RANGAP 

Q13387 (Q13387) HYPOTHETICAL PROTEIN 384D8_2 

Q9YTL7 (Q9YTL7) ORF 4 8 

Q25662 (Q25662) REPEAT ORGANELLAR PROTEIN 

096266 (096266) HYPOTHETICAL 283.6 KD PROTEIN 



50 
47 
46 
45 
45 
45 



le-04 
8e-04 
0.001 
0.003 
0.003 
0.004 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 860 of CT607 : this 
corresponds to nucleotides 65102 to 65960 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5'start=1428 

5 , stop=14 4 8 

3 , start«1516 

3 , stop=*1538 

5* primer ^CGATTCTGATGTGCTGGTGAA (residues 1428 to 1448 of SEQ ID NO: 188) 
Tm5=58.31 

3 * primer =TGGCCAATCTTTAAGAACGTTTC (residues 1516 to 1538 of SEQ ID NO: 188) 
Tm3-58. 35 

probe 1-TGGAACCTGATGGTTGGA (residues 1450 to 14 67 of SEQ ID NO: 188) 

probelstart=14 50 

probelStop=1467 

directionl=Forward 

Tml=69.04 

scorel^l - 95 

length=lll 
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CT608 

Nucleotide 
Genomic coordinates: 
Start: 69118 

Stop: 68704 (SEQ ID NO: 190) 
Amino Acid 

MDGDSSSLQLLSESEFDYWETLKEQGVWELALEVFNEVSNSIETVKEEEDYTVLRSRNY 
FPTESITLYKMQEEEESTPIKKRKLASGKSPRSLCRELRLLQIPSTTTFKAAPRSSSRR 
GKNTRLRRVCKN YGAHQ 
(SEQ ID NO: 191) 

Top Blast Hits 

Sequences producing significant alignments: 

VE2_HPV08 (P06422) REGULATORY PROTEIN E2 
FLP_ZYGBI (P13784) RE COMB I N A S E FLP PROTEIN 
Q9XZS0 (Q9XZS0) GUANYLYL CYCLASE (EC 4.6.1.2) 
CINA_ELEEL (P02719) SODIUM CHANNEL PROTEIN (NA+ CHANNEL) 
GLMS_HELPY (026060) GLUCOSAMINE — FRUCTOSE- 6 -PHOSPHATE AMINO 
Q9Z8C4 (Q9Z8C4 ) TRANSGLYCOLASE/ TRANS PEPTIDASE 

Comments : 

TaqMan Primer/Probe Sets: 
5'start=114 
5'stop=138 
S'start-^S 
3'stop=259 

5 ' prime r=GGTGTCCAATTCTATTGAGACTGTG (residues 114 to 138 of SEQ ID NO: 190) 
Tm5=57 . 77 

3 , primer= s CGAGTTTCCTCTTCTTAATAGGGGT (residues 235 to 259 of SEQ ID NO: 190) 
Tm3=58.39 

probe 1=AACAACAACAGGAAGAGG (residues 209 to 226 of SEQ ID NO: 190) 

probelstart=209 

probelStop=226 

direction l=Forward 

Tml=68.8 6 

scorel=1.86 

length=14 6 



Score E 
(bits) Value 

38 0.028 

34 0.41 

33 0.71 

32 1.6 

32 1.6 

32 1.6 
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CT609 

Nucleotide 

Genomic coordinates : 

Start: 78365 

Stop: 77441 (SEQ ID NO: 192) 
Amino Acid 

MWCSTHLSYSEFFTPLQSQKLFRNFFRALEFRGWTASSTECQVPRVDLWVGPMSDYTRNC 
WFQKP/TLTFVCFWNRRFWRLVDPEMRGYT^LFSLENFTLPLSQKLFKNFFRALQFRGWTA 
SSTECQVPRVDRWVGPMSDYTRNVIAPETYINFCVFLEQAFLETGRPRNERVYPSVFTRE 
FYSSSISKTFQKFFRALQFRGWTASSTECQVPRVDLWVGPMS DYTRNVIAPEIEEVSYGH 
FWTRCFWTKILLDGNPLPLPPPFKKGPRVYNDCTTPHSNHHNHHHHHGRTSILQQTLSRK 
WSSLL 

(SEQ ID NO: 193) 
Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



DYRK_HUMAN (Q13627) DUAL- SPECIFICITY TYROSINE- (Y ) -PHOSPHORY 
HXAl_MOUSE (P09022) HOMEOBOX PROTEIN HOX-Al (HOX-1.6) (HOME 
O60275 (060275) KIAA0522 PROTEIN (FRAGMENT) 



DYRK_RAT (Q63470) DUAL- SPECIFICITY TYROSINE- (Y) - PHOSPHOR YI»A 
DYRK_MOUSE (Q61214) DUAL- SPECIFICITY TYROSINE- (Y) -PHOSPHORY 
061543 (061543) BHLH-PAS TRANSCRIPTION FACTOR SPINELESS 



37 
37 
37 
37 
36 
36 



0.22 
0.22 
0.22 
0.22 
0.29 
0.50 



Comments : 
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CT1010 

Nucleotide 

Genomic coordinates: 

Start: 41348 

Stop: 41795 (SEQ ID NO: 194) 



Amino Acid 

MTHLVLLILSLSLSPVYHHLTPYLSPHLTYTPISPITSIFPHLIHSLQFQHPVLAEPTHN 
QIWTPVFPFIPNRHHLCPQALAVYIRRRGQARSISSLQASRRATQQALSLLLPRRDLPIL 
KLQEWPLQPPPHQVLTPCWTLSYLVLRN 
(SEQ ID NO: 195) 

Top Blast Hits 

Sequences producing significant alignments : 

062081 (062081) C31A11.6 PROTEIN 
Q05330 (Q05330) PUTATIVE ORF 

Q60974 (Q60974) RETINOID X RECEPTOR INTERACTING PROTEIN 13 
P79230 (P79230) KAPPA CASEIN (FRAGMENT) 
Q9WAL8 (Q9WAL8 ) POLY PROTEIN 
AAD12852 (AAD12852) Y8A9A-2 PROTEIN 



Score E 
(bits) Value 



36 
32 
32 
32 
31 
31 



20 

2 

9 

9 

8 

6 



Comments: 



TaqMan Primer/ Probe Sets: 

5'start=209 

5*stop=227 

3'start=280 

3'stop=300 

5 * prime r=TCCCTAACCGGCACCATTT (residues 209 to 227 of SEQ ID NO: 194) 
Tm5=58.83 

3 'primer^GCTGGCTTGCAGAGAACTGAT (residues 280 to 300 of SEQ ID NO: 194) 
Tm3=58.29 

probe 1=TAT AT AAGGCGGCGCGGC (residues 250 to 267 of SEQ ID NO: 194) 

probelstart=250 

probe 1 S top=2 67 

direct ionl =Forward 

Tml-69.00 

scorel=l . 99 

length=92 
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CT1011 

Nucleotide 

Genomic coordinates: 

Start: 41758 

Stop: 42097 (SEQ ID NO: 196) 



Amino Acid 

MLDSKLLSSEELKELTSYVSTSSRRSDMKKHLLHLFEEHEKIFQFIQGKHKFSLYTLDFE 

IFYVMLNILLVEVKNILSPIPLLFDRNLQPVRRLWMFHNGPASPERCSRSLG 

(SEQ ID NO: 197) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

044068 (044068) DNA POLYMERASE ALPHA (FRAGMENT) 31 2.3 

Q49563 (Q49563) DNA, TRANSPOSON-LIKE ELEMENT ENCODING 3 ORF 31 3.0 

BAA74885 (BAA74885) KIAA0862 PROTEIN 31 3.9 

O4405S (04 4 055) DNA POLYMERASE ALPHA (FRAGMENT) 31 3.9 

O76063 (076063) LEUCINE-RICH REPEAT PROTEIN SHOC-2 31 3.9 

YD89 MET J A (Q58784) HYPOTHETICAL PROTEIN MJ1389 31 3.9 



Comments : 



TaqMan Primer/Probe Sets: 

5 f start=107 

5'stop=127 

3'start=193 

3'stop=217 

5'priiner=TCGAGGAGCACGAGAAGATCT (residues 107 to 127 of SEQ ID NO: 196) 
Tm5=57 . 83 

3 * primer =CTTCAACCAACAAAATATTCAGCAT (residues 193 to 217 of SEQ ID NO: 196) 
Tm3=57. 4 6 

probe 1 =ACAAGGTAAGCACAAGTT (residues 138 to 155 of SEQ ID NO: 196) 

probelstart=138 

probe 1 S t op= 155 

directionl=Forward 

Tml=69.01 

scorel=l -88 

length=lll 
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CT1012 

Nucleotide 

Genomic coordinates: 

Start: 42053 

Stop: 45491 (SEQ ID NO: 198) 



Amino Acid 

MAPPHLNAAADLLDKVMSGPLSPEGAQTSSPAACVGAKWKALVSFCQKTRFTTNIVMRE 
VKAMEFQGDDFNYSALCASMPQRPVTERQMFALMKSEDEEMGVSANFSPVSDDVINPSSL 
PSGQEVDSSTSAQISGMFQNVWSLLEECGSGSNSNSSPVSRTVLVCTLFIIQVFKFLVTK 
VSNVNVLNQLFGHWFGSLDVAPSNNNSVPSTVVNNNNKPSTSNNSNNISNKRVGGSNNS 
GGGRSKKVTATAKNPFNNVDGDNHGMFAGAPVDVNLDDFVFPQVETLTSKSTIPKEEVNV 
DEDLSKMCRKTALTPLEIHTFNVFISEINPSKYDRSMFCKGFLTAWDKFVEGDTAGVKRF 
RNYILTRSNYASAARAVYEASIKGTVYYNDKSKFLFHDNVNPDLDKSWGNKNGKKPRLPA 
NLMAFMGIDIVKVCAKGIQKYMFAKQFQHPEVEELVPPMAVYAKVAAGLKSGTLFDDWDL 
PEYENCQFIKYDTEGCKKHSELYAKQLLRTGLNQYNKLEEGQSAFPFANIVTVTSASSDD 
IHGDTI IELMYKTKDGVKGVSKI EDEN I IKVNPAEEKKNNRVQAEKTLYFEI DSDDEVCE 
RTEEEFFRPTSWAAPTTPLVPSNVEEEEEEEEQMEEEEEEEVEREEGSDKEDDGDAPAQ 
EEMEEEKEEEQQQQPEEESNGNENQEEEQQQQQQPEREEENKDADSDSDSDSSSSSSSSS 
SSSSSSSSSSSSSSSSSENEAEKKKEEEVPAKIQKRKRLSERPSEAASSPKRMRVEEEQQ 
QQLS PSLDI LQTAVDEMMEE I PAPE P I VATT S PKAATLALKTG FS YS S FVRGDDLS VAGN 
TSPTEPAAVPAAATCTSDVGNDFLDMLDGLPGDIVMQPGECDVTAKFFEGITLPDGTDNE 
CTGFDDLLKATETDNIITTTCFTSPIHPSSNSAPRKDIDNCSSIKRSRAGSLFDTDDDSE 
TNEVEKEAPKRKKHLKKRRNKSHRGSSGSASSSHCMSSDEESEDERDMKSTSKVHKSPKA 
HVKHSPKYDAVNSDVNNSYNNVNSTTCMSSSDSDAEAQPKSHNKSHSRKHSSSSTSDKKQ 
NQQCSINTQNVKKTVVQSPPSFRSFSPKKDELGDFLSRKHTKPVRPYNKKRDNVNTTNNV 
VQRSA 

(SEQ ID NO: 199) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9YTL7 (Q9YTL7) ORF 48 101 4e-20 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 99 le-19 

GAR2_SCHPO (P41891) GAR2 PROTEIN 97 6e-19 

AAD4 6501 (AAD46501) LATENT NUCLEAR ANTIGEN 94 6e-18 

040947 (040947) ORF 73 92 le-17 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 91 3e-17 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 955 of CT1012: this 
corresponds to nucleotides 44515 to 45469 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1345 

5'stop=1366 

3*start=1389 

3 , stop=1407 

5'primer=CATCCGGAAGTGGAAGAACTTG (residues 1345 to 1366 of SEQ ID NO: 200) 
Tm5=59.57 

3'primer=CAATCCTGCGGCAACCTTT (residues 1389 to 1407 of SEQ ID NO: 200) 
Tm3=59.65 

probe 1=TGCCTCCTATGGCTGTAT (residues 1367 to 1384 of SEQ ID NO:200) 

probelstart s =1367 

probelStop=1384 

direct ionl=Reverse 

Tml=68.99 

scorel=1.99 

length-63 
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CT1013 

Nucleotide 

Genomic coordinates : 

Start: 45950 

Stop: 47825 (SEQ ID NO: 200) 



Amino Acid 

MSSTTSPSSLWDDDDDDDEEDEKDVKQEVSNRPPIFSYMETVSFSDNDEDDNKGEEECFG 
SNFDMFGDSDNMPSTSTAPFPPPSTTTPLPTPRSIMDTDSDECDEEGAAAASAPSIAASS 
SIPVGI SEAELKKMEKKKRKE I KKLKKMMKDPLPHL Y VGGE PP VAADYKTRAN I SLYKVD 
PSIDMCGVAPPQFCAELPTPSIDVYTSSYVFPPPTPAMHNKKGSKKCQFI^GRKALRKWI 
HENVCMAPPGKRGGVFLAHLEQRFIJU2HGDEYKVPRMFVSRVLNK^ 

MTFYTNLCWIVNGWVCFDKDDGGIHGDASEYATGENFDTWFHKREEQKTNGSASKKRR 
LTPDTSNMGTSTDVQEFQTMGTNTDMQEFQSMGTNTNPIETSSVGVNTNPLPNPPPRLVI 
TPLTNDVPELDMMWLYSPSRGGGNSRMSANTGTSPLSNTPIPTCFTGGANVWPNGFVPP 
T FPLECDEDDPS I PNS YN YEE DKVFH PFYE YMAKYLS PLVPS YNKGQTCN VVQEW FKGS F 
SIAKRRGTVPKFCSNISHAFFCNMDVCTAMCKWAKTVIRHGQYCNRCIVRRSCTSMLAYH 
YIVCRDASCDVPKCRERVRNDMDD 
(SEQ ID NO: 201) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAC97971 (AAC97971) FAS-BINDING PROTEIN 47 4e-04 

Q19871 (Q19871) F28D1.6 PROTEIN 44 0.004 

NUCL_MESAU (P08199) NUCLEOLIN (PROTEIN C23) 43 0.006 

AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 43 0.008 

VP41_YEAST (P38959) VACUOLAR ASSEMBLY PROTEIN VPS41 (VACUOL 43 0.008 

035613 (035613) FAS DEATH DOMAIN -ASSOCIATED PROTEIN (DAXX) 43 0.008 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 934 of CT1013: this 
corresponds to nucleotides 4 6867 to 4 7800 of the genomic reference sequence. 
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CT1016 

Nucleotide 

Genomic coordinates: 

Start: 52093 

Stop: 54913 (SEQ ID NO: 202) 



Amino Acid 

MSEPSVYAFIDIKEIENGWEKEFGLLVQPGQKLAPFRDISYDSSKLDCDAFSCIPSDILH 

SDNEKRVGECNFAEHTSVSFPVKNPEGKTLRHFTACGPGCYRRYKQRDPHTGLPVARGVL 

MQDHVDHETGNKMCEYLNQSLVMWAAVPWIRPGDLTEGYNTTHVPGFAFKEDDERDSKRV 

KYENVVISKAYCDFFKQYYDADSGSCYRSGWMKFVHLMFGQYFTNLSYNLANPKPYNLTG 

NTWS DVVSVLTDNPI VDAGAAPSRSEMDEI ITKKKFNVFPSEQTS ARQKAEN 1 1 RSQYG D 

GVEI DPSSVDAI^QFVNREGWGTEKKSDRLMRVAIDAVMDAAMRLQVMGLDDSQSRRLLL 

KNMIKMSRNNPEYARHFSSSLKLIGVTLAIKRSVFSKGASAKRKETAINNGEQHRRSRWS 

PETVTEEDALLFARENITEDPKHPAPFVDILHSPDINSSIKSGSSSSIWNDILSRISSTR 

KLEEKASVFVKNLWKVVRQFLDILEGKLFSDGYEWDDNIPLMIGVDQILREVIKAASNM 

CARFASSALESSLVTGFIDSASAITSRLAVQLAARTFSVFLEESVIEFWAASLRLAIQA 

FADLAT LAAS ALT V I G I VI FV I Q VLGL I LDLALGLGWY DHI FS PE DLKKQVLV FRRE F AK 

AGNVDVGVAQPVTPEEIVAINVFLQTEENGEEKKEEGARKSKIDFLQKYFHSTPLMGKKS 

KFVYI QEAAQE YLGGRTMNAFGQRI I TAADDS DTTTTTQEGRRDDET VTKKMRS 1 1 LET G 

QTLKDYSSAVNYNASRLDYVGEEWVRNTALKEETRSNTTSDNLFKKTVSLASMAGAFLVL 

GIGVLVASHITLLRFTNIGLAFAF7\GLLAFIALMSISYINMNAMGVVNSDAIYRSTALVG 

DIKTDPRRVGMVQRHVGVGAKYNMITDFVSPMLDEIESD 

(SEQ ID NO: 203) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O67140 (067140) L-SERYL— TRNA ( SER ) SELENIUM TRANSFERASE 41 0.048 

YD86_SCHPO (Q10411) HYPOTHETICAL 222.8 KD PROTEIN C1F3.06C 38 0.32 

Q88606 (Q88606) (CRUCIFER) GENOMIC RNA FOR RN A- DE PEN DEN T RN 35 2.1 

PME3_PHAVU (Q43111) PECTINE ST ERASE 3 PRECURSOR (EC 3.1.1.11 34 3.6 

081424 (081424) T2H3.2 PROTEIN 34 4.7 

004260 (O04260) HYPOTHETICAL 49.5 KD PROTEIN 34 4.7 



Comments : 
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CT1017 

Nucleotide 

Genomic coordinates : 

Start: 55054 

Stop: 58189 (SEQ ID NO: 204) 



Amino Acid 

MDSLTNTVTLLVNDRLGNHRTNKPITEQDVENTLNLNSLERASLLKLYSVFIKEMQSYSG 

CIPKNKYTNVQEIFEDGLITFEWRDGTKVHRSVSPSSPIPLSTKKSPRSSPSPPPSMPSI 

KEEE FEEEFE DDEE IYETDENVEDFI NGDGE DSEEEEEE DI I VDDEEEEN EEGENKY VL A 

FSNHLRRQTAAAAAAAAAAAAADIEKKDKNHAVSAHDYTLSALQQQQQKLLQQQQQQQHQ 

QRSSSEKVTSTPNKFNKFLLPSNGFSEQTELFVCFDVDKIAQYNGLVELDILPIVAEYII 

NGLGLKCSMET PPVKPCRRKEVKDVWCQPKTS FENDAVEDKHLAFAES P I LQR PRDFP I P 

KKITAYFCLDDSVDIKNPWGSCPLLKSGSNFRVSEYSRHFNEFSGVKNDDDTSSNTCFIY 

SQKNPNIEIVSKLNIEFEVMMEGIITHRKDLFETGILSDSSLATAMAFCHPKARVRNVAL 

FYFSVYLPFSKITRKETIKCSETDKVHIGSDAIFSPPSDNPNISAHQNNNNNNNNNTSVN 

IEDRPIRNNNISRKMTITNYQCMACKERCTNNCTNGNYPDRGNQHLSHSVKGEDFFKILN 

NSKVDSLKKLSRVLIPAPPSGNYTSKFCDRSSMCHSFFCRGIEPVSTSFSSDSFEKTKLV 

LYGKVVDVINSYSAIKTSHNNRIRVFFNSEEKDNKTIPSRAESAKNAFKDILVHECNKER 

AVSYFEQNKLSSKDGHLSNKWWIELNDLNIMFEKHTODFYKKCSKVNDAESLKDIFNDFE 

KTCDKYECTAKRAIIGAQDPSTSTPSKKENGITRIISTLSEFHSKDEATVSALLDKTMLLG 

SRTIMSGVRCVIRNNSVFSGFENKNTNNNWELEIRHYVISMGGAAVTKISDEDLEQFTPV 

RGAVSVTTAPNDKLPVGAHQTWKDEQTLKTNTKRNSLYDSYNSKRNNRDNNKIKNRSLKL 

SDFNWRTPNISIQEFNANKDDVNKKRYAEWASAAPKSPSPTSSSSSNSNSSSPPLSPLS 

PTVKNSNNKPLYIPPHKRMTTTAV 

(SEQ ID NO: 205) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



077033 (077033) TRFA 6D 7e-08 

Q94463 (Q94463) K7 KINESIN-LIKE PROTEIN 59 2e-07 

NUCL_CHICK (PI 57 71) NUCLEOLI N (PROTEIN C23) 57 5e-07 

AAD4 6501 (AAD4 6501) LATENT NUCLEAR ANTIGEN 57 8e-07 

040947 (040947) ORF 73 56 le-06 

Q9S148 (Q98148) ORF73 HOMOLOG 55 2e-06 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 215 of CT1017: this 
corresponds to nucleotides 57667 to 57881 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1741 

5 f stop=17 62 

3'start~1847 

3'stop=1864 

5 * primer =CGTGGTAACCAGCACTTGTCAC (residues 1741 to 1762 of SEQ ID NO:204) 
Tm5=59. 16 

3 1 primer ^TTCCAGAGGGAGGAGCGG (residues 1847 to 1864 of SEQ ID NO:204) 
Tm3=59.84 

probe 1=TGAGCAGAGTACTGATTC (residues 1829 to 1846 of SEQ ID NO: 204) 

probelstart=1829 

probelStop=184 6 

direct ionl- Forward 

Tml-69.01 

scorel=l . 98 

length«124 
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CT1018 

Nucleotide 

Genomic coordinates: 

Start: 58947 

Stop: 60029 (SEQ ID NO: 206) 
Amino Acid 

MKICQISSPTLTLSIPLEGVYHVKQLLHLKVHLDVKGVKQLLHLKVRLDVRGAKQNPWRK 
NLCLLKKNVKSAKQLPHLKVHLDVKSAKQLPHLKVHLDVRGAKQLPHLKVRLDVKSAKQL 
PHiKVHLDVRGAKQLPHLKVRLDVRGAKQNPWRKNLCLLKKNVKS7U<QLPHLKVHLDVKG 
VKQLLHLKVRLDVRGAKQLPHLKVHLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHLKVLL 
DVRGAKQLPHLKVLLDVRGAKQLPHLKVIiDVRGAKQNPWRKNlXTLLKKNVKSAKQLPHL 
KVLLDVRGAKQLPHLKVHLDVRGAKQQQQLCLPLKTISTSFTHLLLCLYMEYGKHQNLQV 
X 

(SEQ ID NO: 207) 
Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



041125 (041125) A643R PROTEIN 

077336 (077336) PFC0425W PROTEIN 

Q65683 (Q65683) 42K TRANSPORT PROTEIN 

Q48275 (Q48275) HYPOTHETICAL PROTEIN (FRAGMENT) 

Q89659 (Q89659) 42K TRANSPORT PROTEIN 



36 0.34 

35 0.75 

32 6.5 

32 6.5 

32 6.5 



Comments : 



FIGURE 3, Sheet 1 1 1 of 160 



WO 01/38351 



153/201 



PCT/US00/28888 



CT1019 

Nucleotide 

Genomic coordinates : 

Start: 622 03 

Stop: 63019 (SEQ ID NO: 208) 



Amino Acid 

MDVEFGFFHGLLSKALLPDEKHQPVIRRLCADDSE^NKGEDGCCSFCGRRGTGESNTACLE 

QLIDVCSFIGTVSSIGTIINSNLSTSCSRLQKTSDSYAALSHSSFLDWYPSLKKTTEDV 

LPHSLRAIWNKQLPKLYEKTLQPIEEEDIGYKDYWSIEDDDNVDDGDQQEQMIIDEESY 

KTIGEKSTIELIGMYNNNKFGNEFIRIPLRETALHAQSLRYDTEAKFVNHKDSIPLFYEN 

STCTCKERLI DFSERQLQQLKQDGMDKPTDK 

(SEQ ID NO: 209) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF02799 (AAF02799) FSI10.23 PROTEIN 37 0.14 

023058 (O23058) BAC IG0OSI10 37 0.14 

OPIl_YEAST (P21957) NEGATIVE REGULATOR OF PHOSPHOLIPID BIOS 34 0.90 

Q24042 (Q24042) HYPOTHETICAL 89.2 KD PROTEIN 34 1.2 

Q24043 (Q24043) HYPOTHETICAL PROTEIN (FRAGMENT) 34 1.2 

076871 (076871) EG:100G7.2 PROTEIN 34 1.2 



Comments: 



TaqMan Primer/Probe Sets: 

5'start=2 92 

5'stop=313 

3'start=375 

3 f stop=396 

5 "primer «GCGGCATTATCCCATTCTAGTT (residues 292 to 313 of SEQ ID NO: 208) 
Tm5*58.17 

3 ' pr imer=TTGTTTATTCCAAATGGCACGT (residues 375 to 396 of SEQ ID NO: 208) 
Tm3=58.03 

probe 1=CTGAAGACGTATTGCCTC (residues 350 to 367 of SEQ ID NO: 208) 

probelstart-350 

probe lStop=3 67 

direct ionl=Forward 

Tml=68. 97 

scorel»l . 97 

length=105 
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CT10000 

Nucleotide 

Genomic coordinates: 

Start: 3143 

Stop: 6956 (SEQ ID NO: 210) 



Amino Acid 

MVLTLSCTTRRVASSKGNFSKEDAVLGNQFPILKKSNNLSIARPPSIESFSASVEKIFRE 

WNESGGEKIFDISQNEEEWMDIISLVESVYEPVFSKSLKPDKIADKTCLTAAAFAALASA 

VDEKLTILSGSDGSVLQRTTKVMKKDPKKIAESLLNNEKWTSILLDRLKTAKKLLSRRGA 

LKSAERVEVLHRLNKLKEAPLPHHPSLFDNFSGGKTSAVSAGTVIASDMHFKLVEHIFKV 

SFRKWGPCGDKTESGEEEDEEEEEEEKKHSISRFVLQFMNGHNGQHYHRPESASVYFCDY 

YDYLAYRNLPNEYKLSSMHPGTFNMEDLPFRPFAVPSTYKTELEYKRFVQSTNLPQLSFD 

YGEFLCYCI FGADW YKHLGDWDSLENSSM I S FDSQTLSGVYKNTAN YKRLGKKRNG IAD 

LAVRSMAEFIRTEAHKALTAEEMEEEEEEEEAEEEAMDQEPAEVDFLSVPHLRRKIRQAV 

S VLNN F VEN DLS I LVSN FKNVLT DDT VSGT DT DNFG S S GE FE ALS S H L FLSR I L DE V H I L 

RNTDIQRTLFSTHVSLSDKSPPSRVRGSNVNFNNNAGNISSLQTYGGIEELPENVLVGLS 

GGFEDTDMYSGEDVVVVWDGCDGGKVLSVTFNCGDNFIQLHEKTAETFKDDTDLVERIRD 

VLQTASKTGNLNKKAYSRKNIYAVLRENGIERPGDDFTEKGIALKDKTNQPPPPARSAKI 

TVEGVKGFFSGFRDILETRALTTYSAETFRDLGQGIVKETEGLTAATVAETSFSEGLAES 

LRSDANLGLEFSEDAKTWFKNDTSRSLLEETRALRANNTSFSSFARDMGVQVSADLDAE 

FAAEMRETYPDAALEQNLKDLDKFEETIPESQVKKLKKIDSYLTENPERAGKEINDTELS 

KATDSVLGKKLGNAVTVLMNNFGKVTIWGASVVAGFLGPAAVALVHASRGAHLNVVDHT 

SPKGVISYKIVDFSCADRNTGWAKPTKHPFREEIDHVIALDASFLTENGAYVFPEDGGPK 

SKYKAYAPICGTKDAAOGECGSWATFDDPHSVLPWVASMKDLPKGQSLSCDKGMSTLKAV 

SSVLLSIGKDVAEAIFEVAEDAVVGLASKAISAVINNPLFIFGVPLGFGIAATRLNPSNW 

KTGLIVFSILLWILIVRFFAGSGPLTUOWFGAKNSAKRKQTEQFEDGGGNRSKIVIJVEK 

DNANSKLQSRRNETGPMRLEELPGHEDLRPVFFPATTNYSKSAKILGYKSKPFNDFYTKI 

INTDIIKMDR 

(SEQ ID NO: 211) 

Top Blast Hits Score E 

Sequences producing significant alignments: (bits) Value 

YK06_ YEAST (P36062) HYPOTHETICAL 75.5 KD PROTEIN IN SDH1-CI 43 0.010 

Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C 43 0.017 

075184 (075184) KIAA0702 PROTEIN 41 0.050 

O60721 (060721) RETINAL ROD NA-CA+K EXCHANGER SPLICE VARIAN 41 0.050 

043485 (043485) RETINAL ROD NA+/CA+, K+ EXCHANGER 41 0.050 

AAD28522 (AAD28522) FLAGELLIN (FRAGMENT) 41 0.050 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 969 of CT10000: this 
corresponds to nucleotides 5867 to 68 35 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 f start-2102 

5'stop=2125 

3 f start=2185 

3'stop=2204 

5 ' primer- GGATTGCTCTCAAGGATAAAACAA (residues 2102 to 2115 of SEQ ID NO:210) 
Tm5=57.56 

3 • primer =ATGTCACGAAAACCGCTGAA (residues 2185 to 2204 of SEQ ID NO: 210) 
Tm3=57.81 

probe 1=AAGTGCCAAGATAACGGT (residues 2148 to 2165 of SEQ ID NO:210) 

probelstart= :s 2148 

probelStop=2165 

direct ion l=Forward 

Tml=6S.99 

scorel=l . 99 

length=103 
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CT10001 
Nucleotide 
Genomic coordinates: 
Start: 17098 

Stop: 20342 (SEQ ID NO: 212) 
Amino Acid 

MGAPTNADFTRTVSGVASSLYLVNPGAPSDREKLVLASSYSDSFVYNYKDAVVTAEAPKW 
CPFNEPALHEHIMNRLEKAGLINRSRFVCNPVKSAGEVCGFRYSGGSTPQNLIFPIGASE 
I N WLV I LRNAAR FGT VAAS AKDAIERI P DLREGGTS KH V AKNAMRRLRVWRAFNW I A£ AS 
RSAGMIRYTPFSVGCALYDHEVRRRQLKGSYERNVLFLGDHFCKESTLLADISRGGRSSD 
FWTIVEAVIRYKNRHARTISNETNAIPEDSSINLEWEDVLVKNQRDTNQGDDSTLEKTLE 
AAIKEHESIGEKRBCKHILEFIKTCLTEEQREMIFKGVGGKGNLSPAHLTNLADAILANNA 
KAGIWVIMQSLLKQINFSILHLIGYEAQRLI^FKLYMPALI^FISQRGIGDVFLNGVFN 
LE VMKERAAN AKI RDMVSRDAYKNNTNESNNLGNYSQFDI AT I YGQMS DCNA I PLS I N I G 
V PLHKS RMNMQDI EKT IQHIV DMGLAK I RVAEENRVARRLLHRRKMQE KAARERAAARQR 
RLRGEEEEEEEEEEEEGEEMEEEEEEAGTSGVNGSGYDQEEEDDGEEEEEEEEEEDEEDS 
EGEDMNGENNSRKRKNTGNTSSTQQPPQKRQRGKNAPISTKGKRLKERDNIGGFLLATIQ 
NDDRQVNVESIQKLLTARQRKYVKGGKCGDNGLPELLIEKVTNLLDSVFHFRKGSILNSI 
HANRRSETGVYTTKANCICDYYERNVSKDSSNNTPHSSECIERARERDASCAESNKRPCP 
VDSNNPEDVEQRMRELIMDPPSLSGVEDSLAIERVLQNEILFTSLVTNPIFNAVLGAEKG 
DLGRFIVI^NIVKFMNMTIACLVDGDMPMLLDSRGKTKNLLEKGTVKNTRKFFKPNMTAA 
ELNVATAQSAGHQYMNAGHCPEPGIKQMLLPDCIMKLKS IAMEKGRGGRSALHRQKCDHA 
FCKMLKCLFFNIDPSNAADTFIDPASRATLFRiDDLCRDRKKYKNIDWVKDLLDPVMKGT 
NKWVGTGEYTNIGRDSNVAAPVDFYTILKYTMIDDGVISVPSRKPNDVYYSTIERADDLL 
TESRDASCESYRPTLFDARAVLEVNGDGRVPYPSSETVEDLGGEEEEEEITGVIDDSTEI 
EDVQVQDSNLFDVELFDIPEIEQHQQGGEEEELPSAISEVFASLPADNDSSSPAHIPSFG 
NSEEGEKSPE P YN I FDSALDQLLDLI DS DGRNNNPKRVDWNS VTIQE 
(SEQ ID NO: 213) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 77 6e-13 

094 922 (094 922) KIAA0835 PROTEIN 76 2e-12 

Q9YTL7 (Q9YTL7) ORF 4 8 75 3e-12 

CENB_CRIGR (P4898B) MAJOR CENTROMERE AUTO ANTIGEN B (CENTROM 75 4e-12 

Q07034 (Q07034) RNA BINDING PROTEIN 73 2e-ll 

NAB 3_ YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 73 2e-ll 

Comments : 
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CT10002 

Nucleotide 

Genomic coordinates: 

Start: 20783 

Stop: 23729 (SEQ ID NO: 214) 
Amino Acid 

MEETITLKESTGTVSPFKNENITRIASNYVRAFTDTWSHLVNISGAPLTAEKNPSAIPAN 
E LNR YWTKTN VLCN PLFKLE DH I T R DE DTGT I T LKFKM Y I DDKNGL YQ S AVLMLAL DS FV 

slas fshgadlvsnksenkfcvki ph dtraesllnnvgfpaglsgpfkrws i nykaanls 
gksgidglsgsmltvlknntnkratdilhlvnnvsasaqqlddsemsrtfnhqkkvgvcy 
dinvsssrqvnqrnllhhqniigqhliefrtkqleraqnkkvkeeengeheemtseeeee 
edeyeeggclsdideedfyedgydeeegddnrtrkkkkmeedeedeeeeyddeedeeeae 
tcgangvidceddai i fpngqnskrkkngkktnikkrsrrkgecsantlsfvekyvgnck 
slgikpvgcpppsteftsl™kgseadscyntcqstrgasrirsllnkysvkdlmqvnsp 
sswkwanppdrrfvlfdkktkeevevkfeiecekseyfdwselpsnikvwlketakiik 
hlaliedflpamgaatpkiplnliktmtsifsvrdivgfkipeevlsfipiewktsisam 
gklsvqfdri ievidlmitngafatsclnnafflergwprdgsntwlhtdlvqlstsi f 
rsirnrgvniggnnntgsnssssscggnkgdygvrcglsiskrgitlkpppaamtnsssp 
sssamislpqptrqsidlsittiiqdfsevsgklrlnglqknmsdkskdvfndaiydsga 
fkalltctvndksrrkrkrrtllasgegwrrnln3vsqgndvndahqfqeecgikiggga 
srvykraqrrgsavssrrrvrnkpqftiavsdedddceeegdfsselnpthsqlllfqqr 
qqdsctedddvlvsveeynnrvsgssttagdrvlakdllstvspnekrnsaalaaltisr 
hslfnalsaktklgengrffl 

(SEQ ID NO: 215) 
Top Blast Hits 









Score 


E 


Sequences producing 


significant alignments: 




(bits) 


Value 


Q9YTL7 (Q9YTL7) ORF 


48 




75 


2e-12 


GARP PLAFF (P13816) 


GLUTAMIC ACID-RICH PROTEIN 


PRECURSOR 


75 


3e-12 


Q07034 (Q07034) RNA 


BINDING PROTEIN 




74 


4e-12 


NAB3 YEAST (P38996) 


NUCLEAR POLYADENYLATED RNA-BINDING PROT 


74 


4e-12 


Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN 




72 


2e-ll 


040947 (040947) ORF 


73 




69 


le-10 



Comments : 

TaqMan Primer/Probe Sets: 

5*start=l699 

5'stop=l723 

3'start=1773 

3'stop=1797 

5 ' prime r=ATGACGAGCATTTTCTCTGTTAGAG (residues 1699 to 1723 of SEQ ID NO: 214) 
Tm5=57 . 07 

3 * pr imer=TGCAGAAATAGATGTCTTCCATTCT (residues 1773 to 1797 of SEQ ID NO: 214) 
Tm3=57.23 

p robe 1 = ACCAGAAGAAGTGCTCAG (residues 1743 to 1760 of SEQ ID NO: 214) 

probelstart=174 3 

probe IS top=l 760 

direction l«Reverse 

Tml-68.95 

scorel=l . 95 

length=99 
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CT610 

Nucleotide 

Genomic coordinates : 

Start: 85707 

Stop: 83427 (SEQ ID NO: 216) 
Amino Acid 

MPPKHKPNTALKKHIIRNQQRKKEDDAESRFQRNMGQEVSKLDAPTSSKNRQRRKIRTSK 

II^RSGDCVAGDCSDLENDEGKRDTDQEGGGRGGGNEEEEEGKEEGEGEEQQREEKEEQS 

EEKEEKDGEEEEEENVEDEHVTPTTSVSKRAKQMKKHIFPPSKKRKRSDTESKALAVPAG 

KMMTVSRPLRGAITSGSILGVRSENAPQYDYVSYIADEAVVKEKAIQYRIRSLLANLLKA 

NKTKAFPTSSSLLSSEQGKKKFGGKRTNTFVVTNVGAEIiVKALLANSCWAISHRKDIRSG 

EIQWQELSSKILKSLNDGNATEINNLMSSIVEDRIQRTVKERVYFEQLATVCNNLFGTRI 

LPNKNFDKNFVSVASDNSNATVRGLSIPRYFRAINNNVWVKMSSTMDLLVGGGMRSKSEH 

SISMLEKCAAGVLARASARPVEB<MIKSAVEETSQAFNLSTGVFVPKQQQQQRQQQQQQFP 

PFQPPPFPLPPPQAPFQVQQPTYQGYLNPYYQYNQYYNPYAPQQLQQQYPLYFLGNQSQP 

PPQLQQQQQQQQPPQPPNNIPPPPTPQQQSPSNIPPPQQQQQQPFPVQLISSPPPPPPIP 

NTAPSPPISRVRFDSRSTTPQPPPTPVLPKPTPLPPPSTARAEEENATDMSFTDIDSELG 

SIDFDLPPATPGRNVEEI IKAQRQAVKETGVRGEEEEEEEAFI API IRQPRTPGNFRDEL 

LDVNES I YGS DI EPAAAAAAFDWDMGLDDLNGDEPYE FE 

(SEQ ID NO: 217) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P93794 (P93794) LOW-MOLECULAR-WEIGHT GLUTEN IN STORAGE PROTE 84 3e-15 

Q05573 (Q05573) OMEGA SECALIN PRECURSOR 83 5e-15 

AAD43602 (AAD43602) T3P18.1 83 8e-15 

004365 (004365) OMEGA SECALIN 83 8e-15 

048809 (048809) F2401.18 83 8e-15 

065375 (065375) F12F1.9 PROTEIN 82 le-14 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 34 to 361 of CT610: this 
corresponds to nucleotides 83630 to 83957 of the genomic reference sequence. 
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CT611 

Nucleotide 

Genomic coordinates: 

Start: 88938 

Stop: 85761 (SEQ ID NO: 218) 



Amino Acid 

MEGVTSIVAAAVPEVAILITDLMGGRNNKRSTYERIVGIVGESGDLLEAILDICNRNSYR 

DELLEGETVVINPTGLLKEISLLMKKALDMNIKMKSNDDPVPFTTLDQNEQEFIGHLKSC 

KKQDGPAYKDLIHRIYSGMFVMKNTRLMLDEIIRGNAGDAVEEKNALCEAYAEMISDMDL 

IRIFLLLVAIKRDQNKKHRHMKSVIYEDWVSLNTLKDVFHKEWYMWPFSALQVGTKIRD 

ARTFSVLFGSDMHEGRNNDRIWENMAFSVTEAFLSGPSTNNHYNKGHLRMYAARPVYDAM 

EYVPQELHHILFGTKIAKMIDIVYRYSIYWPYLLAADTERVEEPKKSVMSPSGLIISPN 

ASLLENTPLSLVSRHGIPSARKLGSFILEHENAENMHLEEAIAKCMVSQTLQEESWGESQ 

AAMVYQPSDEVEVIQAHVTKILSGNTTNKTCGLCYADLDMKPKFFNCSHENMKASYDYFP 

VHAFMDTFEARQErCSAKLCPDCTIKHLMYVYEKVSAGSEKLKDVFRCPCCGEYMVQFIG 

RCHEFSSLFERAILAGENVDPEYIAANKLLITELIKRAEKCFYTVELLQAEFMEMCKMDK 

DFALDKDSKFTVVDNRFRPPVKLFKMVEGETGDSKCSLICTQCLLPNVCDQPNEMEDIVT 

VDVPPPVLPYPPPEQLEDYYFQDVEDAEFDDPPTDELVRYDTGPGLHKWPMRLSCGFLAS 

N FVP PNEEVTNCRQAVS I LKRT PEKKIRGWN PESPEGKV LLALAN WH STDRM PENMKGLL 

NDISVIHNTRERFQNRVKVHYLNSVFGGFDDRDFEQWGVSIPLIATYFYVYEKLNHESA 

LGLWAKMFVKNLIGEMVLERPECVFHRAHSFVLHCVDRRALSGIRPNQGAKMEIVKQVNI 

VRQNMTSESIKDPVFT VDEKRTLEWKVEKEGQE I KT VKC PKCKTPN IKLGGC I TMTCYDC 

SGRRDGYPTVFCWICEDEITNPDHILIDHKLLYSDCKSTKAALEKVYNCTLCCLALRKCS 

DS YLSKQRGGGEEEE I E I YVME DGFEFDVHTKTAVPTK 

(SEQ ID NO: 219) 

Top Blast* Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04 637 (AAF04 637) HYPOTHETICAL 84.4 KD PROTEIN 14 95 0.0 

BAA78677 (BAA78677) HFB30 PROTEIN 44 0.006 

AAD21842 (AAD21842) ANDROGEN RECEPTOR ASSOCIATED PROTEIN 54 44 0.006 

094793 (094793) HRIHFB2038 PROTEIN (FRAGMENT) 44 0.006 

Q9XII0 (Q9XII0) F7H1.11 PROTEIN 41 0.042 

YKZ7 YEAST (P36113) HYPOTHETICAL 63.6 KD PROTEIN IN YPT52-G 41 0.056 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 829 of CT611: this 
corresponds to nucleotides 85988 to 86816 of the genomic reference sequence. 

Hit to public SBV sequence: 

gi I 6165655 |gb | AF099142. 1 : CT nucleotides 1 to 2738 match nucleotides 8582 to 11319 of 
the public sequence with a 99% homology, a score of 5398 and an Evalue of o.o 
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CT612 

Nucleotide 

Genomic coordinates: 

Start: 91607 

Stop: 89060 {SEQ ID NO: 220) 



Amino Acid 

mgsnqqqsfiskrngtkqeislekiikrienaclpvnqyvpkldknainpqelashimdr 
lpat i sfqemddfladyaktki vdhpdfgklagrficsni hkntkewns fsattqklrha 
i hpgtgkpasvvndtyyenvmanaeildavi dykmdylftcfglrtleys yli kigsptd 
rkkrilverpqdmimrvavgihgsdiksvietydlmsrhyfthasptlfncgtvtpqlss 
cfllglqddsiegiydtij^tuviisktagglgihfhdlrakgspisswsgthpglmaflq 
ifnvsvkkvsqggdkrrgaaaiyisdwhldvkdfidcrknagnedlrtrdlfpaiwvsdl 
fmervkagknwslmcphecpglsdvhgeefkalyekyeaegkgkevvkaralfdqinsar 
ietgtpyvcfkdtinrksnqenvgiikssnlcteivqysdseetavcnlasiavnkfvky 
spipslrpyvdyremkrwkimtrnldkvidvnfyavdktrisnmktrpmglgvqgladl 
ffklripfeseeaalinkrifetiyygaleasceiakekgetyelfegsplskgifqfdm 
gken i knrdi y fnslp i hdw eqlrrdi mkyg vhnsmfvapmptastaqi lgnses fe plt 
snmynrnvlsgsfqvvneyvireliklgewnsvtkqrimasggsiqtlpnipkstkelfk 
tvweinprttldmaiqrgmfvdqaqslnlfveepelskvrsmtmyawekgiktlyylrtk 
gaaravqftvdknvlqevkkeapspvaafsapvreeeeekkssiwpdpaaallcsinnp 
gacemcss 

(SEQ ID NO: 221) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04636 (AAF04636) LARGE SUBUNIT OF RIBONUCLEOTIDE REDUCTA 1720 0.0 

RIRl_SCHPO (P36602) RIBONUCLEOSIDE- DIPHOSPHATE REDUCTASE LA 807 0.0 

RIRl_MOUSE (P07742) RIBONUCLEOSIDE- DIPHOSPHATE REDUCTASE Ml 804 0.0 

RIR INHUMAN (P23921) RIBONUCLEOSIDE- DIPHOSPHATE REDUCTASE Ml 803 0.0 

AAD37491 (AAD37491) RIBONUCLEOTIDE REDUCTASE Ml SUBUNIT 801 0.0 

RIR1 YEAST (P21524) RIBONUCLEOSIDE -DIPHOSPHATE REDUCTASE LA 796 0.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 84 5 of CT612: this 
corresponds to nucleotides 89225 to 90069 of the genomic reference sequence. 

Hit to public SBV sequence: 

gi|6i6S6S5igbi AF099142.1: CT nucleotides 1 to 2547 match nucleotides 5913 to 8459 of 
the public sequence with a 100% homology, a score of 5049 and an Evalue of O.o 

TaqMan Primer/Probe Sets : 

5'start=1327 

5 f stop=1348 

3 , start=1383 

3'stop=*1402 

S'primer-GTCGGCATCATCAAGTCTTCAA (residues 1327 to 1348 of SEQ ID NO:220) 
Tm5=58.78 

3 • pr imer=TGCACACTGCAGTTTCCTCC (residues 1383 to 1402 of SEQ ID NO:220) 
Tm3=S8.4 5 

probe 1=TGTCCAGTACAGTGATTC (residues 1365 to 1382 of SEQ ID NO: 220) 

probelstart-1365 

probelStop=1382 

direction l=Forward 

Tml=69.00 

scorel^l .99 

length=76 
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CT613 

Nucleotide 

Genomic coordinates: 

Start: 94397 

Stop: 94175 (SEQ ID NO: 222) 



Amino Acid 

MVIRLCFLESITCFVYGIMAPLSLDTNTDYLSHKKDTNKKIQMQINFIPYSNMHVYIAGV 

YTFHEKKGLTYQQY 

(SEQ ID NO: 223) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

096794 (096794) AM I NO PEPTIDASE P 29 4.1 

YA2G_SCHPO (Q09706) HYPOTHETICAL 157.7 KD PROTEIN C2F7 . 16C 29 4.1 

024567 (024567) ESR3G2 29 5.3 

AAC36183 (AAC36183) PUTATIVE PEROXIDASE 28 9.1 

Comments : 

Hit to public SBV sequence: 

gi | 6165655 | gb| AF0991 42 . 1 : CT nucleotides 1 to 222 match nucleotides 3447 to 3668 
of the public sequence with a 93% homology, a score of 357 and an Evalue of le- 
101 



TaqMan Primer/Probe Sets: 

5 , start=4 8 

5'stop~65 

3'start=169 

3'stop=194 

5 " prime r=TGGCATCATGGCACCACT (residues 48 to 65 of SEQ ID NO: 222) 
Tm5*57 . 92 

3 B prime r^TCATGAAATGTGTACACACCTGCTAT (residues 169 to 194 of SEQ ID NO: 222) 
Tm3=58 . 55 

probe 1 =CTTTGGACACAAAT ACCGA (residues 68 to 86 of SEQ ID NO: 222) 

probelstart=68 

probe 1 S t op= 8 6 

directionl=Reverse 

Tml=68.82 

scorel=1.22 

length=147 
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CT614 

Nucleotide 

Genomic coordinates: 

Start: 129006 

Stop: 127158 (SEO ID NO: 224) 
Amino Acid 

MEYIGEKNNNPVSNESVSEKELKLRSSFI^IIGKKTSKYEQVMGVYEAIESIRQSELSEDT 

FWHVKKDKQLKFARGLKRLQELVEDDSLRIERISCAPPEPGHLFKDDAGHVTDEEWLAT 

QEEDVRKINTIVKEKLKRKDKDFKFSQLYRYMSNSLSEAVEKKHDCMIISSDFLIGLGFS 

TMNVTHALKSMERTMQKHGFKDMMVPLVEICHRTHYKGEYIANPIFKSHSSHCLIVPLFM 

VAGVFARSAHPSAASIEMYLSTLAYAVILYSDEKQRQIREELARKNLQIKEELENQVEKT 

TKVEKELETQWKTTKVEKELETQWKKEEYKNSYIETEQLFKVSEEQKESLRNVHKKSS 

NATFRYDSGSCLVFSISSTEFYLLCRTDKSGSFETATENGLRYIFSPINKKRDTAGMRPR 

LIMAVTGCDAPIACNDSIKHQNKFKVLKCNRSSIVFQTPPSDEDLKGIVQKVTGSDIRIF 

MNDGTVYQDGQRIDISSPQELDEENMTQFEIEQQRKLHSMMENTSKIVTRYNKERHLTTK 

EARTRNKTEKWFEKVKKREEQKKRENGEQSTSEQEQRGVKRTWENDNEFDSDVEEEEDGN 

NTQEQQRVKRHAI S V 

(SEQ ID NO: 225) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q06166 (Q06166) MATURE PARASITE -INFECTED ERYTHROCYTE SORFAC 54 2e-0€ 

066878 (066878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG 53 4e-06 

GLE1_YEAST (Q12315) RNA EXPORT FACTOR GLE1 49 8e-05 

035788 (035788) CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 48 2e-04 

Q14789 (Q14789) GIANTIN (GCP372) ( MACROGOLG IN ) (GOLGI AOTOA 48 2e-04 

Q92U69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 47 3e-04 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 2 to 976 of CT614 : this 
corresponds to nucleotides 127217 to 128191 of the genomic reference sequence. 
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CT615 

Nucleotide 
Genomic coordinates: 
Start: 130290 

Stop: 129405 (SEQ ID NO: 226) 
Amino Acid 

MEYIGEQKLINLLDETPEEDELQLRSSFLMIGEKQYEKYEEVMSTFEAVETIRKSEFRDG 
VFIVQLKENKHITFEGGLKELRELTGDNSLKIESLLSSIKPEKGHVIUCNTSTTTDDEWL 
ASQDKDVQEVNKLVKEKTRMLFRGFYFSPTYRYITKSLPQIPFGEKERFVVSTDFLIGLG 
FSADDVMEKLIAIEGNMRKSGLKYTWVPVAEVCHLKKYKGDIVVNPIFKSYHSHCLVIPL 
VYLGYMFSRNVQPPSLEVETYLLALAFAIDLYGREEMRKSCMRLCEDISEVKRG 
(SEQ ID NO: 227) 

Top Blast Hits 

Sequences producing significant alignments: 

Q13779 (Q13779) APOLIPOPROTEIN B-48 { FRAGMENT ) 
APB_HUMAN (P04114) APOLIPOPROTEIN B-100 PRECURSOR (APO B-10 
Q13788 (Q13788) APOLIPOPROTEIN B-100 (FRAGMENT) 
P96470 (P96470) IGA-SPECIFIC METALLOENDOPEPTIDASE PRECURSOR 
028789 (028789) SIGNAL-TRANSDUCING HISTIDINE KINASE 
CAB55172 (CAB55172) HYPOTHETICAL 77.9 KD PROTEIN 

Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 1 to 725 of CT615: this 
corresponds to nucleotides 1294 99 to 130223 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=296 

5'stop~318 

3'start=360 

3 *stop=383 

5'primer^TTAAGCCTGAGAAAGGACACGTT (residues 296 to 318 of SEQ ID NO: 22 6) 
Tm5=58.14 

3 'primer-TGTACGTCTTTGTCTTGAGAAGCA (residues 360 to 383 of SEQ ID NO:226) 
Tm3=57.87 

probe 1 =CTACTGATGACG AGTGGC (residues 341 to 358 of SEQ ID NO: 226) 

probelstart=341 

probelStop=358 

directionl=Forward 

Tml=68.84 

scorel=l-84 

lengths 8 



Score E 

(bits) Value 

36 0.34 

36 0.34 

36 0.34 

36 0.44 

35 0.58 

35 0.76 
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CT616 

Nucleotide 

Genomic coordinates: 
Start: 147517 

Stop: 144748 (SEQ ID NO: 228) 
Amino Acid 

MESIKEQQQQQPTVTFSEEPDQVYEFEDTTTTSAKKPTPSKAKFAAGRRMVSKQRRNTIR 

SPHTETVEEWGEEEEQQQQTPPEITPAEKKQQSLQELDALMGKVPAHLDVSVLAKSVAE 

FLENDEDEDEELEKNKKAQKSVLFNSVMNSGRTELSPSTFCDGCVSKVKSAFEGKDLVSN 

IVKVEGEAVKKTAIATDTTKLANIiFI/SCMNLQFHEHVTIETLNKKALDKGGPLFTLKLSD 

AVYVDEMDLEKKRQIFGSNGDKSLFKELGGNYIDSAIKSTGLVMSTPSSSSTKKAGTHFK 

TTNQIVEESVTESMRNGCCCFKNDKWLAKRESNLKSLNNTVFGEEDDEKSAYAYSDSEDE 

DEDENEEEVDYDYNNETIESSVGNVIKNLIRKTIGLSDVEEEKEEGEQSEEEEEDSDDDD 

DDASSVCSSSSSSSSVTWAAAEEEEEEDEEDKDKDTATWEDEDDKESVISSSSEDSEE 

DEDDDGATSQCSEVVFGDVTECEFDESDGNPLYLASDNSFRPSASVTKYPQSEEEMDVSL 

LSKNRSTPVCLSLCRHSSGCITNSFNMSTILKSIiKLFPAGTEAAEDCVHIESTKKKDEDE 

DEEDQGLDLQNSQYYSVLVDVDNLIIFSMGSTTYESSMVEVDYDKSFWSSFDKSVKPYCE 

SKKSALINALCEDNVTAKVYATVHTLAIPFCESMPINHINNTTPYGSYKTFRISLPGNFS 

GQHNDINNNWRSDMYTKMVENLLKREWENKTHSRRYVRNLIVDGGVGENSGNYLKVHEN 

NEDI FGSIEANSMSAKTAAAAFKNVAKKCDLIQTTTNDILTGPFKQYLI DYKYNS ARKN I 

IMEPCEGDETTAHEMKRAQDAYKQALHRAKITASSISLRGIWHEMITRDMNTTYNSMFMY 

IPDFYKYVQVSPVNVSPLYMLD 

{SEQ ID NO: 229) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 4 76 to 1 of CT616: this 
corresponds to nucleotides 145115 to 145590 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1507 

5'stop=l526 

3'start=1554 

3 , stop=l575 

5 1 prime r^GAATTCGACGAGAGCGATGG (residues 1507 to 1526 of SEQ ID NO:228} 
TmS=S9. 36 

3 • p rimer =TGAAGCAGATGGTCTGAAGCTG (residues 1554 to 1575 of SEQ ID NO:228) 
Tm3=59.21 

probel=CTACCTGGCTTCAGACAA (residues 1536 to 1553 of SEQ ID NO: 228) 

probelstart=1536 

probe 1 S t op= 1553 

direction 1 -Forward 

Tml=69. 02 

scorel«l . 97 

length=69 



Sequences producing 
Q9YTL7 (Q9YTL7) ORF 
AAD56625 (AAD56625) 
NUCL_CHICK (P15771) 
GARP_PLAFF (P13816) 
IE68_PRVKA (P24827) 
NUCL MESAU (P08199) 



significant alignments: 



Score E 
(bits) Value 



48 

NUCLEOLIN-RELATED PROTEIN NRP 
NUCLEOLIN (PROTEIN C23) 

GLUTAMIC ACID-RICH PROTEIN PRECURSOR 
IMMEDIATE- EARLY PROTEIN RSP4 0 
NUCLEOLIN (PROTEIN C23) 



85 le-15 

77 6e-13 

76 le-12 

75 2e-12 

75 2e-12 

75 3e-12 
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CT617 

Nucleotide 

Genomic coordinates: 

Start: 148612 

Stop: 147766 (SEQ ID NO: 230) 



Amino Acid 

MSPVISQQSSPSATSTAAARIISTANLRVLGVKNKEEKDEEEEQQEVEPEIIEPATDFEI 
PFSPALTICIYINANRIHINSKGVCLNRKKIKPTSTINKNQDVPPELANASSYLVQTEHV 
TDKFLSSHCSICNYNVNDGEYKSALSTTRNGDQPLMRKSVRYVPLNEDNVVVQKGTYYGT 
TFIPEKTGRRILWFSHYKKSPRPITAKLCCLLETINSFNGSCSSSSSASSSSNAPGPIEE 
FQVSSS I FFKKEECCPLQMKWVEQNELDAESPVLVLLMLAL 
(SEQ ID NO: 231) 



Top Blast Hits 

Sequences producing significant alignments: 

Q922U2 (Q9Z2U2) ZN-15 TRANSCRIPTION FACTOR 

Q20150 (Q20150) F38B7 . 5 PROTEIN 

094002 (094 002) SEC12 HQMOLOGUE 

Q204 97 (Q2 04 97) F47A4 . 2 PROTEIN 

013779 (013779) HYPOTHETICAL 71.2 KD MEMBRANE PROTEIN C17G6 

P93002 (P93002) REGULATORY PROTEIN NPR1 



Score E 
(bits) Value 



36 
35 
34 
34 
33 
32 



0.24 

0.70 

1.2 

1.6 

2.1 

3.5 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 741 of CT617: this 
corresponds to nucleotides 147819 to 148559 of the genomic reference sequence. 



TaqMan Primer /Probe Sets: 



5'start=363 
5'stop=383 
3'start=446 
3 f stop=4 67 

5 1 prime r=CGACAAGTTCCTTTCATCCCA (residues 363 to 383 of SEQ ID NO: 230) 
Tm5=58-62 

3 * prime r=ATCAAGGGCTGATCTCCATTTC (residues 446 to 467 of SEQ ID NO: 230) 
Tm3=58 . 15 

probel=CGGCTCTAAGCACAACAA (residues 428 to 445 of SEQ ID NO: 230) 

probelstart=428 

probelStop=4 4 5 

direct ion l=Reverse 

Tml=68-92 

scorel=l . 92 

length=105 
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CT618 

Nucleotide 
Genomic coordinates: 
Start: 150145 

Stop: 148675<SEQ ID NO: 232) 
Amino Acid 

M ESVRDVKFYT FMN V LAEKAKKI QRLN KDKGWRTS I N AE I GY GGARLM DVRFTGRKSMDE 
LARCLYNCDGEYTTLRLVGSSAGNI I VYSLAFIMGIRGECCGFNVNNRLRMGKI IDRELF 
YKITGLNFPETVKCTCDGVRAICDLFLEVAALQEHPAWHETKEVGKKQQQHFNEFGSQYP 
GTKFNKRHKLSTKI I QQMFS EEKTMEQVLAFSEGTAASGFSDLYVEAP IQYVVNMYRAI S 
KMEGRVGAMYNLSRVLILLCSRWEKKPGYKNDFYSKCEMYIGSKKIVDDESFTFTDLITG 
DLVPLVRLAPSNEDI QRDVI RFNDST D I LMDS I DVRDWLPVLSKI I WQNVSARLfCLRNN 
KSLSKLAKWKWNGMVSTHDNFDSNDYVIEHKRQLAADIMSDSLSKNHLPNFSKTITEYDE 
KE^KTTPLICWNYIFELSPMGKHLFPLEEVCGFYEASLPLITPWQLKVVQKKRGRQMVIY 
GPRKRPRTQ 
(SEQ ID NO: 233) 

Top Blast Hits 

Sequences producing significant alignments: 

Q64568 (Q64568) ATPASE (PLASMA MEMBRANE CALCIUM ATPASE) 
ATC3 — HUMAN (Q16720) CALCIUM-TRANSPORTING ATPASE PLASMA MEMB 
Q12019 (G12019) HYPOTHETICAL 559.3 KD PROTEIN 

Comments: 

TaqMan Primer/Probe Sets: 

5 f start=676 
S , stop=701 
3'start=823 
3'stop=84 9 

5 ' primer =GAAGCACCTATACAATACGTGGTTAA (residues 676 to 701 of SEQ ID NO: 233) 
Tm5~56.76 

3 "primer=GGAGCCTATGTACATCTCACACTTACT (residues 823 to 849 of SEQ ID NO: 233) 
Tm3=57.26 

probe 1 =AG AGT AGGTGCCATGT AT (residues 733 to 750 of SEQ ID NO: 233) 

probelstart=733 

pr/obelStop=750 

direct ion 1= Forward 

Tml=69.06 

scorel=l . 93 

length=174 



Score E 

(bits) Value 

37 0.28 

33 5.4 

32 7.0 
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CT619 

Nucleotide 

Genomic coordinates: 

Start: 169814 

Stop: 165116 (SEQ ID NO: 234) 
Amino Acid 

MSTTQTQTIERPLPGKNNEDNSRLACLIAEGLQQQQQQQDGDSEISLPLVNAGTFACYDS 
TIANLTEGRLGSETENAKIRVKIHPSVFIIETNKEMTIEEISTKSLNALVEKRAREARRF 
SSLTEQKFPRGGGGCYSRKNERFIEGEINNIKLNMEETASSLERLAGLLPWINIKDWTM 
HDEKEIRLDLKGNDGMEELVNISHLNQEEWEMERLSSSIVLKDAYGVFYAHHGILDIVLT 
TSRFTGKLLQHPVIFRLMDVKVWINTPLQIAFPDTSKNPNAKKILYQHPSLTRLRDLNDM 
ASNSKSVSSIIIPELSKFNSTEFC^fHYFTAQCFFGKNTNSLKDLVTRYYQLSFKNKFQPK 
LYEPTATATAASSSSSTASLTTEQKEKIAQSILSSKGKSLGDVSSTLSKEYDENRKRTKR 
QKTSTDTNIVPSGAPTSISMKNPVTCFFGPQYTSIMDCISEKTDWIEMHLFLTSLNDAEH 
NKTLWDRKSNVSEIHDSGRFLTFGQNNTTAFIPDVDIPTLKLILRDDSGESSAAIIASL 
IYYNNVNLEGREFSNVSDAWGLFSGGSAITVGDIAREIASIYNIGRESNCDSILFPGEP 
ILAGRRSYGRQYRWYDPINCWGLYRSCLETMTRNIMRGQPVKVDETAWMYMHQQVLQVV 
LLPFFDCVLKSGVWAVKEARQLTDYIVREVLLKYTADPDQHKFLLFKKPVMDLIAKIVTH 
YAVTHSAADNGGVCLAFPRDPPFIVENDTSLRYYTLT DTPQS I LNGDNVAENLKSATSVA 
SSPSSSSRYSSETPIRWNLPVPTGRFLKMNKDLELFINVPLISSKEQKQQQQQTTATAP 
FSSETISKSFLNYVPPKSLTRNVTYGQNIAEDGFLGLKNKGELVSYFKVVKNTERDDGIK 
DMEIGDINNHQDDTGSLSSSSSSFVDGVRTSFSVDGKIEHVSAFLPGTTSQPTNLPVHAS 
KQVKYSVKELGMETVFFEPLLSSAVLYEASKTKSTQHLSPMRIYKECVS PLSTGRIDIFP 
SKVGTVAGTGFEFIWKVLQYDTGLPTTLERLSPKIPSVPISGEDSKMEVIAESGKGVQNI 
IAIAADQLRGSNNIVGGGTRRAIQQQQQQQQQETQAVVPVNVPARFEPTFTEIELFLQNK 
B*RNVIATIISRMMMLVSNEEMKIIKEVCEHVSHIMVDGLYVALDPRKAIEEILERITAEQ 
NGI T I DTGNEGYGSLRYASSGRLFI NDEASEE AAAA I GGGGALGTGRRVPVELRS ILDKL 
NTIGSTTQQQQQQQRQQRQANNNNTVPEDIKVHNEQMQKIRDSSLFTSKLLNYIRDDGRK 
DRIKTNI SETLKKYSRI PSYFI ASKAQKPI PWKHTKDN I NLNKI PEDLNFSPAQNLFVPV 
NPRHILTDMQWLNCISIIETATRDSAIVMQSFQEQADKTTTQLEELLSQWNNIVSQVTDE 
KSPAYVSSVKLEWLNNEASRIAAIRENSEKSKIVMGVQGKIVNIDELGIVAVARSIVDVD 
FYIKMPNVWASRDWKNLIYYAVNI7UVTPLINNISRGIMAASQTSVLYDSSLALIAAEQAT 
RNITM 

(SEQ ID NO: 235) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

ST20_YEAST (Q03497) SERINE/THREONINE- PROTEIN KINASE STE20 { 46 0.001 

YN23_YEAST (P53832) HYPOTHETICAL 52.3 KD PROTEIN IN MRPL10- 41 0.047 

Q24523 (Q24523) BUNCHED PROTEIN, CLASS 2 ISOFORM (SHORTSIGH 40 0.11 

NIT4_NEUCR (P28349) NITROGEN ASSIMILATION TRANSCRIPTION FAC 39 0.24 

062235 (062235) F36F2.3 PROTEIN 39 0.31 

MYSG CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC 38 0.53 
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Comments : 

EST confirmation of the predicted transcript ; 

An isolated EST has equence identity to nucleotides 234 to 1099 of CT619: this 
corresponds to nucleotides 165434 to 166299 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

Vstart=1920 

5'stop=1941 

3«start-2022 

3 T stop«204S 

5' primer =GCCAGTGAAAGTGGATGAGACC (residues 1920 to 1941 of SEQ ID NO:234) 
Tm5=59.57 

3 * primer ^AGTTGTCTAGCCTCCTTTACAGCC (residues 2022 to 204 5 of SEQ ID NO: 234) 
Tm3=58.24 

probe 1=CTTGGATGTACATGCACC (residues 1943 to 1960 of SEQ ID NO: 234) 

probelstart=1943 

probelStop=l 960 

directionl =Forward 

Tml=69.03 

scorel=l . 96 

length=126 
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CT102O 

Nucleotide 

Genomic coordinates: 

Start: 62990 

Stop: 63659 (SEQ ID NO: 



236) 



Amino Acid 

mewinqrtsredl™tytgnavirsaakqalaiekhaaerrgekawttsaaaaassnfnn 
vqqdytddditqvsiansvlnnpflkryaklidnlaisslppdieddviihtrdasnstv 
rvdganiyfaiidgdlcvypkqyisdkvlcgslnrekalfynssknkwtygcnlnfdivd 
aaimkhpdykeettstkhirkilgigaseklnithylnyfiq 

(SEQ ID NO: 237) 
Top Blast Hits 

Sequences producing significant alignments: 



Score E 
(bits) Value 



P75338 
Q80990 
Q18668 
Q80991 
Q80985 
Q02689 



MG307 HOMOLOG 

LATE MINOR CAPS ID PROTEIN L2 (FRAGMENT) 
HYPOTHETICAL PROTEIN C47D12.2 



LATE MINOR CAPS ID 
LATE MINOR CAPS ID 
HYPOTHETICAL 45.6 



PROTEIN L2 
PROTEIN L2 
KD PROTEIN 



(FRAGMENT) 
(FRAGMENT) 
IN COI INTRON 



(P75338) 
(Q80990) 
(Q18668) 
(Q80991) 
(Q80985) 
(Q02689) 
Comments: 

TaqMan Primer/Probe Sets: 
5'start=331 
S'stop=354 
3'start=379 
3 f stop-402 

5 ' primer ^CACACTAGAGATGCCTCCAACTCT 
Tm5=57.74 

3 • pr imer=ACCGTCAATTATGGCGAAATAGAT 
Tm3=58 . 84 

probe 1=CAGTCAGAGTAGATGGAG (residues 356 to 373 of SEQ ID NO: 236) 

probe Is tart =35 6 

probe lStop«373 

direct ionl=Forward 

Tml=69. 01 

scorel=l .98 

length=72 



32 
31 
31 
31 
31 
31 



2.7 
7.9 
7.9 
7.9 
7.9 
7.9 



(residues 331 to 354 of SEQ ID NO: 236) 
(residues 379 to 402 of SEQ ID NO: 236) 
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CT1021 

Nucleotide 

Genomic coordinates: 

Start: 69264 

Stop: 76212 (SEQ ID NO: 238) 



Amino Acid 

MSDTGQMEENRPATQKRRPGDEEEEETGSSNVPYYANFGDDATYSMYTGEGKRGKFVLEP 

PKERSVQRVQKPPKEKEEREQRSNVRTRRPGQEFEQKVLQDRSRERSEKLGQNLAEKGLQ 

ERQKKYTPKVAQTMTKKIIRFREGGRKFKAPQQQTSDKGAATNVLEREEIEMAAEREQPV 

EITGDTILGGLGEEDDEDMGEDELTIQHSSMAVSQPVQQIVVSSPIPPKPTRPAPDIPIQ 

EDIVGKNISQLPPLPLDDYEDEEDEHLYEEVNDFLVAPPTAAAAASTRPPRPNIPPPPPP 

WAVADETLKNLASIAALEKEAEEQRAAAVEREREVEEQRAAAAAAAAAAAAQREADEKR 

EREAEEQRAAAAAAAAAAAQREADEKKEREVEEQRAAAAAAAAAAAOREADEKREREVEE 

QRAAAAAERE I LAQQLQEMKEQMRIKEEERRKELADKEEEKRRELAAKEEEKRQEILAKE 

EQLEKLNFQLGTEITSKRALEQMLEEEKASRSRSAASAQLAIQAIEYEDELPQAVEPQGQ 

LVPMDTDLYGKMYDLNKKLEVQNNTLTSAFEDVNKTOEQNQLVAQSLEKSAKAIEKLTSQ 

KRLPVDDPAFMQRIITERDFSLKNLGNVYKRVLGVYFTLKRDLFKSKALITDKESRDLEV 

RLTDVSTDLRANDLNTILERLDVSVNIRSGGTLYTKFTEADTALADQVPSRIEISNRSRS 

ALLPFSSAGLDTNFTNSSDKYNEIVNQLSSINEAMNILKENIVPTLNQIKIDVTNLLTVS 

SSRQYAIEERVYSDVSRMDSEI RKFLAIMNSKI SPYFKGDWTDERQRS I ADSI SSQI KSN 

DKIKESVATLHDINTTSRIRSNPLLHKSSVLSSPDFLNAVNDFRNFLDIQGGSQFTYDVL 

SGQNIDDLSLASKTTEKVTELCLELSIILDVIHKNALSLNLPAITYPAGETSMEESGSLA 

VDIRQEIGKNISDSSAELSRTLSEALQIFQOQQQQQQOQFQQQLLQQQQDQQNQQQLLQQ 

QIEEQQRVQEQQQQQQRDOQOQEQQQREQQQQQQQREQQQQREQQQQQQQQQSDQFRQQL 

LQQQQQFQQLLQQQGRRRGGDDGDEEREEREEGAEKDDCVRKVAESVATKYTADLTTLFQ 

REENNFQSKIASAKLGTLVFATPPSPIMNLTKLREEYSTFTTQCFSKLTAENNSIMRIFP 

ERIVEVCKSKNLNLMGKYLYIITTAQTEMEDRVECNILSGIFNQIEEFSNNVKQQQQQQAA 

SASSTNPPPPSTPSTTPPVTSMQVCELDDQRTLEKAAIVEAITLANAVLQTTKSASAPST 

AAEREIALKLENGKTSIRMEKVDLSSGATGVSDQQKWIDESTSKQELEDFIAEENFVETA 

HNEMDIGLILDAKKNDPTRDANLRLVKPHGINVQSFPYYVLRTWLGETDILDEDTVHPEY 

FRQYIDRNWKVEEHEREDTLKALGVSLSDTLAHIKDYYSPSVKNDASKSVPFALNTLLYN 

IFAIDGGMISSLSRTAFIYRKFLRQSMTDKEVAQGPVRSQLCEATIASLFTACSNLLRSS 

PIADKVEPRLQEKLAAAAAVDTSTGDMFRIRVCHLMYNFIVAYVNLCNNRINYTLNVLRA 

SGLANKKVVAGKTTKGHTSSSHRFGSYDVTYDFSVLYKILQLQKQNISLLLEKGFNAWES 

CVAAMAAFTADPSLSISDADQSILFPLEGGEIVIEKHENDAEKNVDMVQELWKETALTLM 

AKELNSYYNWFIYSKDTDMEKLARVCRMIIGIVKAVLRLTNKAESLVDTNALSDIFKLPV 

I P I D DTKT LA I NI W FT LNN V I KP WM V S FKQM FRQKDGGVS S AYFS FQN I QQQKHQQT AS 

ILDAWACAPGKLTKAAHVFI SGYENH 1 KLKKDDLLWGASMKFPADGRGTVVEGWAQQYNN 

ESVLEDFTDFSIEVNAPASGLLIPPDPLLSSMFGKGNGGSSSSSSKDNTIIGKGGLILNR 

QVVGQEQAPPINTSSDTKKIRRDANIEPIIGTPYSVIKASKGVSISVLDDFNEDSPEDFA 

LKTSIINDAIREIGQRMTYTRPIFDHQTQKNIHYSSPKIILEGSDLKNGQRSGQSWAPSS 

SSLTLASDWNLPSLELLYRELATKQVEKEEEEKSEREEDKGQKLNEKLSFVVNKAIGTIQ 

QQHQYSERGGGMKRYQQHSADQASNGGIDDIELMNSKDATSMRKAKLALAVTNKIAAAAA 

RDGENSSAKPSNFGNRLDEAINPGALLLRRGGGVRGGQTPQSSMLTMFRPGQTGGNSSWW 

TTNTPLIQRTTSVGNNLVVLVPNLLDSHPPTFN 

(SEQ ID NO: 239) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

076853 (076853) SRF RELATED PROTEIN 136 2e-30 

AAD50121 (AAD50121) ADENYLYL CYCLASE 131 4e-29 

077033 (077033) TRFA 120 8e-26 

AAD4 6501 (AAD4 6501) LATENT NUCLEAR ANTIGEN 120 le-25 

088542 (088542) OPA- CONTAINING PROTEIN 1 118 4e-25 

Q62006 (Q62006) OPA REPEAT (FRAGMENT) 118 5e-25 



FIGURE 3, Sheet 128 of 160 



WO 01/38351 



170/201 



PCT/US00/28888 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 462 of CT1021: this 
corresponds to nucleotides 75184 to 75645 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=3774 

5'stop=3799 

3'start=3851 

3*stop«3872 

5 ' prime r=AGCTGCTTCAGCTTCTTCTACTAATC (residues 3774 to 3799 of SEQ ID NO:238) 
Tm5=57.20 

3 1 prime r=CGTTGATCATCCAACTCACAAA (residues 3851 to 3872 of SEQ ID NO: 238) 
Tm3=57 .20 

probe 1=CTGTTACAAGCATGCAAG (residues 3833 to 3850 of SEQ ID NO: 238) 

probelstart=3833 

probelStop=3850 

direct ionl=Forward 

Tml=69.00 

scorel=1.99 

length=99 
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CT1022 

Nucleotide 

Genomic coordinates: 

Start: 79064 

Stop: 83375 (SEQ ID NO: 240) 
Amino Acid 

MDFEGTTSSTPSKMSQLYSSVKKVAEHSFANLHDKATIJVSKVIKDLEGERKKMSTPKSSS 
DGQKLDKAMLDDIINEYQAVKSTADNSIESTIKEIENVLESVRRTKIESEAKNSVTSSPE 
KVFSVEDI£IYSKGRVCKGLKLNANCSRIGGKYAVSMSIKKHNVSSFENNNNQVFSEEPR 
DCFMLETTYPLVGFETSTEDGNTYAVFLTGVGLERSLPKYVPVFDMNAGIQTLNMTGLRM 
AKLPVLCMFGRTEYDNLEDFYITSIETQSFDEEENDARMRCHTEDLERKKRMNDAPAITP 
HVAVYDYSGDGKEQLLYMITEYENTASWCNANGVVTSDSGFSNECAISDMNDLCCFADCI 
DVTVNNEEHEERSMNIVVESDRRLFDASPSPIKTEEDGENSSSSSSSPTVPPPTPYEGNA 
VVEGEEEEEEIDEDESSKYEGSEDALVMKKLAKLSTMKQMRRVKNEPALKITSGGNNSSS 
SINNEDE)GDDDDAVDATAIjCPQTEATVKNSFMAPNDERTENILYETMQISLAKICNNPSS 
MSSYRVFTNKLQECLNTMDDSIRRRPTIWTEESQQFAKGLLFDEVVTSIVAHQMAQDICK 
SEIFGGMFNANSTNIKGKYEGQKKSLYGNKHISSSCFKTNTESNVNNALFAWVKSKLHSG 
TVIPNVFSFKMASEKPSKMKRKRTSSASSSNDEHQEPSTKMMKNDEGEKVAQESSSPSSS 
STPEQQQQAGHDKETINLIPLSFIKMPRSNVNGSASYLSEIFGQRLCGLSDASSTFKRMC 
KTFEDLENEIMRSSFTRLTRYEREVTRLYEKCRSQAVDIEENEMDVLSHQGELFAEFLED 
PIAYFEEVLENIKSWSLENVNTPKRKNKYAKVLVSVNAIRRTYEEYHAFSKFVPMFLFNL 
IKRELEGDNYTHDVHFSSTCLWYLTVMTRNRICDVLQYINNNNNDNEETDIVEEEEEGEG 
EEDKMEESMDVEQQKQVRKGGRKKGQKFNSIGDQVIRKFVKSLCENSMVVSIAINSLISG 
ISWMNKKIPPGFLKDSSTINTLDEVSRFVFSDVKINRKINGTDDKYETVFGVSTRVDSHI 
VGPFSIPVDFSSAGLDKASCGKLYVNTIDGKGILTISPKYDSLNDEDVDSTTTDKLEKDI 
LHLSKHDTFFNINKNKVLPFYNISPSSSLTEKKKTKFNRKKISSGMSNNNGMCVQTPSSS 
NSVSSVSSIVAPSSSVLALSCSLSSTKKKSIWNENMFLTSRNMWRCGFWPPKLCSFIVN 
HRHAVKLVAETAPKTKLCRNI I DRNRKIRFNGLKKVCKSVSAFTGESTYLLNKNMTATSP 
S DLNLCI YTSSLNDPLYTCKLTHEEYQDGNALDDYGAV FVNYT FKSI KSCSSKDETADDN 
AAAADDDGSTTSTSSSTDTDAAAIQDFMHVMIKKIDAMKDIRGKYKKSLAKKTKKH 
(SEQ ID NO: 241) 

Top Blast Hits 

Score* E 

Sequences producing significant alignments: (bits) Value 

UBP1_YEAST (P25037) UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 1 50 le-04 

097236 (097236) PFC0230C PROTEIN 48 3e-04 

CAB43859 (CAB43859) HYPOTHETICAL 85.2 KD PROTEIN 4 6 0.001 

NSR1_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 44 0.006 

Q21000 (Q21000) SIMILARITY TO C. ELEGANS MYOSIN HEAVY CHAIN 44 0.006 

Q18918 (Q18918) CODED FOR BY C. ELEGANS CDNA CM11B12 44 0.006 

Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 1 to 1010 of CT1022: this 
corresponds to nucleotides 82212 to 83222 of the genomic reference sequence. 
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CT1023 

Nucleotide 

Genomic coordinates: 

Start: 93228 

Stop: 94137 (SEQ ID NO: 242) 



Amino Acid 

MVSSITHLSLLFWAVVASWFTTEGASVRVKROWSPCPDVIDPDHRCQGRLCRRSTRG 
GDDDDDDDDGGTFDTVGSGILGRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KRAAPPPEDEEEDEFYRKKRAAPPPEDEE^DEFYRKKRAAPPPEDEEEDEFYRKKRAAPP 
PEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEE 
DDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KR 

(SEQ ID NO: 243} 



Top Blast Hits 

Sequences producing significant alignments: 

Q18401 (Q18401) COSMID C33G8 

Q9YTL7 (Q9YTL7) ORF 48 

P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN 

Q43687 (Q43687) EXTENS IN— LIKE PROTEIN (FRAGMENT) 

Q09085 (Q09085) EXTENSIN CLASS II PRECURSOR (CELL WALL HYDR 

014686 (014686) ALR 



Score 
(bits) 

113 
98 
95 
8ff 
83 
77 



E 

value 

le-24 
6e-20 
7e-19 
7e-17 
2e-15 
le-13 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 214 to 1051 of CT1023: this 
corresponds to nucleotides 93435 to 94272 of the genomic reference sequence. 
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CT1024 

Nucleotide 

Genomic coordinates: 

Start: 94623 

Stop: 95742(SEQ ID NO: 244) 
Amino Acid 

MDNLKGEFVALKTDLTHYKTQLDRSILVFVDVVGRLYVIWSEQTAKKEGLATRVAKQAT 

EIQQFKDEINNKYHALTNTLDDIIYIFDHGGSFKRAKHKAIIEAREYSKPLRELECMFTR 

IADMLTLTFMTVYTNIITEFRHSSEQATNSINVTLGRLFLCDDLCNQLPKEEEEEEDLKQ 

KFITFHANLYMLDTRLKKDLIIFBCDVIQQLHVILQKDTYAVKEGVAIRCAKQMNEISQYR 

DNLKDNYNTFSNILNEIVYIFDHGGHFEEVKHKAITLTRNYLKTLMGLKCMFKRISEMLS 

LTFLTVYTNVIAEFINASNISDREINNYLVQLVTCNELCNQLPKPKQYRPLSLIDNIAYF 

SLSVQKHLSGFL 

(SEQ ID NO: 245) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04 635 (AAF04 635) HYPOTHETICAL 43.2 KD PROTEIN 7 47 0.0 

096219 (096219) HYPOTHETICAL 139.4 KD PROTEIN 39 0.051 

064554 (064554) YUP8H12R.45 PROTEIN 39 0.067 

TPR_HUMAN (P12270) NUCLEOPROTEIN TPR 37 0.20 

Q99968 (Q99968) NUCLEAR PORE COMPLEX-ASSOCIATED PROTEIN TPR 37 0.20 

097291 (097291) PFC0960C PROTEIN 37 0.26 

Comments: 

Hit to public SBV sequences: 

gi l6165655|gb|AF099142 .1: CT nucleotides 1 to 1119 match nucleotides 3220 to 
2102 of the public sequence with a 100% homology, a score of 2218 and an Evalue 
of 0.0 

TaqMan Primer/ Probe Sets: 

5'start=478 
5 'stop=4 99 
3 'start=576 
3 "stop=598 

5 'primer =TTGTGTGACGACTTGTGCAATC (residues 478 to 499 of SEQ ID NO: 244) 
Tm5=58.28 

3 ■ prime r=AATCTTTCTTTAGGCGTGTGTCC (residues 576 to 598 of SEQ ID NO: 244) 
Tm3=57 . 55 

probe 1=TGCGAACC TAT ACATGCT (residues 558 to 575 of SEQ ID NO: 244) 

probelstart=558 

probe 1 St op=57 5 

direct ion l=For ward 

Tml=69.00 

scorel=l . 99 

length=121 
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CT1025 

Nucleotide 

Genomic coordinates: 

Start: 95824 

Stop: 97369 (SEQ ID NO: 246) 



Amino Acid 

MDSCCLISRITPEIAGKLTfflFIPENNFKIVQNSLPDDQVISQFRYFDHRHCYTFMEILM 
ANIKIQDRKQNTTAICELTTGREGLLCRRTI PVFLGSEEKREELLGNLPEGAEI FRPREV 
MQVIGTLLDKKLEIDDGIASVKAALCAGSSSLYLIMSHIVKMTFSAITNMKDINEEYFVD 
FIFRHKQFLNPEFFKHLISLLKNSRKEHVAHLVRRLEHFLMLWTLSKMRFTEMEENYFPI 
SSDSDYGICEKCARKTPKYKLRIFRERKCCDRCCRLYHQQPPPEVYNWDGKITQQSNKGY 
INAGDEI IGMLNSNDKGKTFPP I PKMV VRRVVDGV YGQGTILSKILKFRQANIPTCLFVT 
CNKCNRIFRLTILGPTRNILCPPCRKKSVAVNTQQKGENKPSFVQKGTKRLRVDTGSNKN 
TLEKFCSWERFNTEVLLPWLGYTIESKWQNWESFLGYSSTRYKELWAFVNKQEISSMKDS 
YIKIEDI DQLLRS I LQDQKG VFET VCK IKSR DGL 
(SEQ ID NO: 247) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04 634 (AAF04 634) HYPOTHETICAL 45-2 KD PROTEIN 689 0.0 

STA5_MOUSE (P42230) SIGNAL TRANSDUCER AND ACTIVATOR OF TRAN 32 7.5 

AAD20715 (AAD20715) PUTATIVE DN A J- LIKE PROTEIN 32 7.5 

CAB60246 (CAB60246) HYPOTHETICAL 80.9 KD PROTEIN (FRAGMENT) 32 9.9 

Comments : 

Hit to public SBV sequence : 

gi|6165655|gblAF099142.1: CT nucleotides 1 to 1545 match nucleotides 2019 to 477 
of the public sequence with a 99% homology, a score of 3031 and an Evalue of 0.0 

TaqMan Primer/Probe Sets: 

5*start=672 

5'stop=693 

3'start=765 

3'stop=7 86 

5 f primer=CCTTTCCAAGATGAGGTTCACA (residues 672 to 693 of SEQ ID NO: 24 6) 
Tm5=57.77 

3*primer=ACGGAGCTTGTATTTGGGAGTT (residues 765 to 786 of SEQ ID NO: 24 6) 
Tm3=57.84 

probel=TCCAGCGATAGTGATTAC{ residues 721 to 738 of SEQ ID NO: 246) 

probe 1 s t a r t-7 2 1 

probelStop=738 

directionl=Forward 

Tml=69.03 

scorel^l . 96 

length=115 
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CT1026 

Nucleotide 

Genomic coordinates: 

Start: 97547 

Stop: 98789 (SEQ ID NO: 248) 



Amino Acid 

MESIKLFTVAGLNMEQANQVAEEIKSEYKTEEEKRIAQEVFDKFTKKLIMQVDTSKHLLT 
RENPNRFVSRPIVHEDLWEMYKKEVACFWTLEEIDFERDPKDWEKLTQDEKDFILQILAF 
FASSDGIVIENLTTRLRQVAQIPEARSFFDFQVGMESIHGNVYGELIDRLVPDEKDKAIL 
FNAAQHFPAIKKKEQWAINWMQSNNDLAELIVAFAAVEGIFFSGAFASIFWIKNRGILPG 
LTSSNEFISRDEGLHRDFACMLLKKGFVDTPSRERILEIVTEAVRIEQEFLTVSLPVKLV 
GMNCKLMSQYIEFVADKLLVEMGLEKHYNVTNPFPFMDNISLENKTNFFEKRVAEYQRAQ 
VMASINKIKKDQQTQETGSPLPILTAPPPVSSSSSEQEDVEDGVGDYISYDDF 
(SEQ ID NO: 249) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XYN8 (Q9XYN8) RIBONUCLEOTIDE REDUCTASE R2 SUBUNIT 388 e-107 

RIR2_HUMAN (P31350) RI BONUCLEOS I DE— DIPHOSPHATE REDUCTASE M2 381 e-105 

RIR2_MESAU (Q60561) RIBONUCLEOS IDE- DIPHOSPHATE REDUCTASE M2 381 e-105 

RIR2_MOUSE (P11157) RIBONUCLEOS IDE- DIPHOSPHATE REDUCTASE M2 378 e-104 

Q27124 (Q27124) RIBONULEOTIDS REDUCTASE SMALL SUBUNIT 377 e-104 

RIR2_BRARE (P79733) RIBONUCLEOS I DE- DIPHOSPHATE REDUCTASE M2 376 e-103 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 959 of CT1026: this 
corresponds to nucleotides 97786 to 98744 of the genomic reference sequence. 

Hits to public SBV sequences: 

gi|7672988lgb|AF144620.1: CT nucleotides 1 to 1242 match nucleotides 1 to 1242 
of the public sequence with a 99% homology, a score of 2454 and an Evalue of 
0.0; 

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 299 match nucleotides 299 to 1 of 
the public sequence with a 99% homology, a score of 585 and an Evalue of le-169 



TaqMan Primer/Probe Sets: 

5 f start=642 

5 r stop»663 

3'start=706 

3 T stop=727 

5'primer=TGCTGCAGTTGAAGGAATCTTC (residues 642 to 663 of SEQ ID NO:248) 
Tm5=58.19 

3 • p rimer =AGGTGAGACCAGGCAAAATACC (residues 706 to 727 of SEQ ID NO:248) 
Tm3=58.13 

p robe 1 — TT AGTGGTGC AT TCGCAT (residues 665 to 682 of SEQ ID NO: 248) 

probelstart=665 

probelStop«682 

direct ionl=Forward 

Tml=68.98 

scorel=l . 98 

length=86 
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CT1027 

Nucleotide 

Genomic coordinates: 

Start: 98874 

Stop: 99789 (SEQ ID NO: 250) 
Amino Acid 

MNLLPIFLTTFFVAVDACSCSTICLLPDGKKQPLVFDSVLEEWYPTDVCGPKGAGELFT 
GVDLLTLCIGGKNNGGEWSGKGPCPRINNAVVERDYSLDEEDCKGFRKGFRIPGTDHFHT 
VFSLCWVDRDMHAKWVRNKINPGIVTDDEDLVDSGIRTKFKYSSKIFGKGFNPRPLYSLD 
YQERI KI LKSH FNKRTGNFFARGHLAPAGDFFLASERWATFALEN AV PQ IQNHNNGEWKD 
I ENRARTTPGAAWAETG PI FYQHKKKEYLDKKKKYI PI PHALYKI VY DKNNKELFRVQS D 
MSWK 

(SEQ ID NO: 251) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB55635 (CAB55635) DEOXYRI BONUC LEASE I PRECURSOR {EC 3.1.2 59 6e-08 

Q9Y2C4 (Q9Y2C4) ENGL-A 4 8 7e-05 

NUCGJVIOUSE (008600) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 45 0.001 

NUCG BOVIN (P38447) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 45 0.001 

073911 (073911) K123 PROTEIN PRECURSOR 43 0.004 

NUCG_HUMAN (Q14249) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 42 0.005 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 808 of CT1027: this 
corresponds to nucleotides 98914 to 99720 of the genomic reference sequence. 
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CT1028 

Nucleotide 

Genomic coordinates: 

Start: 103891 

Stop: 104 680 (SEQ ID NO: 252) 



Amino Acid 

MSLAVTEDYGHNEKLIKRLQTSVYHTPLLGADHVMKSISDYIISE^RFMNYTNLLKQVEYV 

FDEETGAVIANICLLKILERCAQKGGIYDAPEDVAFFNSKMGEVTRLFTIIGGRPNMTVR 

VNFKHGQTNNPAYGYLTDDNDTTTVTPPVTPPPSPAARRSPFFTRTLISESSSVDHYVLM 

HDNPKRSSFKVYDIHAETFPHKAPSVPTFPPKTSFEISDVTLDCSMEIFSRDRDVLDNVH 

DYIANDPVPFLVDVVHRGSSLR 

(SEQ ID NO: 253} 

Top Blast Hits 

Sequences producing significant alignments: 
PAXI_CHICK (P4 9024) PAXILLIN 

Q24459 (Q24459) POLYCOMBLIKE NUCLEAR PROTEIN 
LAI 7_Y EAST (Q12446) PROLINE-RICH PROTEIN LAS17 
Q14676 (Q14676) KIAA0170 PROTEIN 
Q94399 (Q94 399) ZK265.2 PROTEIN 
088699 (088699) HOMEODOMAIN PROTEIN 



Score 
(bits) 

36 
35 
34 
33 
32 
32 



E 

value 

0.39 

0.67 

1 .5 

2.6 

3.4 

3.4 



Comments : 



TaqMan Primer /Probe Sets: 

5'start»390 

5'stop=413 

3'start=4S7 

3*stop=477 

5 1 primer— TCCTGCCTATGGTTATCTCACAGA (residues 390 to 413 of SEQ ID NO: 252) 
Tm5=58.77 

3 • pr imer=TCTTCTTGCAGCTGGAGATGG (residues 457 to 477 of SEQ ID NO:252) 
Tm3=59.15 

probe 1=CCTCCTGTTACTCCTCCT (residues 439 to 456 of SEQ ID NO: 252) 

probelstart=439 

probelStop=456 

direct ion l=Forward 

Tml=63 .94 

scorel=l . 94 

length-88 
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CT1029 
Nucleotide 
Genomic coordinates: 
Start: 104759 

Stop: 107330 (SEQ ID NO: 254) 
Amino Acid 

MASTSSSTKKRVHEEDENLI PQPKKKKSKKVLFFPVDKYRAVDKKVVNLI HKILDQEKDH 
LSSTELQMITECNGAREDLLKHLLDEGEFNPTIIEVVSSMPIETIYEILSSSADDKKFVQ 
I SLSML I HILFFADKGTMWVSN AC VQN VLGN DYKVE FEN I RKKYLILE DLLNGVSNHW SE 
HGPLSHMLHSSIPIVQDMLLNRLVRYFSTYEXSDAQFDISFIINSVLWGIDKSVLNELTQL 
ISRGVFIVSYVPMRVRTPSKDSNRPQNTPSQNMSALGMKLNTFSSRISVYRNNTFKKLTE 
LVHNFDYGSKDASSSSPPPPSLSDSVNTFVRLYTNYDIFUWISDWKMPYGFFKKTFDVL 
YSKGLMTLSVSEYTLKKELVTFLRALKEREILIYKMEKRDIICILKKSLFGFNFRCLKQL 
LPLFKHFLKIEEVKHIARFVFRDYSLMCKTQKDLQSFPAIQSASLFMEEFPWLAKTWIDD 
DDDEGGKGHT1»LTFAI VHRY PLI SQLI SHPI LKSLVNTTCRDKHFTPLMHLANTS IMYQC 
NTLLCLIINGAKPEFINKFNENVLHIAIENVNYGVITELRGTLSSEQIEKMVNVRRMMDN 
TTPLMIALARENIVLAQLFDGLYKPKIKVRFGSSKRLRIPEFVLLKGLKESVAYLETRNI 
SYDINI IKDAVMDNSLFEEEYEIAAAGLRGNNCDPEADEKTMNTWNFFTKNSTKWASSIF 
QKNRQKFVK1VDGMNRTYEDSECAICLDSLDGDLPSGRTTCGHCFHNVCWLSLIRMSGPN 
NGSRARGGGIKCPSCRQVTCLGKRIiGVADYDIETEEERDTKNWPSVEEGRREWRKIGVD 
RYEFLVGGVWTNEIKL 
(SEQ ID NO: 255) 

Top Blast Hits 

Sequences producing significant alignments: 

Q9Y577 (Q9Y577) RING FINGER PROTEIN TERF 
AAD27914 (AAD27914) T23015.13 PROTEIN 

AAD56722 (AAD56722) AUTOCRINE MOTILITY FACTOR RECEPTOR 
Q9WV59 (Q9WV59) RING FINGER PROTEIN TERF 
082239 (082239) F17A22.9 PROTEIN 

R052_MOUSE (Q62191) 52 KD RO PROTEIN (SJOGREN SYNDROME TYPE 
Comments : 

TaqMan Primer/Probe Sets: 
5 , start=1406 
5'stop=1425 
3'start«1491 
3 , stop=1514 

5 ■ primer =AATTCCCTTGGCTTGCAAAA (residues 1406 to 1425 of SEQ ID NO: 254) 
Tm5=58.29 

3 1 pr imer=TGGCTTATTAAGGGATATCTGTGC (residues 1491 to 1514 of SEQ ID NO: 254) 
Tm3=57.50 

probe 1=TTGGATCGACGACGATGA (residues 1428 to 1445 of SEQ ID NO: 254) 

probelstart=a428 

probelStop=l 4 4 5 

direct ionl=Forward 

Tml=68. 95 

scorel^l . 95 

length=109 



Score E 
(bits) Value 

43 0.007 

40 0.058 

40 0.076 

39 0.13 

39 0.13~ 

38 0.22 
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CT620 

Nucleotide 

Genomic coordinates: 

Start: 172439 

Stop: 171509<SEQ ID NO: 256) 



Amino Acid 

MDNLI TNDN I ILVTFLSGLAVGCSMT IGLALAMNMLVKC I DRTTTCI SCSPWEKNKNKKN 
RNGSNTESSFISHVRFNTPDKDLDISEPMLKSTTYDLANVTPQVTKLVTFSGPTYASPPT 
PRPVANTPQQQPTSTNKEEESVYMPMSSCSSSFSSDNSLPLPTPPPSPPRSNGGDYVSYV 
NGRHLKLPSNPPSPIFNIKNEEGEDDNVEEHVYEYVPEVPQQSPSIQKCIQELKEMKHKK 
NTLTRSSSNNNNNAPRITQVTFKKFPPNNNNMWENHVYGNTTIVSSTPSPTFrPSPKSII 
RKLSFKRKQ 
(SEQ ID NO: 257) 



Top Blast Hits 

Sequences producing significant alignments: 
Q24035 <Q24 035) ENA POLYPEPTIDE 

Q41805 (Q41805) EXTENS IN-LIKE PROTEIN PRECURSOR 

Q4078 6 (Q4 0786) ARABINOGALACTAN- PROTEIN PRECURSOR 

Q9XZU9 (Q9XZU9) LARGEST SUBUNIT OF THE RNA POLYMERASE II CO 

036428 (036428) GLYCOPROTEIN PRECURSOR 

YHC4 YEAST (P38741) HYPOTHETICAL 80. 1 KD PROTEIN IN SNF6-SP 



Score 
(bits) 

46 
43 
43 
42 
42 
42 



E 

Value 

4e-04 
0.002 
0.003 
0.005 
0.005 
0.005 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 665 to 1 of CT620: this 
corresponds to nucleotides 171494 to 172158 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start«4 94 

5'stop=513 

3 , start=558 

3*stop=578 

5 1 primer =CGCCATCTCCACCTAGAAGC (residues 494 to 513 of SEQ ID NO:256) 
Tm5=58 .49 

3 •primer=GAAGGTGGGTTTGAAGGAAGC {residues 558 to 578 of SEQ ID NO: 256) 
Tm3=58 . 68 

probe 1=GGCGGTGATTACGTGTCA (residues 517 to 534 of SEQ ID NO:256) 

probelstart^517 

probelStop=534 

direct ionl=Reverse 

Tml-69.00 

scorel=l .99 

length=85 
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CT621 

Nucleotide 

Genomic coordinates: 

Start: 173054 

Stop: 172505 (SEQ ID NO: 258) 
Amino Acid 

MGES I FDAVSLATNNPKKSNSRNKKLLRELKNMRKDFPSTFLQCRMI DFHFSGDI I DKH Y 
CHSVNVPDVVPNTIFAVFLPEEDRANNPGLYDSIEGVCITVEQGELCIINKSSVHEFNIL 
VSLHKDLFGE DI LDG IETASREE SRS I HLYLEAGQSI RT PI PRPEGTNTVN YT I V F SNQV 
TV 

(SEQ ID NO: 259) 
Top Blast Hits 



Sequences producing significant alignments: 



Score E 
(bits) Value 



088632 (088632) SEMAPHORIN IV ISOFORM B 

088633 (088633) SEMAPHORIN IV ISOFORM A 

02924 4 (029244) ABC TRANSPORTER , ATP-BINDING PROTEIN 
054948 (054948) SEMAPHORIN IV HOMOLOG (FRAGMENT) 
Q13275 (Q13275) SEMAPHORIN IV PRECURSOR (SEMAPHORIN III/F) 
INA9 MOUSE (P09235) INTERFERON ALPHA- 9 PRECURSOR 



34 
34 
34 
33 
32 
32 



0.54 

0.54 

0.71 

1.6 

2.7 

3.6 



Comments : 
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CT622 

Nucleotide 

Genomic coordinates: 
Start: 183817 

Stop: 180274 (SEQ ID NO: 260) 



Amino Acid 

meaasnlritegagvldidneddinnnvdysnlyedeedeeemnedeeeeeeedyedede 
dtgvrngrnkdppsskkqskfvrdvtndmyddddeeeeeeeeeedeegeeggeydgnled 
eeeeg de ye d dn egegeedeadp allalaaqqe dat 1 1 penqwks i vn t ps p vg pnrq vl 
pmlnfllenvnamggsageeqknkeddnqqiepveeeedeeeeeqeeeeeeeeeqeeeee 
ekepieqeknepekdedaienesvhshrvesspmseggnddgmdyffssiagggndneed 
eeedeeegeeeeeeepaqkseehvetkesvqshteyieeeeeyeeyedesrhtledeeis 
tmhqfnnaprvrrspppdiqecedavvfppimketdilpqikepspkaprmfsilgsgge 
eqydqlndiapppvpsivtfppdnemgeesrdimdqdsmlmppppppppppphqppqlkp 
tnillpppppppitcqslfsnnnnnpsflstvvgkvnntlggkeaeerlhktmesiilktr 
vktllettknlqcselvkvvfqdpenpvkpsekvmerlkniiaaeltmkafldsaavtdi 
ksaelfrktneklelfqrkqimsnplfsaayastyimgeraskirpstpapslkkvesis 
elnedetsmsssaggvcaegdesiagggggggggggewehssfysnqtqanlhmelini 
lkedddnqpcqt yklgqrlaflnnli s fkts s avswsrlvnmls di vtkasvalfgdtnk 
aqedfekhotetndvsdlstssklkqmskesanimeemglgsigaeicfgaistiiekhi 
nklcmdvgrltiflnipivllnwpkeftlskdykvllldsisscsskmavppiyvlnsiq 
fdkavdeededgngseaekrsedgnmfsekdkkeairrvydnirygdsndrtslnhffgd 
aysgvsnnnsknsmfdlqtqgggrfgvaysagssiiehrspifdnalntlvnfmdkrkhl 
lsawik1j.kkaklsievycikyklnqasekynkkgkhgkstsvvpmrnlmyrpsknqdv 
spstpaaatamdvpssvsshvgrkrtfsfsndinsnmssassvyidqesstpsrrrtfmd 
llnnkssvnslakqvkrmkhtkyynsssnsedddeddqye 

(SEQ ID NO: 261) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9YTL7 (Q9YTL7) ORF 48 154 3e-36 

040947 (040947) ORF 73 148 le-34 

Q9814 8 (Q98148) 0RF73 HOMOLOG 138 3e-31 

AAD4 6501 (AAD4 6501) LATENT NUCLEAR ANTIGEN 136 le-30 

VG48_HSVSA (Q01033) HYPOTHETICAL GENE 48 PROTEIN 133 6e-30 

Q91255 (Q91255) NF-180 130 5e-29 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=1940 

5'stop=1962 

3'start=2039 

3'stop»2058 

5'primer=CGACTCCTGCTCCTTCTCTTAAA (residues 1940 to 1962 of SEQ ID NO: 260) 
Tm5=57.98 

3'primer=TCCAGCAATAGACTCGTCGC (residues 2039 to 2058 of SEQ ID NO: 260) 
Tm3=57. 66 

probe 1 =GTGGAGTATGTGCTGAAG (residues 2021 to 2038 of SEQ ID NO: 260) 

probe Is tart =2 021 

probelStop=2038 

directionl=Reverse 

Tml=69.00 

scorel-1 . 99 

length=119 
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CT623 

Nucleotide 

Genomic coordinates: 

Start: 196292 

Stop: 195506 (SEQ ID NO: 262) 
Amino Acid 

MSNGATISDERLILILDKIVERRGVSNLSELLIHPITKHINELLKNTVRHGDRVYMKDAE 

LDVRSRI*EDIKKDCVLKAIEKQGIDVRQIITDYI*AKRKLTQNLVHWYRPPISCTDIDEKI 

QQETGQVGRCSVATYNLRIGGDDGEFTRYDFSIPLGDFKITAKLFRSINDEDVDAVILVS 

RSDWNDVLSFEAFNRTGERVVIFFNVIVEGKSKDIDIVCKSRYKHTHILNGESATYAVK 

RIKRGDTRDDILFAITAFKEE 

(SEQ ID NO: 263) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

PYC1JYEAST (P11154) PYRUVATE CARBOXYLASE 1 (EC 6.4.1.1) (PY 33 1.9 

BAA85008 (BAA85008) ORF3P 33 2.5 

BAA85073 (BAA85073) ORF4S 33 2.5 

PYC2_YEAST (P32327) PYRUVATE CARBOXYLASE 2 (EC 6.4.1.1) (PY 32 5.7 

027375 (027375) ADENINE PHOSPHORIBOSYLTRANSFERASE 32 5.7 

Q88444 (Q88444) GLYCOPROTEIN PRECURSOR 31 7.5 

Comments : 
5'start=282 
5'stop=3 04 
3 f start=383 
3'stop=402 

5 * p rimer =GGCTAAACGAAAACTAACGCAAA (residues 282 to 304 of* SEQ ID NO:262) 
Tm5=58.58 

3 ' primer =CGTAGCAACACTACACCGCC (residues 383 to 402 of SEQ ID NO:262) 
Tm3~57.92 

probe 1-CCAATATCTTGCACAGAT (residues 328 to 345 of SEQ ID NO: 262) 

probelstart=328 

probelStop-345 

direct ionl*Reverse 

Tml-68.97 

scorel=1.88 

length=121 
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CT624 

Nucleotide 

Genomic coordinates: 

Start: 195503 

Stop: 194651 (SEQ ID NO: 264) 
Amino Acid 

MSSNGDEPAVTEAEIASVEAQLGAAHHDNSWITRKSDQLKYRLGAIAYSVAKNASIKYIE 
DQVRQE INSHLTNVMT FEHLYE DAFN PV ICEAI FEKGI PVVWEKVYDVNRRIMEPREDFI 
TEILKEERWRRYIPGFYHTSFSFKYNTIAFTDSSTSFSVPINDKHMLSITPPGAAQGDLI 
DLSLSFKI DSSAKTLTLEFNRKSTFAGIVNRPKSWILSNLRNSDSSDNIGDYLKRNDPI 
YISHDTNGI INPSEDSASLIT I HMPE I ENAS DDLYI DFNLFVF 
(SEQ ID NO: 265) 

Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



Y687_METJA (Q58100) HYPOTHETICAL PROTEIN MJ0687 
Q9Y717 (Q9Y717) CAGCR3 PROTEIN 
Q83457 (Q83457) FIBRE 

CAB53329 (CAB53329) PUTATIVE FORM YLT RAN SFERASE 
PGK_LACDE (032756) PHOSPHOGLYCERATE KINASE (EC 2.7.2.3) 
Q9XEQ1 (Q9XEQ1) TNP2-LIKE PROTEIN 



36 
33 
32 
32 
32 
32 



0.42 

2.8 

4.7 

4.7 

6.2 

6.2 



Comments : 
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CT625 

Nucleotide 

Genomic coordinates : 

Start: 194 629 

Stop: 193327 (SEQ ID NO: 266) 



Amino Acid 

MALSNNGGIYIVFAVIVLVIGASIALFFAISGVGKGTLHSNAKTKKSKKYKLDSKYTDDD 

EKTDDDNNNNGGGGGGTVDVI NETALQRQTREH FARTLEKAEDE FFTKLADQEFDTYKSE 

NVWLIKDKITDGKVSIPEGDINVPDVGQAIADENLFDLIGTNHDEVKETMDEVVAQKSTN 

ITYEQLVIDLTNILLFGTVTVDPSDENGDESLQRSTDPDAEMVMLTTTPSSQLARQQQPP 

QPTPDYLARYSKELVINNIRGGFISDRDMRTWQGRMSVHVNMKQRTFNVISAATNLDSLQ 

VGLEPVLQKQGRAAVGGRIEKARIEFSFWEGNRVRVYATNKTEDCFCSLLPNCYNVKKA 

S DYW I SSASTAKEKT YLFI ANKNDETS FFYN FEEGVEE I DLDI FMTI DCAPNLPFIKNLP 

RPITDNNIMVALS 

(SEQ ID NO: 267) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

096148 (096148) HYPOTHETICAL 57.8 KD PROTEIN 38 0.14 

TRIl_ECOLI (P14565) TRAI PROTEIN (DNA HELICASE I) (EC 3.6.1 37 0.31 

Q43423 (Q43423) CYSTEINE PROTEINASE (FRAGMENT) 36 0.40 

015696 (015696) ORNITHINE DECARBOXYLASE 36 0.69 

MIX2_XENLA (Q91685) HOMEOBOX PROTEIN MIX. 2 36 0.69 

BAA84604 (BAA84604) MYOM PROTEIN 36 0.69 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 351 to 1 of CT625: this 
corresponds to nucleotides 193270 to 193620 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 f start=651 

S'stop=669 

3 f start=729 

3'stop-748 

5 * prime r^CCCAGACGCAGAAATGGTG (residues 651 to 669 of SEQ ID NO:266) 
Tm5=58 . 66 

3 ' prime r=ACCGGGCAAGGTAATCAGGT (residues 729 to 74B of SEQ ID NO:266) 
Tm3=59.29 

probe 1~ACAACAACAACCTCCTCA (residues 705 to 722 of SEQ ID NO: 266) 

probelstart=705 

probelStop=722 

directionl=Forward 

Tml=69.01 

scorel=1.98 

length=98 
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CT626 

Nucleotide 

Genomic coordinates : 

Start: 228196 

Stop: 227989 (SEQ ID NO: 268) 
Amino Acid 

MSMyiTRNIIVGLAVVVIALSt4VAFMLSVTPALTGFLLGLGVSALGVTLFGCPTMKSPGGG 

NATINPVA 

(SEQ ID NO: 269) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1032 to 768 of CT626: this 
corresponds to nucleotides 227951 to 228215 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 

5' start=60 

5'stop=80 

3 f start=151 

3'stop=173 

5 1 primer =CATGGTCGCTTTCATGCTTTC (residues 60 to 80 of SEQ ID NO:268) 
Tm5=59.30 

3 , primer=CCTGGAGATTTCATAGTGGGACA (residues 151 to 173 of SEQ ID NO: 268) 
Tm3=58.94 

probe 1=TGTTACTCCTGCACTTAC (residues 81 to 98 of SEQ ID NO: 268) 

probelstart=81 

probelStop=98 

directionl=Forward 

Tml-68.95 

scorel=l .95 

length=114 



Sequences producing significant alignments: 



Score E 
(bits) Value 



SUGE_PROVU (P20928) SUGE PROTEIN HOMOLOG 

Q4854 6 (Q4 8546) INTEGRAL OUTER MEMBRANE PROTEIN 

086349 (086349) HYPOTHETICAL 27.4 KD PROTEIN 

Q9Y9X7 (Q9Y9X7) 604AA LONG HYPOTHETICAL CARBON STARVATION P 
0698 67 (069867) PUTATIVE TRANSMEMBRANE TRANSPORT PROTEIN 
026370 (026370) HYPOTHETICAL 5-9 KD PROTEIN 



32 
32 
31 
30 
29 
29 



0.35 

0.35 

1.0 

1.8 

4.0 

4.0 
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CT627 

Nucleotide 

Genomic coordinates: 

Start: 234330 

Stop: 233778 (SEQ ID NO: 270) 



Amino Acid 

MFQKWFESFLDSSRPRYLDTTCVCSVYSYFSPCRKHIKFSTSHSHEGIKIHPPSILNHNT 
SSPTSGKMCNHHHKRLYLSTDDHTRWYDKNTSCIYLEDIGGVQFMVYEFHLTPKNNQLFS 
FPVHLQIHNRNTEKTSLLVFENEEDMRVRNIHPKSKILIPVSKDTVLVENGFRYKVKIVL 
SNK 

(SEQ ID NO: 271) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TRBHECOLI (P19381) TRBH PROTEIN 31 6.4 

BCL6_HUMAN (P41182) B-CELL LYMPHOMA 6 PROTEIN (BCL-6) (ZINC 31 6.4 

Comments : 



TaqMan Primer/Probe Sets: 

5'start=187 

5 , stop=207 

3'start=233 

3'stop=253 

5 •primer=CCCACCAGTGGAAAGATGTGT (residues 187 to 207 of SEQ ID NO: 270) 
Tm5=58.36 

3 1 primer =TCGTATGGTCGTCAGTGCTCA (residues 233 to 253 of SEQ ID NO: 270) 
Tm3=S8.89 

probe l^CACCACAAGAGATTGT AC (residues 214 to 231 of SEQ ID NO: 270) 

probelstart~214 

probelStop=231 

di re c t i on 1 = Forwa r d 

Tml~68.98 

scorel^l . 98 

length=67 
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CT628 

Nucleotide 

Genomic coordinates: 

Start: 240139 

Stop: 239455 (SEQ ID NO: 272) 



Amino Acid 

M DSL IS KLEN I FS I AEQDFFN A DSM FMQTMLLPT DAM FT DCES PLY KNKSGGKN I VTDVG 
ESVLSSSSDEKMSFKVLSHVLRRFFVLLHCNYKQTNTPLWKELYKHGKFALLGDLVLFSN 
PFHPNIPAMPFDKSPICDTTGKSIIMSEVMTKELLVKLADKDIGQFFAVLNVTNPITGDS 
FLH Y FAGGNTMRDGEG DKI CTS ADV LR 1 1 AE I T I QKTGKMP YE LMKK 
(SEQ ID NO: 273) 



Top Blast Hits 

Sequences producing significant alignments: 

SYT_HAEIN (P43014) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T 
SYT_ECOLI (P00955) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T 
CDRl_SCHPO (P07334) MITOSIS INDUCER PROTEIN KINASE CDR1 (EC 
Q55626 (Q55626) METHYLENETETRAHYDROFOLATE DEHYDROGENASE 
ERG2_SCHPO (P87113) PROBABLE C-8 STEROL I SOME RASE (DELTA- 8- 
Q85056 (Q8 5056) SEGMENT 2 



Score 
(bits) 

39 
38 
35 
32 
32 
31 



E 

Value 

0.028 
0.049 
.55 
.7 
.8 
.3 



Comments : 



TaqMan Primer/ Probe Sets: 

S'start=24 6 

5'stop=267 

3'start=288 

3 t stop=305 

5 f primer=GCGATTCCCTGTCCTACTTCAT (residues 246 to 267 of SEQ ID NO:272) 
Tm5=58.22 

3 • primer— TCCTTCCACAGGGGCGTA (residues 288 to 305 of SEQ ID NO:272) 
Tm3=S8. 60 

probe 1=CAACTACAAGCAGACGAA (residues 270 to 287 of SEQ ID NO: 272) 

probelstart=270 

probelStop'= s 287 

directionl=Forward 

Tml=69.05 

scorel=l - 94 

length=60 
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CT62 9 

Nucleotide 

Genomic coordinates : 

Start: 247143 

Stop: 244950 (SEQ ID NO: 274} 



Amino Acid 

MEGGDQRTKLTPATVMGLYQSKTPGEGEGGEGGGQFKIPSAIAVKSCCSKNATRRSPPSD 

SPYSLRPMKRLKKNNGEVGGKAPPPVTLRLREDYESTPYNFNRNKKKRPITIDENQFATIi 

NPTYATDIIKKQQLPSVSAASVLRKHRANADTQYRKRFSHPNCAKFSTVNLKARDYTPLS 

VLRSHVKGPKHLKSSCDTVTETNWKRNFSS I DKWVKLEKPPCYFAVAEADTNI AAGLES 

PFHLIRQAAKLGLISDVQDVSSNYETIKQSCIDAKEKASKFLWSNNRTKQPPSSWWPVGF 

GSKNLS VLDTS PLLNWNRLCKNNGKGW I KTMS I DHMAKN V FKLS PGACES I LEKKTTLLG 

EVTAQCKKWESYRRNIPVPAHVQPEYASQVVMIGPSELYLEVKVGVYYMLETGKVIKFMT 

DKEMYCEFVFETVFSHALEGRMKGAVGVRKMCVEGFCVEMDFAGISVIDVLNGDLKCKMD 

ENWQQPNPSTTSSKPAAELMQDHGSLCRMRDTLYGVRMLQATGRLPEGLQSKCKKPITD 

SISAIAIVGKMRERMLNQLPFVLVEIVNIVTRLSQQGLVKPDIKSDNIVIDGITGQPKMI 

DFGLI VPCKKYYNFKCWGTDERFFSNH PHT APEFINSELCSETAMTFGLAYLLI DMLSIlt 

I KRTADLSANS I YTNI PFLS I VSKMY DQEKTNR PRAYE I APV I GACFP FKDNI AKLFQS P 

KHSLYSKKVK 

(SEQ ID NO: 275) 

Top Blast Hits 

Sequences producing significant alignments: 

Q20443 (Q20443) PRK-2 PROTEIN 

Q17735 (Q17735) SIMILAR TO PROTEIN KINASE 

KPC1_LYTPI (Q25378) PROTEIN KINASE C (EC 2.7.1.-) 

001715 (O01715) PROTEIN KINASE C 

001716 (001716) PROTEIN KINASE C (FRAGMENT) 

Q21734 (Q21734) PUTATIVE RIBOSOMAL PROTEIN S6 KINASE II ALP 



Score 


E 


(bits) 


Value 


46 


0.001 


44 


0.004 


43 


0.007 


43 


0.010 


43 


0.010 


42 


0.017 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 933 of CT629: this 
corresponds to nucleotides 244994 to 245926 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

S'start-933 

5'stop=954 

3'start<=1020 

3'stop=l04 0 

5 , primer s =CCCTCTCTTGAACTGGAACAGG (residues 933 to 954 of SEQ ID NO: 274) 
Tm5=58.83 

3 ' prime r=GCTCCAGGGGAAAGCTTAAAA (residues 1020 to 1040 of SEQ ID NO: 274) 
Tm3=58-49 

p robe 1 =TG AGC ATCGATC ACATGG (residues 992 to 1009 of SEQ ID NO: 274) 

probelstart=992 

probe 1 S t op= 1 009 

directionl^Forward 

Tml=68.98 

scorel=1.98 

length=108 
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CT1030 

Nucleotide 

Genomic coordinates: 

Start: 108549 

Stop: 109164 (SEQ ID NO: 276) 
Amino Acid 

MSSGKVTYEIVEGGLLNNKYLLDGGAAICLQSNCVARKRHAGSLHDNIiFKMLGFGDPYKQ 
RRGKTNSKNLAIIEDRPQLGSVSVVQHPTEPERFCSMTFLFAQYNMGNGRKCYFPNDKEY 
VESCKKHERVHKSSTEMKRLRLYYFNKCLHAI AKSPAMKKYNKI I FPARIGCAAAGGDWE 
KYHASIRDFSTIIDKEVIIVSQRM 
(SEQ ID NO: 277) 

Top Blast Hits 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 604 of CT1030: this 
corresponds to nucleotides 10854 6 to 10914 9 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=284 

5'stop=307 

3'start=387 

3'stop=408 

5 , primer=GCTCCATGACATTCTTATTTGCTC {residues 284 to 307 of SEQ ID NO:276) 
Tm5=57.98 

3 * pr imer=TTCTGTGGAAGATTTGTGGACC (residues 387 to 408 of SEQ ID NO:276) 
Tm3=57 . 62 

probel=TGTTGAGAGCTGCAAGAA (residues 360 to 377 of SEQ ID NO: 276) 

probelstart=360 

probe lStop=377 

directionl-Forward 

Tml-69.08 

scorel=l . 91 

length=125 



Sequences producing significant alignments: 



Score E 
(bits) Value 



Q9YTM9 (Q9YTM9) ORF 36 

Q9XAQ7 (Q9XAQ7) NUOD, NADH DEHYDROGENASE SUBUNIT 



32 
31 



4.3 
9.7 
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CT1031 

Nucleotide 

Genomic coordinates: 

Start: 109260 

Stop: 110088 (SEQ ID NO: 278) 



Amino Acid 

MSSNRFSQLRGNEEMVGDYSRWTTVKNRRNRQQQYSHSFRPQQQQQHQKRTSTNSPPAPP 
PPFPI I SWGALGSYSMYRLDDQCRNCDETGY YNFHS YDRKRERVRSLNNTPSEGMWRRTS 
RSSPFLNKKKDVDEAPPPQSNQHMYPLNKYSFREYTPSSKLVNWRDPSQEKQDKILQEEE 
ARAPTPTPQEKEPEVETKDDWIEEETAPEPEPEPAPVPDPDIPAITATTTTTTVATRHD 
DSSTV FLRKVILS I V FWFLGV YSALFAKCIRSKKE 
(SEQ ID NO: 279) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q01823 (Q01823) ORF-3 48 5e-05 

PAR1_TRYBB (P08469) PROCYCLIC FORM SPECIFIC POLYPEPTIDE Bl- 48 7e-05 

Q99356 (Q99356) PROCYCLIN 48 9e-05 

P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN 48 9e-05 

PARC_TRYBB (Q06084) PROCYCLIC FORM SPECIFIC POLYPEPTIDE B-A 48 9e-05 

Q27045 (Q27045) SCHI ZONT/SPOROZOITE SURFACE PROTEIN ( FRAGME 48 9e-05 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 137 to 680 of CT1031: this 
corresponds to nucleotides 109401 to 109944 of the genomic reference sequence. 



TaqMan Primer/Probe Sets: 

5'start=255 

5 f stop«277 

3*start-347 

3 f stop=364 

5 f primer.=TTGCGATGAAACTGGCTATTACA (residues 255 to 277 of SEQ ID NO: 278) 
Tm5=S8.4 4 

3 * primer ^ATCTACTTGTGCGCCGCC (residues 347 to 364 of SEQ ID NO: 278) 
Tra3*58.42 

probe l^CTCCAAGTGAAGGCATGT (residues 329 to 346 of SEQ ID NO: 278) 

probelstart=329 

probelStop=34 6 

direc t ion 1— Forward 

Tml=68.97 

scorel=l . 97 

length=110 
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CT1032 

Nucleotide 

Genomic coordinates: 

Start: 119056 

Stop: 121081 (SEQ ID NO: 280) 



Amino Acid 

MAGNRTQrVSSLIAKCISDVEQGMECCGRQAQDALMTRLANLKLGDSLKETDVNLEYLRY 
ASTPLLGELNYDKQQYAATVDINLMAHFSYAALGIESILNSIRRWVANHQRRNNGKKPS 
EPISRPHPLGGVEPPLSSELANAIRDKFISMGALDRLNSAIVTAALGAIASERELFLREN 
AVNYMYDVEFAERDAATTDTGNVVYLSTKMDEDEDDIIKRSEILDKVSKRPAKEGIDWRP 
TPDNSFPYQLIWGDDSVDDTVLIDLITNAIVPNIFMAKFILFICNHLRAVIRSMREILYG 
NISSSSDNYFEDGRKWCFWLNLYNRLEWFMLVVRFVIFLHSKKESFSGADNVNVKRLLVV 
WESFPPVLLDTEWVKTNITSWPVINNSNNNSTLPVTEDTLMRLAIRTSSGARHPIFDEI 
NSLTTAVTNRITFQSAEFCTKILLGRALDEEEAGTKMLVKSVKETGEEKDKNNTFSSFGL 
LLKNTKNEELEINIGDNDDETTDVACWARTSSTSFIRNRTYAFKKIWGLEDASDVVELKR 
ESDAITSFVTDKSSPLLFPYVSDWSCLLLHPCCKAPAIIKSVWLQILKDFSQENIKTINE 
KVQSLSSEICQKSNDRFKNKKIAAEHVRSVKKLLNTISNREQEAALSTEHCIWLTILWKQ 
VVQNTLNLLENFPV 
(SEQ ID NO: 281) 



Top Blast Hits 

Sequences producing significant alignments: 

025761 (025761) HYPOTHETICAL 88.8 KD PROTEIN 
075130 (075130) KIAA0635 PROTEIN 

Q17425 (Q17425) HYPOTHETICAL 62 - 4 KD PROTEIN B0024.10 IN CH 
033600 (033600) PURINE NTPASE 

Y066_NPVAC (P414 67) HYPOTHETICAL 94.0 KD PROTEIN IN POL-LEF 
Q53626 (Q53626) ATP-BINDING PROTEIN INVOLVED IN MITHRAMYCIN 



Score E 
(bits) Value 



38 
34 
34 
34 
34 
33 



0.29 

3.3 

3.3 

4.4 

4.4 

5.7 



Comments : 
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CT1033 

Nucleotide 

Genomic coordinates: 

Start: 121099 

Stop: 123634 (SEQ ID NO: 282) 



Amino Acid 

MFTHLTRAFRKMNNLVNRSFIDVHRVVAELSYPEFEEDVKNPESSIYRTPISLFQNKDIV 
TIVGDYILSPKTDSFQVLYPIKKVIEHFPVIFHCTHNNAPLWVHLLDERHHRLLQSLLTY 
EIVNAKYRGIVVIPYYRRPINYQTGKSLLMSKIASVKVLDILMRCGSYKFISLMCMINKK 
NNTNFLHCCASKWGEVGSKMMLHIAEMFFANPTTSQHLSDASSFPDAAAEDDKGKTPAHL 
AIQEDNADALLFLISLYGAPWFQDNNSYMKSALELKSNKCVKVLSFAADKYEILPNINNN 
QLEPDTMCGVCATSVEEDENEGKTTSLSWYQMNCKHYIHCECLMGMCAAAGNVQCPMCRE 
DVGDEVLERCPPTIFRWLKLAERSEHNRVLFEAKKQEFYKQMEAMKPPRVWPPRRTFLT 
PARRGERAIRIAREIATNAIAEATAQGDVNSYFPVLIDGSGEEYEEEGEEFFNSEEEALA 
FGRP FLEDEEEARQIQMRQFAELSRRG VS VN 1 1 NNDN PHRH I S T VN I VQPV YGVEKS PAA 
SFIYWMLKNDVFESIRSRDTRVGGERVPVMNLSNDKRALFHAASSMLCDFATETNSQIVG 
LDFQAVYDPHHISNYIETFGSPLHAYPGAVTFLDGAQDYYAESIRYDNDIVSFSEMASEL 
HITEALDVFEGSLLSPLFKKIRTGKSYSNWNDHLRRRNYARDIAEEFVRVCENSLASREH 
P PVHVHPFRDGAI PI LI E Y I V DFI HHC I TWSMQVNALHCMRKY IEHENTNVHLLNLRPTD 
ERVEVLRVSQLRWSRLFNEQYNTRMSLSTKRLSLMKIFNHDLGVSKFGVYKLLDIIEMYC 
FTLI 

(SEQ ID NO: 283) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XUM8 (Q9XUM8) W02A11.3 PROTEIN 39 0.17 

YEYC_CAEEL (Q21802) HYPOTHETICAL 64.6 KD PROTEIN R07B7 . 12 I 38 0.22 

CAB40955 (CAB40955) HYPOTHETICAL 22.5 KD PROTEIN 37 0.50 

YBR2_YEAST (P38239) HYPOTHETICAL 13.2 KD PROTEIN IN ORC2-TI 37 0.50 

Q47955 (Q47955) HHDA PRECURSOR 37 0.50 

082239 (082239) F17A22.9 PROTEIN 37 0.50 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 208 of CT1033: this 
corresponds to nucleotides 1234 4 9 to 123656 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1206 

5*stop-1225 

3'start=1269 

3'stop=1287 

5'primer=GGAAGCAATGAAACCTCCCA (residues 1206 to 1225 of SEQ ID N0:282) 
Tm5=58 . 56 

3 , primer=GATGGCTCGTTCGCCTCTT {residues 1269 to 1287 of SEQ ID NO:282) 
Tm3=58. 97 

probe 1 =TTGTTCCTCCTCGC AGGA (residues 1232 to 1249 of SEQ ID NO:282) 

probelstart=1232 

probe ISt op= 1249 

directionl=Forward 

Tml=68. 99 

scorel=l . 99 

length=*82 
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CT1034 

Nucleotide 

Genomic coordinates: 

Start: 123757 

Stop: 126556 (SEQ ID NO: 284) 



Amino Acid 

MAADLLEItAIQETIQSELEEIADTEFLNYLPHKTGICEEAAANGRPYLPTLEMRNEVDHF 

WSQDNRKIfKLLGHFCGNLWEAFIAGSIDAETCVGFLRSQATGLGYPLLKKLALIAREDK 

SNTTNYNLYIDRNSMMKQVFSAEIDKRPSSIQNTSHTKSSPVYLKLIDRRTECLALDWLD 

ASKRTAKEIGAARKVCFLQNLIVAILIPAYTETFVLDTGNELEQQVLDDAYFNAENKDKV 

DEMCVVAILSTLHNLFVRKSLPHHLYNAPFRLPPFGQHPIINIENSSFFNEDTTPILASI 

SIPSSMVIKHHTRKNSRWRCPNNLMTAAERSIFLRGVLTVSGDYGWFSVIVGSTIMPSVL 

F YGDRKHLINTVKSNNFSAITCS YWNKYMDCRS YGFE 1 1 DT PENNCGFRI RAAI DCSNTD 

FHSPVTRVNKKKTSIINAVKNPFFIRHTEPKWYNKN7^MCGEVLENVGVTLEQHVRVSDEY 

MDRFGSLLLGREKKWTCNYLDRIKSLETISNNLKGKIDTMCKILETKYNYKSSSLYYKQI 

TATS DDPIKMKI I AS INKRRYLCNILEFAI I SS EKKDEVEEDHTKTGNGGCAFSKYKKKQ 

LEPKQHLIVKVNKYIEAFSLIKMLRNDCERNKCRFKEAEIRECANELVRELYRASARSYV 

HDLVLKRTNVHLTWQRPYDENANTIMSLIPKCKLHTVLYDKDSRDVKLLKFLRTRDGNYN 

PIRHSMLELVYGEEYAKDVSTVTCFEWLKWCSKKGVIKYEDFLDRYEKTGEEDKDEREFF 

RLKKCSRDHTKDIKKIENVLNSDTLYSYSLDKNVQTHASSSTVVKNDTDGKTSMVGWDYI 

FSIGKGEKTTKKRKLETIDISSSDDDDEEEEEEDEGKRMKMNNCSSSIKNKSKNKNGRMC 

CTDILNWEPSLPNTLSFNCVKSMDVLNLL 

(SEQ ID NO: 285) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

062235 (062235) F36F2.3 PROTEIN 46 8e-04 

AAF04442 (AAF04442) PUTATIVE TRANSLATION INITIATION FACTOR 43 0.012 

060313 (O60313) KIAA0567 PROTEIN (FRAGMENT) 41 0.037 

Q904 91 (Q90491) DNA BINDING PROTEIN E12 41 0.048 

BAA82145 (BAA82145) MYOSIN HEAVY CHAIN 2B 40 0.063 

097303 (O97303) PFC1060C PROTEIN 40 0.063 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 647 of CT1034 : this 
corresponds to nucleotides 125649 to 126295 of the genomic reference sequence. 



TaqMan Primer/Probe Sets: 

5'start=1391 

5'stop=1410 

3*start=14 68 

3'stop*=1489 

5 1 primer =AAAATGTTGGCGTGACCCTC (residues 1391 to 1410 of SEQ ID NO: 284) 
Tm5<=58.49 

3 * primer =ACGTCCATTTCTTTTCTCGTCC (residues 1468 to 1489 of SEQ ID NO:284) 
Tm3=58.24 

probe 1=AACAACACGTCCGTGTTA (residues 1412 to 1429 of SEQ ID NO:284) 

probelstart=1412 

probelStop=1429 

directionl^Reverse 

Tml=69.00 

scorel=l .99 

length-99 
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SBV.ORF022 .txt 

Nucleotide 

Genomic Coordinates: 

Start: 265664 

Stop: 266297 (SEQ ID NO:286) 

Comments: EST confirmation of transcript not predicted computationally- 

TaqMan Primer/Probe Sets: 

5*start=230 

5'stop=252 

3'start=321 

3'stop*343 

5 ' prime r=CAAATCAAAGGCCATATGAAACC (residues 230 to 252 of SEQ ID NO:286) 
TmS=58.41 

3 ' prime r=GGTGGTCTTTATTTTCATCACCG (residues 321 to 343 of SEQ ID NO: 286) 

Tm3=58.41 

primerScore=0 .86 

allelel= 

probe 1=GAGGATAGCCAGTTCTAT (residues 265 to 282 of SEQ ID NO: 286) 

probelstart=265 

probelStop=282 

direct ionl=Reverse 

Tml-68 . 99 

scorel=l-99 

length=114 
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C003 

Nucleotide 

Genomic Coordinates: 

Start: 126709 

Stop: 127126 (SEQ ID NO: 287) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer /Probe Sets: 

5'start=175 

5'stop=194 

S'start^eS 

3'stop=283 

5 * prime r=CATGGTGTGCCTAAAGCAGG (residues 175 to 194 of SEQ ID NO: 287) 
Tm5=57.84 

3'primer=TGGCACAATCATGGGTTGA (residues 265 to 283 of SEQ ID NO: 287) 

Tm3=57.81 

primerScore-0 . 91 

allelel= 

probe 1=AATGGAGGCATCGAAGAA (residues 212 to 229 of SEQ ID NO: 287) 

probelstart=212 

probe lStop=229 

direct ionl=Reverse 

Tml=69 . 02 

scorel=l - 97 

length=109 
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C010 

Nucleotide 

Genomic Coordinates: 

Start: 184094 

Stop: 184931 (SEQ ID NO: 288) 

Comments: EST confirmation of transcript not predicted computationally. 

Tap^lan Primer/ Probe Sets: 

5'start=547 

5'stop=570 

3'start=655 

3'stop=678 

5'primer=AAGATGATGAAACCTGGTTCAAAA (residues 547 to 570 of SEQ ID KFO:288) 
Tm5=57 . 75 

3'primer-CGTCATGGAAGAAAAGGAATTGTA (residues 655 to 678 of SEQ ID NO:288) 

Tm3=58.24 

primerScore=0. 67 

allelel* 

probe 1 — CTCCTTGCAAGTGCGATA (residues 598 to 615 of SEQ ID NO: 288) 

probelstart=598 

probelStop=615 

direct ionl=Forward 

Tml=69.01 

scorel«l . 98 

length=132 
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C020 

Nucleotide 

Genomic Coordinates: 
Start: 60122 

Stop: 60341 (SEQ ID NO: 289) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer /Probe Sets: 

5'start«2 

5'stop=26 

3'start=118 

3'Stop=14 0 

5 ' primer=CAACCTCCACATCTCCTTCTAATTC (residues 2 to 26 of SEQ ID NO: 289) 
Tm5-58.27 

3 1 pr imer—AACGCTTTGTTTAACGTGCTTTT (residues 118 to 140 of SEQ ID NO: 289) 
Tm3=58 .18 
primerScore=0 . 61 
allelel= 

probe 1 =ATTCAAGAGGTCACAAACA (residues 8 0 to 98 of SEQ ID NO: 289) 

probe Is tart =80 

probe 1 Stop=98 

direct ionl=Forward 

Tml=68.75 

scorel=l. 50 

length=139 
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C028 

Nucleotide 

Genomic Coordinates: 

Start: 277009 

Stop: 277324 (SEQ ID NO: 290) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets : 

5 f start=293 

5 , stop=315 

3'start=384 

3*stop=406 

5 * p rimer =GGTGCAGTTTCATCCTTACCATT (residues 293 to 315 of SEQ ID NO: 290) 
Tm5=58.07 

3 'primer =TCGTCATCATAGTCGTCGTCAAC (residues 384 to 406 of SEQ ID NO: 290) 

Tm3=58.72 

primerScore=0 .86 

allelel= 

probe 1=TTCCACAACCACCACT AC (residues 320 to 337 of SEQ ID NO: 290) 

probelstart-320 

probelStop=337 

direct ionl=Forward 

Tml=68.94 

scorel=l . 94 

length=114 
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C036 

Nucleotide 

Genomic Coordinates: 

Start: 115067 (SEQ ID NO: 291) 

Stop: 115380 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=99 

5'stop=117 

3'start=158 

3'stop=177 

5 *primer=CCACTGCTGGACGCATCTC (residues 99 to 117 of SEQ ID NO: 291} 
Tm5=58. 94 

3 ' prime r=CGACGGACAGTGGAGCTCTT (residues 158 to 177 of SEQ ID NO: 291) 

Tm3=59.04 

primerScore=l - 25 

allelel- 

probe 1=CCATCAATGAAGAAGCGT (residues 131 to 148 of SEQ ID NO: 291) 

probelstart«131 

probelStop=148 

direct ionl=Forward 

Tml=68.97 

scorel-1 . 97 

length=79 
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C037 

Nucleotide 

Genomic Coordinates: 

Start: 186828 

Stop: 187047 (SEQ ID NO: 292) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5 f start«63 

5'stop-82 

3 f start=133 

3'stop=157 

5»primer=TTGACATCAGACCGACCCAG < residues 63 to 82 of SEQ ID NO: 292) 
Tm5»58.08 

3 ' primer^GGAATTTTAClCTTCCTCCATCTCA (residues 133 to 157 of SEQ ID NO: 292) 

Tm3=57.92 

primerScore=0 . 62 

allelel- 

probe 1=CACCCT*CTAAACTCGAGC (residues 88 to 105 of SEQ ID NO: 292) 

probelstart=88 

probelStop=105 

directionl^Forward 

Tml-69.49 

scorel=l . 50 

length=95 
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CO 19 

Nucleotide 

Genomic Coordinates: 

Start: 141541 

Stop: 141647 (SEQ ID NO: 293) 

Comments: EST confirmation of transcript not predicted computationally. 
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SEQUENCE LISTING 

<110> PE CORPORATION (NY) ; 

THE THIRD INSITITUTE OF OCEANOGRAPHY, STATE OCEANIC ADMINISTRATION , 
CHINA ; 

SINOGENOMAX CO. LTD. 

<120> PRIMARY NUCLEOTIDE SEQUENCE OF THE 

SHRIMP WHITE SPOT BACILLI FORM VIRUS (WSBV) , DISCOVERY 
SYSTEMS CONTAINING THIS SEQUENCE AND DETECTION KITS AND 
ANTIVIRAL TARGETS FOR DETECTION AND CONTROLLING SHRIMP VIRUS 
OUTBREAK AND SPREAD 

<130> CL000895-PCT 

<150> 99124717.5 
<151> 1999-11-24 

<160> 293 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 305107 
<212> DNA 
<213> SHRIMP 



<400> 1 

ggatccagac 

tgtcgtgagg 

taatttctca 

taggaatgat 

attatcaaca 

caggacggca 

ggactttttg 

tatatatgaa 

aaaaaatgtg 

cttaacagat 

agtgtgttga 

gctgaaaatt 

atatccctct 

ttgtttacat 

ttctttttta 

ttcctggttt 

atggtcaagt 

ttcttgttaa 

cccgccagta 

tcatgagaga 

cactgagata 

cgatcctttg 

aagtactgga 

gaggaagaag 

tggtacattt 

ctcaggtaca 

tcatggcttt 

atctgttcaa 

ttttgacaac 

cgccgctcat 

cttcttccat 

tgtctatatc 



atcaccccaa 
ataaaaactc 
tttttaaaat 
atgtccgctt 
ggatggaata 
acttgtcgtc 
caaacacctg 
tttcaagcta 
tttatttttc 
aatgcctatt 
tcctattttt 
cccttcctgt 
ttgtattatt 
ttagtgtgtg 
ccagttcaga 
gtaatgtgcc 
ttattatttc 
gttctatgtt 
tagcggcgta 
aaaaaacaga 
tattgtgagc 
aagccggcat 
tagcctttac 
ttcgcctgca 
tgaaggtgag 
tccacgctca 
aatcttgttg 
aactttccac 
tacaggatct 
gcctgtttct 
ttcttctcca 
atcatcatca 



caatcgcaga 
cttcatgtga 
acgtcaaacg 
tcttattaga 
gagaagctct 
tagcgcaatc 
ggtttgacta 
caatgtaaat 
cccaacctta 
tgtatcatcg 
cttcaaaact 
agcatttaga 
gtatactctt 
gtctccgtct 
atcggacgtt 
gtttacaaag 
agattcaaca 
ggttacaact 
aaccccccac 
tattaaagtt 
agcaactgcc 
cttgaaatga 
tgagttgaaa 
ctggcaattc 
agcacgtctt 
ccttttatac 
gttatccctc 
gactccattg 
ggttgaatat 
agtcttgaaa 
ctatctaaaa 
tct tccccaa 



cccgattaca 
tgagttgaat 
gtttatggcg 
tttgattttt 
gggaggaact 
ttattacccc 
caataaatct 
aattggttaa 
aacagatcat 
atgtcttcgt 
gttggtttga 
cctggagaca 
gcatacaatt 
ccaagtaaag 
ccttggccag 
ttaaagtggt 
gggtaggcgt 
atagatataa 
atgtgcattt 
tgttatattt 
gtttcagcat 
tgtgcagtat 
gtttcacggg 
acaaaggagt 
tcttcaacca 
caacgtctta 
gatctacgaa 
ggtccagagt 
atgcagatga 
ggagtgtgtc 
attcgtctat 
gagatgaaga 



gtagcagaag 
aaattcatga 
cagagagctt 
gaaaaattga 
actggggcta 
ctccctgaaa 
ttccctaaca 
taaaataaag 
tgccaggaga 
caaacgttgc 
aatataatac 
gtgaaacaga 
ctaattcttt 
atgtgaggat 
ctacatatac 
ttcctctagc 
ctttatcctt 
ccacgagtat 
tgaagaaagt 
tatttagtcg 
actcgaaaga 
tatttctgct 
aacagcttga 
gctgattatt 
aaggtgtttg 
cagttccttc 
gaatctaata 
ataatctgac 
tgcatcaaaa 
caaatcttcc 
atcattatca 
aggagatggg 



ttggaaaccc 
gattctctac 
catctaatag 
tagagttttt 
ttcttatgat 
ccgttggtat 
atgaaaggta 
gtatattttt 
aaatcgcata 
tccaaacaca 
aaatttgtta 
gtccactgtt 
agattcggct 
tatgtctcct 
ctttccatag 
aacaccgtta 
cttgtccaat 
caatgtcaaa 
tgtacaaagg 
tagaaatcac 
agaatagttg 
agttttgttg 
acagttgatg 
gttgttatga 
tctaatgtgg 
agatcttccg 
tctccctcag 
ctttctatgt 
tctacttctt 
tcttcttcgt 
tcaaagtcta 
gttcgtgtcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



WO 01/38351 PCT/US00/28888 



ggggaggtgg aggagtagga gtaggagtag gtggaggagt aggagtagga gtaggagggg 1980 
gtggaggtaa caaaccttct ccaggcacag ttcctgttcc tccacctcct gtattggacc 2040 
cagtatttac actcggttgt tcaaattcaa aaacttcctc ctcttctttt ccgtcttcta 2100 
cagcaccttc ctcgtctctt ccgatcggcc tattttcagg ttcccttcgc cctaatacag 2160 
ctccttcggt gtcatcttct ttttccttct ctttatcttt ctttgttttt ttattggatg 2220 
ttgtagtagt tttatttggt cgtgacatga aaatcacacc aaccaccaca cccacaaaac 2280 
acacaaaaag tatgagagat aataaaagga acatcattat tatccttatt tttttacatt. 2340 
caattatggc accagaaaaa taaaatgatt tgtacaaaca cgagttaccc ctccacacgc 24 00 
ctgacctctg ccgcgaattc ttcataccct tccatattct ttgctgcaaa ggtcacgtca 24 60 
tgttgatgag ctaccatttc atatcgcata tctcgccaaa aagatccatc cgcaacagcg 2520 
gccgtgcatt cttcttcaat ggcagaagtc atgtcagctg tcattttttt aataagatca 2580 
cgtagtattt cctccctata caaatctata accttctgcc taaagacagg cctctctgct 264 0 
aaaagagaag gattttctgt aagctggtcc atggtgtaat aaagtgttca tttatctaac 27 00 
ttttaaaacc caaactcttc taaaatatct acagattcca ggtactctgc cgccattgtt 27 60 
gtaatcattg tatttgcctt gtctatcact gcttgtcttt tatcttccac atctcccacc 2820 
tcgtcttcgt gttttttcct ttcttcttct agaagccgtc taaagacagc ttctctaatt 2880 
gaaggtatta gcccctcaac tacctgagaa gcttcctgtt ttacatcatt cacaaattcg 2940 
ggcgattgta gtacgagcgt gatcgcgctc ttgtagatgt cttccaaaat gtccattttt 3000 
gttgtttaga cctagataga gagtgggaga cgctattaat ttttttgttc atgtggctca 3060 
tgacctcatg tccgatatat gtattggttg ggtgtaatac aagtctctac aaagcaatag 3120 
aaggggtaaa aacaaaacac accatggtgc taactctttc ctgtacgact aggcgcgtag 3180 
cgtctagcaa ggggaatttc tctaaggaag atgcggtgtt ggggaaccag ttccccattt 3240 
taaagaaatc aaacaacttg tcaattgcaa gacctccctc aatagaatct ttttctgcat 3300 
cagtggaaaa aatattcagg gaatggaacg aaagtggggg agaaaaaatt ttcgacatat 3360 
ctcagaatga agaagaatgg atggatatca tatccttagt ggaaagtgta tatgaacctg 3420 
tattttctaa atcacttaaa cctgataaat tggcagataa aacatgtcta accgcggctg 34 80 
cctttgcagc actagcttct gccgtggatg aaaaattgac aatcttatca ggtagtgatg 3540 
ggagtgtgct tcaacgtaca acaaaggtta tgaaaaagga ccccaaaaaa atagcagaat 3600 
ctcttttaaa taatgaaaaa tggacatcta ttttattgga caggttaaaa acagccaaga 3660 
aacttctaag cagacgaggt gcactgaaaa gcgccgaaag agtagaagta cttcatcggt 3720 
tgaataaact caaggaggct cctcttcccc accatccgag cctatttgat aattttagtg 3780 
gaggaaaaac atcagcagta tctgctggaa cagtcatcgc atcagatatg catttcaaat 3840 
tggttgaaca tatttttaag gtctccttta gaaaatgggg tccctgtgga gataaaactg 3900 
aaagcgggga agaagaagat gaggaagaag aagaagaaga aaagaaacat tccatatcaa 3960 
gattcgtgct tcaatttatg aacggacaca acgggcaaca ttatcatagg cccgaaagtg 4 020 
cttctgttta cttttgtgat tattatgact atttggccta caggaatctc cctaatgagt 4080 
acaaattatc gtcaatgcat cctggcacat tcaatatgga ggatttacct ttccgccctt 4140 
tcgcagtacc ttcaacttat aagacagaat tagagtacaa aaggtttgtg caatcaacaa 4200 
atcttcccca gctaagtttc gactatgggg agtttttatg ttactgtatc ttcggagcag 4260 
attggtacaa acacctgggg gatgtggtag attctctaga aaatagttcc atgatatcat 4320 
ttgattctca gacattgagt ggtgtgtata agaacactgc taattacaaa aggttgggga 4380 
agaaaagaaa cggaatagcc gatttggccg ttaggagtat ggcagaattt atccgcactg 4440 
aagcgcataa ggcattgaca gcagaagaga tggaagaaga agaagaggaa gaagaagcgg 4500 
aagaagaagc gatggaccag gagcctgcag aagtagactt cctttcagtg cctcatttac 4 560 
gccgtaaaat tcgtcaagct gtttctgtgt taaataactt tgtggagaac gatctttcta 4 620 
tattggtttc taacttcaag aatgtgttaa ccgatgatac tgtatcagga acagatacgg 4 680 
acaactttgg ttctagtgga gaatttgaag cattatcttc ccatttattc ctttcaagaa 4740 
tattggatga agtgcacatt cttaggaata ctgatataca aagaacccta ttttcaacgc 4800 
acgtgtctct gtcggataaa tctcccccta gccgtgtccg tggaagcaat gtcaacttta 4 860 
ataataacgc tgggaacatt tcttccctgc aaacgtatgg cggtatagaa gagttgcctg 4 920 
aaaatgtact agtcggtttg tccggaggat ttgaagatac cgacatgtat tccggagagg 4 980 
atgttgttgt cgtatgggat ggttgtgatg gaggaaaagt gctaagtgtc accttcaatt 5040 
gtggtgataa ttttatccag ctccatgaaa aaacagcaga aacttttaag gatgatacgg 5100 
atttagttga acgaataaga gatgtgcttc agactgcaag taagaccgga aaccttaaca 5160 
aaaaagcata ttcaaggaag aacatctacg ctgttttgcg tgaaaatggc attgagcgcc 5220 
ccggggacga ttttacagaa aaggggattg ctctcaagga taaaacaaat caaccccctc 5280 
cccctgcaag aagtgccaag ataacggttg aaggagtcaa gggatttttc agcggttttc 534 0 
gtgacatttt ggagacgagg gcgctcacca catatagtgc agaaaccttc agagatttag 5400 
gccaaggcat agtaaaagag accgaaggac tgacagctgc aacagtggca gaaacatcct 54 60 
tctctgaagg tttagctgaa agtttaaggt ctgatgcgaa tctaggtcta gaattttcag 5520 
aggacgccaa aacggttgta ttcaaaaatg acacctctcg ttctttattg gaagaaacta 5580 
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gggcattaag 
ttagtgccga 
cccttgaaca 
tgaagaaact 
aaattaacga 
atgcagttac 
tggtggccgg 
atctcaacgt 
tttcttgtgc 
aaatagacca 
tccctgaaga 
aagatgctgc 
tgccttgggt 
ggatgtccac 
aggctatttt 
ctgtaataaa 
cacgcctcaa 
tcatactgat 
caaagaattc 
caaaaatagt 
aaactgggcc 
tccctgccac 
tcaacgactt 
acacattaaa 
taaaaattgc 
tataagtttc 
aaacagattg 
atggctccga 
gtccaaattc 
aaaagtgggc 
tgtgcacatc 
ccgtttttga 
aagtggagta 
ttattgctag 
ttacaatttt 
agtacgtgat 
aatattttgt 
atcatcacga 
ggaacctaaa 
agcagctggg 
aaccccgatc 
tagaactggg 
aactggctcc 
atataacagt 
gtccctttcg 
ttcggctgct 
ggaattggga 
gtagtccagt 
catgtttgag 
agaggaggta 
atagaagtta 
ggagacatct 
tgtttcttgt 
cgattacggt 
ttgataaatc 
gaagatcgta 
tcctctccaa 
tcttcttcct 
gaagaagatg 
tcctcgacat 
tgccttgtgt 



agcaaacaat 

tttagatgct 

aaacttgaaa 

aaagaaaata 

cactgaactg 

agtgttgatg 

gtttttaggt 

cgtggaccac 

agatagaaac 

tgttatcgca 

cggaggaccc 

tcaaggagaa 

ggcaagcatg 

tttaaaggca 

tgaggttgca 

taaccccttg 

tccatccaac 

agttcgcttc 

agctaaaagg 

attggcagaa 

catgagatta 

aacaaattat 

ttatacaaaa 

tgtatatttt 

attgttttat 

tgatgacgtt 

cattccgtgt 

catatcttgt 

tcctctccca 

ccagccagcc 

aacatatgcc 

caagtaagtg 

tttttcttgt 

aataaagaaa 

gttccacatg 

agaattgcag 

ctagattgaa 

ggcaaaagat 

agtttccaca 

aaagttgtcc 

ttgttgcttt 

gtattcttag 

ttttcagtag 

tggcgaataa 

attagttggc 

ggtgctgtcg 

ggaggagctg 

tctaatacat 

gagggatggt 

ggagcaggcc 

tttttatccc 

ttttaataaa 

ctgttggttt 

gcctcttctt 

tatcaacgta 

gaaagtttgc 

gaagagaatc 

catcgcttga 

tagaagtaga 

catcttcttc 

ttgttacagt 



acttcttttt 
gaatttgctg 
gatctcgaca 
gacagttatt 
tcaaaggcta 
aacaactttg 
ccagccgctg 
accagcccta 
accggatggg 
ctagatgcat 
aaatcgaaat 
tgtggatctt 
aaagatttgc 
gtttcttccg 
gaggacgccg 
ttcatatttg 
tggaaaactg 
tttgcagggt 
aaacagactg 
aaggacaacg 
gaggagcttc 
tccaaatctg 
ataataaaca 
atatattttt 
tcctatacac 
tttccatcac 
ccatcaacat 
actgtcgatt 
taacggatgc 
tcgtcagtac 
ataaaatcgg 
actggcaact 
ttagaagcat 
acaccaacaa 
gtcataattt 
tgataggcac 
gcattgcagc 
tgtcctttga 
ttatagcaat 
atattaattg 
ggcagatgtt 
ccagtttgac 
agttggtttt 
tggagtcaac 
acacgtcttt 
cagacgcagt 
caacagataa 
caaaatcctg 
tgttgatact 
ctttttatag 
tgatattata 
tttagggagt 
tttgagtctt 
ccttttattc 
cttttttatt 
atcagcagat 
gacgttagtg 
agggatggcc 
agccatttct 
ttcaatctta 
gctagaccat 



cgtcgtttgc 
cagagatgag 
aattcgaaga 
tgacagagaa 
cagattcagt 
gaaaggttac 
tcgccctggt 
aaggtgtcat 
ctaagccaac 
cattcttaac 
ataaggccta 
gggcaacatt 
ctaaaggaca 
ttcttttgtc 
tggtggggtt 
gagtgcctct 
gcctcattgt 
cgggcccgct 
aacaattcga 
ccaatagtaa 
ctgggcatga 
ccaagattct 
cagacatcat 
attttagtaa 
caacaataga 
ccattctgat 
ccgtgttctt 
ggatagtttc 
gacgataagc 
acatggggtg 
aatgggtcaa 
gtttggcagt 
gtcgtatata 
ccataggcca 
ctcagaaaag 
aggcggatat 
aatctcctgt 
tccttttgat 
gtgataaaag 
cccactttgt 
cgccatccaa 
tgcctcccct 
tatgtttttc 
ttcaatagta 
atctagcctc 
tgctattagt 
catggcagta 
ttcttcccct 
acccgagctc 
ccctaaaggg 
caagcagaga 
cttctccaag 
ttgttgttta 
ctgtcagcaa 
agcctgtttc 
gtggattctg 
tctgattcag 
gaagacgtgg 
ttgtttgttg 
ttcagtgcac 
aaagccctcc 



aagggacatg 
agaaacatac 
gactatacca 
tccagaaagg 
attggggaag 
aattgtagta 
gcatgcgtcc 
cagttataaa 
caagcacccg 
tgaaaatgga 
cgcaccaatc 
cgacgacccg 
atccctctcc 
cataggaaag 
ggcgagcaag 
tggatttggt 
attttcaata 
aaccctgaat 
agacggggga 
acttcaatcg 
agatttgcgc 
gggctacaaa 
aaaaatggat 
caataaagat 
tatagactta 
tttattccta 
cttgaagatg 
agctagggaa 
ttcagcagaa 
cttcagagtt 
gtgcatgtcc 
aaattgttca 
gagagcgaca 
attcattcgt 
ttcctcaaca 
tccgacggtc 
ttagcatgga 
aaagcttcga 
ctgtccaacg 
aacgcagagg 
tctagttttg 
ttcggttctt 
tctgatgcag 
tcactcttca 
atcctgttag 
ccatcagatg 
atatcatgga 
tcctccattt 
atgccgtaca 
ccattatcga 
aaaatttcag 
ttttcttctt 
gcttatgtat 
gtaaaatatc 
tggtcgtgga 
tatctgaatc 
agtcatattc 
gagaagaaga 
ttgttacctg 
ggttcaattg 
tggaagcagc 



ggcgtccaag 
cccgatgcag 
gaaagtcaag 
gctggcaaag 
aaactaggca 
ggggcttctg 
agaggggcac 
attgtggact 
ttcagggaag 
gcatatgtat 
tgtggaacaa 
cattctgtat 
tgcgataaag 
gatgttgcag 
gcaatttcag 
atagctgcta 
ctactagtgg 
tggtttggtg 
ggaaatcgtt 
aggaggaatg 
ccagttttct 
tctaaaccct 
aggtaaaata 
tatacatgta 
tcactaagat 
ttcttctcca 
tgggttatat 
tgtttttctg 
gccagcaggc 
tttgttctat 
tttaatttac 
ttgatatgag 
gctaaggcag 
ttagtagtta 
ggggaattat 
ctcttccgag 
caacctgtat 
tgtttataaa 
aagcatatac 
ctaaattttc 
ttatcgtatc 
tggggaagta 
attctccaaa 
ctaattctat 
agtttcctgt 
cgtccagtat 
tgattttgga 
tgttggtgtc 
ccgaatgacg 
cagtactatt 
tctacttgca 
attggttttc 
ttggggttgc 
cttttcgtac 
attccgtgaa 
ttcttcctcc 
ttcttcttct 
ggaggatgag 
cccatgttcc 
ctgctgataa 
cgatgctcca 
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gggtccattt 
tttactttca 
agtcgtacag 
tttgtatcaa 
cattaacaaa 
tgtccacatc 
tttctccttc 
cttcgtcgtc 
cgtcatttat 
ctggcttctt 
aaaggccatc 
acataatctt 
tgaaaccacg 
tgtgggaatg 
ccaaactctg 
caggtacatc 
ttctcaccgt 
ttgatagagc 
cattgtcggg 
tagagaaaac 
aaggtgcatc 
aagtgtctat 
cacaatcaga 
cggatttaac 
gcccaagatc 
gcatcgcgag 
atgcaacccc 
cctttgtcat 
cttgatcgat 
atctatcggc 
atcttctctc 
cgtttctgtg 
aaacgcaatg 
cggtattgcc 
taatacattt 
tcgttatacg 
attgtgtgat 
ctgaaggcac 
agattgtgaa 
ttgtattagc 
cactttcatc 
ccccagtttg 
tcactgcatc 
cagaagtact 
caaatacttc 
cagttttacc 
ccatatcaat 
aaagcgcatc 
ttgttttccc 
cgatattttc 
acggggaatt 
cttggtgttc 
aatcgatgag 
tttgaagagg 
tcctactaac 
tcacgtactc 
cttgtatgct 
cagcattgct 
tctctaagaa 
taactcggcc 
tcattgcact 



ctgatcgcgt 
gccagccgtg 
gcaaaatttg 
agaaccttcc 
aacactattt 
gtgcccgatt 
ttcttccccc 
gtcttcttct 
tatatgtctc 
cctgaccttt 
atacctatca 
agactctttt 
ggacaatttc 
tgctagtgca 
tataacatta 
taaacattca 
cactagttga 
taacgttgta 
aacatcttca 
atccatcttt 
actttctatt 
cagggcagat 
tttaaagtgg 
aatggacaat 
gtctgatgat 
gacaattact 
tcctctattt 
actggctgtg 
aaattgttcc 
atatgttcct 
tgcgggcgag 
agtcctaccg 
gtaacgttta 
tttaggtccc 
gtttgtgttg 
cgaacacaat 
tgaatctata 
aagcataacg 
agctgaagcc 
aacaggcgat 
cttggcgatc 
ctctagtgac 
ggtgacagaa 
tgctgtatta 
accccatccg 
tttggaaggt 
gctgcttatg 
ctttttctgt 
atcatgtgcg 
cattacttca 
aatctctgct 
tcgtataatc 
tacttgtcta 
ggaggcttta 
caattgtcct 
gctgtgctca 
cgtgtaggct 
ctggaatggc 
aatacggaac 
cttattattg 
cttgaaccta 



acatctaacc 
ttagtcatct 
ggtaaacctt 
gcacctatta 
tctttagtga 
gtgttatttt 
cactcccttt 
tcttcctccc 
ctaaaattct 
aagaaacttg 
tctctggcca 
ttaggtgaga 
ttagccaaac 
gcattatgta 
gttgccgtca 
ctgataggaa 
tgtgcatctt 
gcattttctt 
tcaaagtcca 
ctattcctag 
ttatttgtct 
gctgcatcac 
actaaatgca 
atttgattat 
gtgattttat 
tcagtgaaga 
gtcctcatca 
acaatctgta 
ctcataactg 
gcctttgaat 
tctgtagaga 
atagtctgta 
gcgtacatgg 
agcattataa 
gcagttttag 
ttcctgtttg 
gggttgcctg 
tgagctgggg 
atcatgactg 
gaatcgaatg 
gtttgtaaga 
atgaccatct 
gttgccttga 
atgaagtatt 
tcttcttcat 
ttttcctttt 
tcaacttccg 
gagtaggttt 
acatccctat 
tataaatcat 
actaattttt 
tttgtgaagg 
gcgtttaaac 
caattagtag 
ctatttctta 
gccatagcat 
ggcgcacttt 
gtagcgctgg 
gcatcagctt 
gatgatttca 
ttaataaggt 



ttagcaataa 
acggaatggt 
caaactcgcc 
aatttccaac 
gtgttagttt 
tatcattgat 
ctgcttgttc 
cgccaaacct 
tgtccaataa 
catccatcat 
tgatcgcttt 
aaagttggtt 
ccagaagatg 
tagtaaaaag 
ctacaggtat 
tcactccctc 
ttatggctct 
cttgcaaagg 
tgccgttatt 
gaccagaagg 
tggaagtaac 
gcactcgctt 
gacacaaatt 
gaggaagtac 
tttgggcacc 
taccattttc 
tcatctttga 
cgatattccc 
cggcagcaat 
ctttgatcct 
atatgaggaa 
ggaaagcaat 
agttattaga 
cgtcgacttc 
tgttgttttt 
ttatttcagc 
ctagcgccat 
aagttcttct 
cacggcacaa 
tccccacgtc 
tgcgtgtagt 
tttttgattc 
tattcaatag 
gtctcctcaa 
tagcaccgtc 
tcttgatagc 
gagacgttat 
ttatagcatc 
cgcgcacaac 
catcatttgc 
caacatcttc 
cccttctgac 
agtacgaagc 
tagagcagtc 
aaagtttaat 
ctggagtagc 
tccttagtat 
ccataaccgt 
gagataatag 
tgcgcttcac 
ctgacactga 



ccagaaatac 
gtctttgtct 
tgtgcagtta 
cataaaagag 
ctttctattg 
gacagaaatg 
ctcctcctcg 
ctccctcgtt 
ttttgtaatg 
gctgttcatt 
gacgctcaag 
ttggaactgt 
agaacctccg 
caaaccagga 
gagatttgtc 
tttctctcct 
atcataatca 
taaacaccct 
gttattatat 
gagcattaga 
ttcgttattt 
ctcgcgtggg 
agaaaaaacg 
agacgtgaaa 
tataactact 
taacgtcaaa 
aaaatgttcc 
catagtcttt 
atctatggaa 
cttatccaaa 
ctgagggact 
ggcgttataa 
agagtcatgt 
tttcgtgttg 
aaccagaaac 
gttactatcc 
gagcggttcc 
attggtttta 
aagtctatag 
tacgatgccg 
caaattcttg 
gtttgttgtc 
tgcaccttta 
cttttcaaaa 
gtcataatct 
atcgtataaa 
tattcctcct 
ttgtactata 
caccacctta 
agcacgtttc 
ctcttccaat 
atgtcctcct 
caattcgtct 
gactccagcc 
aacctgttcc 
actggacact 
gtggtcaatc 
gaacgtatca 
tttcaagtag 
tttaaagttt 
agggtttgcc 



gtgctttttt 
ttagatgagg 
tcaaagcata 
tctatgtgtt 
cacagtatta 
tcgtaagcac 
tcttcaaatt 
tcttcctctt 
aaaacacact 
ctcatatcac 
aatatatcgt 
ctaaaattaa 
tacccttctc 
ttttctttag 
atatctaatt 
tgaccaacaa 
tgggggaagt 
tcgtcttcat 
gccattatat 
gcagttaatg 
ttcgtattca 
agaagattgt 
aattgagatg 
gaaagagcat 
agcgctatgt 
ccttctacga 
atcaagtgta 
aggactaacc 
ttattactgt 
ctatcaaaaa 
gaaagttgac 
ggtacatcta 
aaagagagac 
ttgaatgcac 
aatcccctat 
tcccctattg 
gtgtcaggga 
ttctttaagg 
cctgagaata 
tattttgcgt 
agaaaagaat 
cttaaacgct 
catgctgcca 
cactcgtcgt 
tcgttaaaat 
tcttgagcaa 
cctccattag 
cttcttagtg 
aaacgcctgt 
ctctttctgg 
attcctcctt 
actttttcag 
atagcgtatt 
atgctaatac 
agaaacactg 
tctttgaagg 
atgtgcaaat 
tactttaaga 
ttggacattg 
ctgtcgtcac 
cttttttgta 
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tgttgcgcaa 
agtcactaat 
caatgggcat 
attccagatt 
agaaatctcc 
tgaaaaatct 
tcatggaccc 
gtagttttct 
gcatgccctg 
ggtgcctttt 
aaattgatgg 
tcatgacgaa 
atcctccact 
ctataatgga 
gcactgctag 
catcggttac 
atgaaggttc 
aaagtttgtg 
gttgccttaa 
cggatcaaaa 
ttcccatgct 
tctataagcc 
cttcttatgc 
tccttcgggg 
taccgggcca 
gtcaatatcc 
gggagttaca 
gtggtctact 
ggggaaagcg 
ttttcctttt 
tccgtatgaa 
gatatgtctt 
tatttcactt 
tctcctctca 
acctgtagaa 
gggtttattt 
cgaaaccatg 
ttgaggtatg 
gaatggccta 
taacaatgaa 
gctaggtatt 
cgtgatgatg 
tttaggtctt 
catggcagga 
tgagtcttgg 
aaacgaaggg 
tctgaatttc 
cttattatta 
aattttgaat 
gaattctgta 
agatgaagga 
tgttcctccc 
atctgtcaga 
ggttggggat 
atcttctcca 
aactgctctc 
gttggaacag 
tgatgtagat 
gatatcatgt 
cttcaaattg 
agatttttcg 



aactctagta 
attttgagtc 
aaccttgttg 
tgtgcgtaaa 
tattgatctt 
aatatccacc 
acttttcccg 
ccatgaaaat 
ttgacagctg 
ccacacgcca 
ccctacgtgt 
ggctgccaat 
cgttttcttg 
catgttcttc 
tgcctgcaca 
atttttgcag 
tgggtgggag 
agccatggta 
gaagactctt 
atttggtata 
aaagagaacc 
tcagaagtat 
ccccactcga 
gttgaggaaa 
ctcttggggt 
actctgtact 
ttggagcaat 
gaaaaaatat 
tactgcaaag 
aggtaatcta 
cgcattaata 
agaaattctt 
gtcctcatcc 
tcgttaaaat 
ttgtactctg 
tgactcaaaa 
cgactatttc 
tttccggcca 
tgctgccccg 
gacgggcgga 
ggtacaaaga 
ggcgcttggg 
tgctggtgaa 
acagtcccca 
ggattattaa 
tctatgttgg 
atatcctttg 
cagaaacagg 
tccgccaagg 
tgattccgta 
ggtggtggtc 
gaacctccga 
acgactatag 
gtttctatgt 
gatttatgtg 
gaggtataac 
tacccgccag 
ggaggaggag 
acaccaaata 
accatccttt 
atggtggcct 



tcttcttcct 
ttgtggaact 
gaatcgaggt 
tcagagtacg 
gaagaacgct 
gagtctttag 
tcgctcttga 
ttaggatcac 
ttgaaaagag 
ctgttgatgg 
ttttgtacat 
tctcttgttt 
tccactttaa 
ttccaccctc 
gaattaatgt 
aagcccaccg 
agggtgccaa 
attcaaccag 
gtgtactttt 
ctggttagct 
aggagttacg 
ccacaccgtc 
cgagtgactt 
tgtttgacgc 
tatgatccaa 
cggatgagga 
caaaatagct 
cgtcggtagc 
gattggacac 
aatcaaacat 
acccctcttt 
tcctcttcat 
ctctcgcgtg 
tacgttcaaa 
gagctggagt 
gtgtagagaa 
gcaaacattc 
aaacattagc 
aggaaggttt 
tgattacatg 
gcacattctt 
ccagtgcaat 
tctggtcccc 
ctaacggtgt 
cgcccggtac 
gcatcataac 
ttacatcttg 
tgatggggtt 
aagaagattg 
tttgttgttt 
ctttgtgggt 
aatatgttgg 
atgtgtgcac 
ggtcaccggg 
tgataaatgt 
acacagatgt 
tattgtaggc 
gaggagcgtc 
aagaatcgta 
gtctaaactt 
cactcaatct 



cccaagtctt 
cgtcacataa 
ctgttcctcc 
tcataaataa 
tactatgaaa 
cagacgatac 
aagacgttgg 
gaagacctct 
gcttagagca 
ccgtaattac 
ctaagagtgc 
ttcctacacc 
aattcctttc 
tccctttgta 
ccatcggccg 
accggaatgt 
tctctttagc 
aaaatcaacc 
tttacgatac 
gctatttttg 
tacagccata 
cctgaaccac 
cttgaataga 
atcaaacgct 
acgaaacgat 
agatgatgtt 
ggggattgtt 
tccttcttcg 
tttcaatgta 
atctcgcgct 
caatacaaaa 
taaactatcc 
tttctttatt 
aaagtccctt 
acctccaggg 
taaaagttca 
tgtggccaac 
tgctggctca 
gttcaggacg 
gggaatattt 
agaagtattt 
tttcttggca 
tcttttagat 
cctgtcatcg 
acacttcatt 
acaggggttt 
agaaagaagg 
tttgtataaa 
acgcaatttg 
ctttaataca 
tgaaggattg 
gtatttaggt 
agaacactct 
cacgtttcct 
tcctccagca 
gcaagtttgt 
caaaatatcc 
accacttacg 
caattcagta 
tttccccgat 
atctatatgg 



tgccagttcc 
taatgctgtg 
tgttataaat 
aatggtgggc 
ttccttaaca 
agaattttcc 
tttattagca 
agaatttgta 
cgtcaaccat 
cttgtccgaa 
cttttcttgt 
agtaccatct 
gaccttatct 
gaattgaatc 
aggaagaaat 
ttcactctca 
atcttcttcc 
aatatatgtt 
ttgaaacgag 
gcaaaataat 
ctagtctcta 
gaagtttctt 
gagaggggtg 
agacgagggt 
tgcaaacccc 
ccgaacaatt 
tcgctgtcaa 
ggaagaacag 
ctggaagttt 
agcacagtag 
gtatcctcag 
caagaacttg 
acgtattctg 
atttcttcca 
ggagttctga 
gatccgccgt 
atactatgtt 
atgtaaaagg 
ggagctaaac 
tctagaggag 
gttgcatcca 
gtagtttcgt 
tggtcggcca 
cctattattg 
gtgctcacat 
tccatacatt 
tcgttattta 
aattcaggat 
gagatgtaag 
tcataataga 
aaagcgcaag 
ctgcattgcc 
ttaactacac 
gaccaacaat 
acaaatggtg 
ccctctttca 
acatcttcat 
cctgatgcac 
tcgtcttttt 
acagcatcta 
tgttcaggca 



tttctaaaag 
acaaagttgt 
ttcagtacaa 
gtttctgtta 
tccctgagag 
acatctactg 
caaggcactt 
accacttctc 
ataaaaaagg 
tcttgaccaa 
agtatgacac 
ccgatgataa 
ttcacaaggt 
agggctagac 
ttggggtagg 
taaagagcga 
tccagaaata 
gtaacaagaa 
tacgcctgca 
agtttttatt 
caagtaatcg 
gcacgtccgc 
ttattgtagc 
ctgactgtac 
acgcatcgtc 
tttcacttcc 
atatcctatt 
agtaggatgt 
ttgatgcagg 
cagcataagg 
atttttttat 
atttgagata 
tctgagacaa 
gcctttggtc 
tataagagga 
gaacactacc 
catagggttt 
gtgcagtaca 
aacttctatg 
gtaaaacagt 
ggtttaaagg 
cgatacttga 
agaaaattcc 
catgcctgtt 
caaacgtaac 
catacatacc 
gtacatcctt 
cgttgcattt 
aagagttgcg 
ttggtgtcga 
cagtcatcga 
agcccccatc 
gttctccagt 
aaggtttagg 
tcatcacacg 
aaggaactgg 
catgtccttc 
tatggtgatg 
ctatattatc 
gatatttgtt 
cgagaagaga 
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tgaggatcta tttgcagcta aatcatgact 
tgacggaacg ttccatttat tccctttaac 
gtgaggtggg taattgcttg catcttccac 
ttcttctagt gtcttttcag ggttagctcc 
taattcgtgt atagcttgaa aattaacaga 
agcgcagact agcgcaaaaa tacagcctaa 
cgcggtgaag cgcatacttc ggttggtagg 
catcgttcct gatattacaa tggatatctt 
tttagggtct tataagacgc agactgtgaa 
gggcgcgcct accaacgctg attttacacg 
tcttgttaac cctggagctc cttccgatag 
tgactctttt gtgtacaact acaaggatgc 
tccctttaac gagccagctc ttcatgagca 
aattaacaga tctcgttttg tgtgtaaccc 
tcgctattct ggaggaagta ctccccagaa 
aaattggttg gtcatactta gaaatgctgc 
agacgccatc gaacgcattc ctgatctaag 
gaatgcaatg aggagacttc gtgtatggcg 
atcagcgggc atgattcgct acgaaccctt 
ggtaagaaga aggcaactaa aaggaagtta 
tttctgtaaa gaatcaactc ttctggctga 
ctggacgatc gttgaagctg tcatccggta 
tgaaactaat gccatccctg aagactcttc 
aaaaaaccaa cgcgacacaa atcaaggaga 
tgccatcaaa gaacatgaaa gcataggaga 
taaaacatgt ctaacagagg agcaacgaga 
gaatttgtcc ccggcccatc taacaaatct 
agcaggcata tgggttatta tgcagagtct 
tttaataggg tacgaagctc aacgactttt 
tgctcttttt atatctcaga ggggaattgg 
agaagtgatg aaagaaagag ctgcaaatgc 
ctacaagaac aatactaacg agtctaataa 
tacaatttat ggtcaaatgt ctgattgtaa 
acctctccat aaatcgagga tgaacatgca 
ggatatgggt ttggcaaaga tacgtgtagc 
acatagaaga aaaatgcaag aaaaggcagc 
actaagaggt gaagaagaag aagaagaaga 
agaagaggaa gaagaagctg gaacatcagg 
agaagatgat ggtgaggagg aggaggagga 
gggtgaagat atgaatgggg aaaataactc 
ctccactcaa caacctcccc aaaagcgtca 
gggaaaaagg ttaaaggaaa gaga taa tat 
cgatgaccgc caagtgaatg tggagagcat 
atatgttaaa ggtggaaaat gtggagataa 
tacaaatcta ctagattcag tgttccattt 
cgcaaatagg cgatcagaaa ctggcgtata 
ctatgaaaga aacgtatcaa aagacagtag 
tgagcgggca agagaaaggg atgcttcctg 
agactctaac aaccctgaag atgtggaaca 
ttcattatca ggggtagaag attctctcgc 
attcacaagt cttgtcacca atccaatttt 
tcttggaaga tttatcgtat taaataacat 
tctagtggac ggagatatgc ctatgcttct 
agaaaagggt acagtgaaaa acacgagaaa 
attgaacgtg gccactgctc aatctgcagg 
agaacctgga atcaagcaaa tgttactccc 
catggaaaag ggtcgaggag ggagatcggc 
ttgcaagatg ctgaagtgtt tattcttcaa 
tattgaccct gcgtcacgtg ccaccttatt 
aaaatacaag aacattgact gggtaaagga 
taaatgggtg ggaactggag aatacactaa 



ttcgaaccga gaattccttt ccttcgtttc 16620 
agtcaaattc atcatgtcca ggagagcggg 16680 
cccgtcatgc ctacctctgg cttctaggta 16740 
ttgaactgct ttcaatttac ttttcatccc 16800 
cattggtttg gctacattat aggccattgc 16860 
cacgaacacg gtcacgaata acgttatttg 16920 
gtaaagtaac ggatctcttc gtcgttcatc 16980 
cctagataaa tctcatacag gtgccaccct 17040 
aaataattct acgttaaaaa agcttaaaat 17100 
cacggtatca ggagtagctt cttctctcta 17160 
agaaaagtta gttttggcct cttcttattc 17220 
agtggtgacc gctgaggctc ccaagtggtg 17280 
catcatgaac agacttgaaa aagctggtct 17340 
tgttaaatcg gctggagaag tatgcggatt 17400 
cttaattttc ccgattggag catcagagat 17460 
tcgttttggt acagtggcag catcggccaa 17520 
agaaggtggt acaagtaaac atgtggcaaa 17580 
agcctttaac tggatagcgg aagcctccag 17640 
ttcggttggc tgtgctctat acgatcacga 17700 
tgagcgaaat gtactatttc ttggtgacca 17760 
tatttcaaga gggggaagaa gttctgattt 17820 
caagaataga catgctcgaa caatcagtaa 17880 
tataaacttg gagtgggagg atgttctagt 17940 
tgattctacg ttagaaaaga ccttagaagc 18000 
aaaaaggaaa aaacatatcc tagagtttat 18060 
aatgattttt aagggagttg gaggaaaagg 18120 
ggccgatgcg atactggcta ataatgccaa 18180 
tttaaagcaa atcaatttct ctatactcca 18240 
aatgttcaag ttgtatatgc ctgcattact 18300 
tgacgtgttt ctgaatggtg tgtttaacct 18360 
aaaaataaga gacatggttt ctcgagacgc 18420 
tttgggcaat tattcccaat tcgatatcgc 18480 
tgcaattcct ctatctatta atattggagt 18540 
agacattgaa aagaccatac aacacatagt 18600 
ggaagaaaat agagtagcta gacgtctttt 18660 
cagggagcga gcagctgcga gacagagaag 18720 
ggaggaggag gaagagggag aagaaatgga 18780 
tgttaatggt tctggttatg atcaagaaga 1884 0 
agaggaagaa gaggatgaag aagatagtga 18900 
aagaaaacgt aaaaacactg gtaatacttc 18960 
gcgaggtaaa aatgcgccca tttcaaccaa 19020 
tggaggattt ttattagcca caattcaaaa 19080 
acaaaaactt ttgacggcca ggcaaagaaa 19140 
tggactacca gaattgttaa ttgaaaaggt 19200 
cagaaagggg tctattctta acagcataca 192 60 
caccacaaag gcaaactgta tttgtgatta 19320 
taataataca ccccattcat ctgaatgtat 19380 
tgccgaatct aacaaacgcc cttgtcctgt 1944 0 
acgtatgcgg gaattgatca tggaccctcc 19500 
tatcgaaaga gtacttcaaa atgagatatt 19560 
caatgccgtg ttaggtgctg aaaaaggaga 19620 
tgtaaaattt atgaatatga ctattgcttg 19680 
agactcaaga ggcaagacaa aaaaccttct 1974 0 
atttttcaaa ccaaatatga ctgcagcaga 19800 
acaccaatac atgaacgctg gtcattgtcc 19860 
tgattgtata atgaaactga aatccatcgc 19920 
ccttcacaga cagaaatgtg accatgcctt 19980 
tattgaccct tcaaatgctg cagatacatt 20040 
taggcttgat gatctttgta gggataggaa 20100 
tttactagac cctgtaatga agggaacaaa 20160 
cattggacgt gactcaaacg tggccgcccc 20220 
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tgttgatttt 
ttcacggaaa 
cgagagcagg 
gttggaagtc 
gggaggagaa 
agacgttcaa 
agaacaacat 
tgcttcatta 
ttctgaggag 
attattggat 
cagtgtcacc 
ttactgacac 
atccaagcgc 
gcaacccact 
cactaaaatt 
tgctggctct 
ccaataaaag 
tactgaataa 
actacaaggc 
tgacagtact 
atgtttctgc 
aaaagaaagt 
gtaatttact 
aacttgaacg 
tgacaagtga 
ttgacgagga 
ctagaaagaa 
aagaagatiga 
atgcaatcat 
acattaaaaa 
aaaaatacgt 
ccactgaatt 
gtcagtccac 
atttgatgca 
tcgtgctgtt 
aaaaatccga 
aagagacggc 
gtgctgctac 
ttagagatat 
ggaagacatc 
tgatagattt 
tcttggaaag 
tgcaactctc 
acaacaacac 
gagtacgttg 
cgatgactaa 
gccagagcat 
aattgaggct 
atgcaatata 
gtagacgtaa 
acctgatggt 
gaataaaaat 
cagtcagttc 
aagacgacga 
aactacttct 
tgtctgtaga 
ttcttgcaaa 
tcgccgcact 
tgggagaaaa 
atggacgatc 
atggctaaga 



tacacaattt 
ccaaatgacg 
gacgcttcat 
aacggagacg 
gaagaagaag 
gtacaggaca 
caacagggtg 
ccagctgata 
ggggaaaaaa 
ttaatagata 
attcaagaat 
gtggtctcat 
tatcccagct 
ctttaaattg 
caaaatgtat 
cgattcgttc 
tgaaaacaaa 
tgttggattc 
tgcaaacttg 
aaaaaataac 
ctctgcacaa 
tggagtttgt 
ccaccatcag 
cgcacaaaat 
ggaggaagaa 
agatttctat 
gaagaaaatg 
agaagaggca 
tttccccaat 
gcggtcacgg 
tggaaattgt 
tacttcccta 
cagaggggct 
ggtaaacagc 
tgataagaaa 
gtattttgat 
aaaaataata 
cccaaaaatt 
tgttggattt 
tatttctgca 
aatgataact 
aggagtggtg 
cacctccata 
tggtagcaat 
tggattgagt 
ttcttcttcc 
agatctttcg 
taatggatta 
cgactctggc 
aaggaaaaga 
gagtcagggc 
tgggggcggg 
cagaagaaga 
ctgcgaagaa 
attccaacaa 
agaatataac 
ggatcttctc 
caeca tatcc 
tggacgtttc 
tccagcgcgc 
aggagggaag 



tgaaatacac 
tgtattacag 
gtgaaagcta 
gacgtgtccc 
aggaaataac 
gtaatctatt 
gagaagaaga 
acgattcctc 
gtccagaacc 
gcgatggaag 
gaaaatatta 
ttggtcaaca 
aacgaactga 
gaggaccaca 
atagatgata 
gtttcgcttg 
ttctgcgtaa 
cccgctggat 
ageggtaaaa 
accaataaaa 
caacttgacg 
tatgatataa 
aatataatag 
aaaaaagtca 
gaggaagatg 
gaggatggtt 
gaagaagatg 
gaaacttgtg 
ggacaaaatt 
aggaaggggg 
aagagectag 
tttatgaagg 
ageegtataa 
ccttcgagtt 
actaaggaag 
gtcgtatctg 
aaacatttgg 
cccctcaatt 
aaaataccag 
atggggctcc 
aatggcgcct 
cccagagatg 
tttagaagta 
tcttctagtt 
ataagcaagc 
ccgtcatcct 
ataacgacaa 
cagaaaaaca 
gcattcaagg 
aggactttat 
aatgaegtea 
gcttctaggg 
gtaaggaaca 
gaaggegact 
eggcaacaag 
aacagagtaa 
tctactgtat 
cggcactctc 
ttcctataag 
cacctacaag 
gtaegtcega 



aatgattgat 
tactattgaa 
ccgcccgacc 
ttacccttcc 
aggegttatt 
cgatgtggaa 
ggaacttcct 
ctcacccgcg 
gtacaatatt 
aaacaataac 
ccagaatagc 
tttctggagc 
atagatactg 
taacgaggga 
aaaatggact 
catctttttc 
aaattcccca 
taagtgggcc 
geggtataga 
gagcaactga 
attctgaaat 
atgtgtccag 
gacaacatct 
aggaagagga 
aatatgaaga 
acgatgaaga 
aagaggatga 
gtgctaatgg 
caaaaaggaa 
agtgctctgc 
gtataaagee 
ggagtgaagc 
ggtcactact 
ggaaatgggc 
aagttgaggt 
aactccctag 
ctctgattga 
tgattaaaac 
aagaagtgct 
tctctgtaca 
ttgcgacgtc 
ggagtaacac 
ttcgcaacag 
cttcttgtgg 
gtggtataac 
eggecatgat 
tcatccaaga 
tgtctgacaa 
cgctcctaac 
tggcatctgg 
atgatgecca 
tgtataaaag 
aaccccagtt 
tttcttccga 
atagctgeae 
gcggttcttc 
ctccgaacga 
ttttcaaege 
agcactattg 
gategtatgg 
tcgttggcca 



gatggtgtaa 
agageggatg 
ttgtttgacg 
agtgaaaccg 
gacgacagta 
ttattcgata 
tcagcaatat 
catattccct 
tttgattctg 
cctaaaagag 
gtctaattac 
tcctctaact 
gaccaaaact 
tgaagatact 
atatcagtct 
ccatggagct 
cgacactcgc 
ttttaagaga 
tggectttea 
tattttacat 
gtctegtact 
ttcaaggcaa 
gatcgaattt 
aaatggtgag 
aggaggttgc 
agagggegat 
agaagaagaa 
tgttattgat 
gaaaaatggt 
taacacttta 
agtagggtgt 
tgacagctgt 
caataaatac 
taaccctccc 
taagtttgaa 
taatattaaa 
agactttctt 
tatgacgagc 
cagttttatt 
atttgatcgt 
atgettgaac 
gtggctccac 
aggagtgaat 
agggaataag 
cctgaaacca 
ctcattgcct 
tttctcagaa 
gagcaaagac 
atgcacagtc 
agagggtgtg 
ccagttccag 
agcccagcgc 
tactatagca 
gttaaatcca 
agaagacgac 
caccacagcc 
aaagaggaac 
tetatctgea 
actaccacaa 
agaeggaatt 
ccatggacga 



taagtgttcc 
acttactcac 
ecagageggt 
tagaagactt 
ctgaaataga 
tccctgaaat 
ctgaagtgtt 
catttgggaa 
ccctcgacca 
tcgactggaa 
gtccgagcct 
gctgagaaga 
aacgtgttat 
ggtacaataa 
gccgttttaa 
gatttggttt 
gctgaatctt 
tggagcatta 
ggttccatgt 
ttggtgaata 
tttaaccacc 
gtgaaccagc 
agaaccaagc 
catgaagaaa 
ctatcagata 
gacaatagaa 
tatgacgatg 
tgtgaagacg 
aaaaaaacaa 
tcctttgtgg 
ccgcccccat 
tataatactt 
tctgttaaag 
gaecgaeggt 
attgaatgtg 
gtatggttaa 
ccagctatgg 
attttctctg 
cctatagaat 
ataatagaag 
aacgeattet 
aeggaccttg 
attggcggta 
ggcgattatg 
ccacctgcag 
cagcccacgc 
gtttctggga 
gtgtttaatg 
aacgataaaa 
gtacgaagaa 
gaagaatgcg 
egaggttegg 
gtctctgacg 
acgcactctc 
gaegttttag 
ggagacagag 
tctgccgccc 
aaaacaaagt 
cattgaagat 
ggtcctcgag 
attggaggta 



20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
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cctgaagaac 
ccaccaaaac 
gcaccatccc 
acctaccctc 
gattatcctc 
tgtaccacac 
aatctgaaga 
ggttgtacaa 
cattcttttg 
gtatttttgt 
aaagtttttg 
ggtggtcggg 
gtcatttctg 
ctagctcctc 
ataaggccta 
gtttttgaga 
gtcagccg tg 
agtcatttct 
tctagcccct 
cataaggcct 
gtttttgaga 
aacccagaaa 
ttgtaccaaa 
ctcaataacg 
ttttggctta 
tttgtcacac 
atagaa tgtt 
aacactgttt 
tccacagtcc 
gcaattcgac 
ggggaagaaa 
caggcggttc 
attggggcgg 
ttcaaagact 
ggttactagc 
ctcccaca tc 
tacattattt 
gcggtccacc 
tgacccagaa 
gaatacaggc 
tggtatacat 
tcggaacttg 
aatt t ttgaa 
tccagacatc 
tttctcccac 
tccaccatca 
actccttcca 
aacatgacga 
ctgcaaactg 
tactacgcta 
ccctctcaat 
ctgaagaaaa 
gcgatggcat 
aaggaggaac 
agacattgat 
atatggagga 
aggtgtctgt 
taccaaaaaa 
tatgtgtgtg 
aaaattt ttc 
ttccgagggt 



cagccacttg 
gcaagtcaat 
ccgtccgttt 
cctacgacga 
caccatacga 
gtcaagtcct 
gggagaagaa 
gaagaaaata 
cttgctgagt 
gtattcttta 
agatggagat 
tccagaaacg 
gaaagggttg 
tctgtaatct 
gtcagcggtc 
tggagatgaa 
ttccagaaac 
ggaaagggtt 
ctctgtaatc 
agtcagcggt 
tggagatgaa 
catatatcaa 
aactcagaaa 
tacgacactg 
gatgcattat 
atccacaaag 
acacccttgt 
ccgaagattt 
ttgatctcct 
ttgatctgga 
tacgccaatt 
tccataatct 
gtaggaactt 
gaggccattt 
aaggagctca 
tttcataccg 
ctgttataca 
cttcgaactt 
aaatttttga 
agttctgcct 
ttctagcccc 
acattcgatc 
aagtttttga 
tctagagtgg 
tgacgcagac 
ctgccgaata 
gaaatttgct 
ccctttacgg 
tgtttcgagg 
cgggaaccca 
ggatcaaggt 
ccccatcgac 
gacccacctt 
aagaacgtgg 
gatgtcccat 
aggatttgtg 
tttgtgatat 
tttattatga 
ttcatgctag 
tgggtcgctc 
ggaccgctgg 



ctacacttgc 
attcagagag 
agaagagctt 
cggatcttct 
cccatcagaa 
cagttctatg 
ggccctgggt 
ggattgattg 
gtacatagct 
ttactgccta 
gaagggaaaa 
tctgtctgtt 
taatttatta 
gacgttgggt 
caccccctaa 
gggtaaaaag 
gtctgtctgt 
gcaatttatt 
tgacgttggg 
ccacccccta 
gggtaaaagg 
aagggatgga 
tctcatcaca 
gacctgggaa 
caacaaacac 
aggccatatt 
cgatccagtc 
ctttgtcccc 
ccaaccagtc 
aaccaatgtt 
gtgtatctgc 
ccttgccaga 
tgccagtcaa 
ttttggttgt 
cagtcttata 
gggtcggcta 
tttctagccc 
gacattcgat 
aaagtttttg 
tgctccagaa 
ttcgtgtaat 
gactcagcgg 
gatgaagatg 
ctagagccca 
tatataagct 
cactcttgtg 
ccagttatca 
cagattgatt 
aagagaaaag 
tacaagtacc 
gaagtaggaa 
aagtgcggag 
tacaacgacg 
gtaaagaaga 
tctataaaac 
cacaaactcc 
aataaagatt 
tgtatttcat 
cccctctttc 
gagtttagag 
gtcgacccaa 



ggctacacct 
ccttgcgctt 
gtcgacgtgc 
acatcgtctt 
aatccacaga 
acggatcacg 
cttggccgtc 
tatcgatgac 
ttatatttgc 
tttttatggt 
agagggcgct 
ccagaaatgt 
tagttggtat 
cgacccagcg 
actggagtga 
agggttgaaa 
tccagaaatg 
aaagttggta 
tcgacccagc 
aactggagtg 
ctagtaatag 
aacaactatt 
tgacaaatct 
ctcccaccta 
ttttaccaca 
ctccatagaa 
catggcgaca 
atccatgtat 
gttcaaaaaa 
gagtttcttc 
ttcaaatgct 
attggggagt 
gtccatgttg 
tgctgctgag 
tatagactaa 
gaacccagaa 
cttcgtgtaa 
cgactcagcg 
agatgaagat 
atgattctaa 
ctgacattgg 
tccaccccct 
aagggggaaa 
gaaacatccc 
aaccactaag 
ttgctccgca 
agactgagaa 
ttagagatag 
ctggatctgc 
gtcgccccaa 
catgcctccc 
tggcgttcct 
aagagtatat 
accgccaaga 
aatattccaa 
acacatttat 
gatgttttta 
tgggatggct 
acccttcatc 
ggtggaccgc 
tgtcagatta 



ttattagacg 
acccagaact 
cagaaggagc 
cacaagccga 
ggtcccaagt 
ccagggccaa 
gcaacaactt 
gaactttgca 
caataaaaca 
ttaacggaaa 
agttcatata 
gtttagagat 
attatttctg 
gtccaccctt 
gctgaaaaaa 
ggatagctga 
tgtttagaga 
tattatttct 
ggtccaccct 
agctgaaaaa 
aaggttgtgt 
ctttttatac 
gaatgtccca 
catgttttac 
ttggcattca 
tcagatccat 
atagtaaagt 
tgggaaaaag 
tccctaataa 
gtattttctt 
ccacacttcc 
aggccttcga 
gcaaagagat 
agaggagcaa 
cacccccccc 
acggtttgac 
tctgacattg 
gtccatcccc 
gaagggggaa 
agatttctgt 
gccgacccag 
aaactggagt 
aagagggtgc 
aactagtttc 
catattttgc 
tagcctaaca 
gaaggaagaa 
aaagacatta 
acatgatcga 
tagaacacat 
tctgcgaccc 
gtactccaac 
aaagaagtgc 
atacttcaga 
ttttattttt 
taatatggta 
taaatgaaac 
tttcactcac 
tccatctcaa 
tgggtcggcc 
cagagagggt 



cagggcaccc 
tctccccgat 
gagttttttc 
atgtgaagat 
gtgtgattat 
cctcataaaa 
tagctactag 
tgtttttgac 
gaccccgaat 
aaatttttga 
ctgcccaata 
ttctggacaa 
gtatacattt 
cgaacttgac 
tttttgaaaa 
ggccacctgg 
tttctggaca 
ggtatatatt 
tcgaacttga 
tttttgaaaa 
aactcggcat 
agtagtttta 
ttccgtggcc 
atacagaaaa 
ttctcttcag 
caacaaatat 
cttgtccaaa 
tacagtatgg 
agaaagacag 
cattgttgga 
ctttatggat 
agaacaggat 
cagcagggtc 
gtagtagtgt 
cccaaaaatt 
tcacatctgg 
ggtcgaccca 
taaactggag 
aagggttagt 
gcattatttc 
cggtccaccc 
gacccagaaa 
tagggtgcgc 
tgggacattt 
acacatttca 
atggccccaa 
gaaagggatg 
atctgcctca 
gtatacaaag 
cgaggattgg 
atggaagaga 
tacaatgaag 
aaaacaattg 
caagctctag 
ttcaaggagg 
caccctaaaa 
gtattattta 
caatctggcc 
aaatttttag 
taatgtcaag 
ctagaaatat 



23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
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gtaccagaaa 
agaaatcttt 
tgttggcggc 
caaaaatttt 
gcctaatgtc 
ggtctagaaa 
aaatggcttg 
cggcccaggc 
cttcatctcc 
ggaccgctgg 
cagattacag 
aacttttcaa 
aacatttagg 
taataaaaaa 
atctattata 
ttttacaaga 
gggaagtacg 
ccaatctgtc 
tggtactact 
cattcttctc 
gttcagccca 
aagaacgagg 
ggagagagaa 
gggcaaactg 
aagtactaga 
cagagacatc 
ccttctagca 
aagagcatca 
attttcatta 
tgtggtaaag 
aggcttcaga 
aatgatacaa 
cggattcttg 
tctaacatag 
ccttttcaca 
caaacagatc 
caacagtatt 
aatattgtcc 
ttgaaaatgc 
ttcagcaaca 
ggaaagaaaa 
attcaatcta 
ggatgatggt 
tatcagtcgt 
agatatgtca 
gatattgacg 
gaaaacaa ta 
caacttcttg 
aaacttgaat 
tctcaaaaat 
gtggatgttt 
agtcattccc 
gataatatct 
aggagaattt 
gtccagaaat 
tcgtggagct 
gcgtacgccc 
ataggggtgt 
atcgtaccgc 
ccatcatcaa 
gaggagtgaa 



tagtgtacca 
gaacccattt 
tgtcaatagt 
tagaaaattt 
aagttccgag 
tatgtaccag 
tccagaaatc 
agcctcacag 
atctcaaaaa 
ctcggcctaa 
agagggtcta 
catgtggcat 
tgagaaagat 
tcatacagtt 
ccctattctt 
cactattggt 
ttctctgcat 
ccaacatttt 
gaaatcttct 
atgttcttct 
aggataattt 
tgctttattg 
atctaaagca 
tgttaatgcg 
tttaccatac 
gtgtttgtga 
ttcactatac 
agaaaattgc 
ataaaggaaa 
ggggaagata 
atgttaagtt 
taacacaata 
gtaagaaaag 
cctttaactt 
ataagattat 
ttgaagtaat 
gcaccacaca 
atcatttcgg 
tggattgttt 
gtttttgtag 
gacttgataa 
gctatagatc 
gatgtttcta 
tttgcctttg 
aaaggttcaa 
attcttgaaa 
ccacttcttg 
atccttctca 
ccagttctgc 
ccattatcca 
tcgtaaaggc 
tcgcagatgg 
ctaaactctt 
ctggaggtaa 
gacata tatg 
gcttcagcca 
gcttttatac 
gtcccagttt 
ttattggctg 
tatggaggga 
gaaggacgat 



gttataagaa 
ctggaacaga 
agcctatgta 
ttctgggtcg 
ggtggaccgc 
aaatagtgta 
tttgaaccca 
ttacctgttc 
tttttaaaaa 
tgtcaagttc 
gaaatgtata 
acaccgtatg 
aatacacatg 
tcggactcat 
ctacgtttac 
tcaaccaaca 
acaggacacc 
tcatcaatac 
aaacacactg 
agagcccttt 
gaggatgcaa 
acatcgagac 
gctttcaata 
gtatcattga 
aacatattaa 
ttggcgatgg 
ttctatcgaa 
acgaggttgc 
aggccaagct 
ttgcagcgca 
tttcttgttt 
gagtattccc 
cttcaagatc 
ca tgagggtc 
cgcacagttt 
tgtaatcaat 
aaccactacc 
cagagaattt 
cagccttctc 
tagtctcaca 
tcacactcct 
tagcgtccat 
ggcaagaaaa 
taacacaagg 
gtggtgcaga 
agagcctgaa 
cagaactagt 
taaccttctc 
tgtcatcacc 
tagagaccag 
ctctgaaggt 
gggaagaatc 
ctttattgtc 
ttatgtcatt 
gttcaatgtt 
tgttgatgat 
aaagatcccg 
tacatccaaa 
ctataaaaga 
gaacatcaat 
agaactggaa 



aatgtgaccc 
cgtttctgga 
ttcaatggtc 
ctcgagttta 
tgggtcgacc 
ccagttataa 
tttctggaac 
acacccttcc 
atttttctgg 
gaagggtggg 
ccagaaatag 
aacgtatagc 
tataaatttc 
ctgtccacac 
aagatacttg 
tctctttaac 
gggtaatttc 
aatgtccaca 
gacaattatc 
taatttcttc 
tataccccaa 
cggtggcagc 
tggtgggtat 
acatcaataa 
tctttgcccc 
caatcattag 
atcgtatatt 
aacagctgca 
gtttgttgtt 
aaccattaga 
gactagtttg 
cttttcatca 
ttccatgaaa 
tgttatcaca 
gtcggtcttt 
ctcttcaaag 
cactgcttcg 
tggcatcgtg 
tataaacaca 
gtactcttct 
gtaagcacta 
gttcatttct 
aggtctcccg 
agattcgtcc 
ttttttcatt 
tctaataaca 
agactttttc 
tctttcttcc 
agtgccaaac 
aagagaatat 
tttgcagacg 
agacttggta 
taatttcttg 
cagaagggcc 
ttcaagtact 
gtctcacata 
tgtaagaaac 
gttaatatat 
gggagcacct 
atttgaacct 
caggaactct 



tttccagaaa 
gcatggtcga 
ttttaccctc 
gagggtggac 
caatgtcaga 
gaaaatgtga 
agacgtttct 
tactagcacc 
ctcactcgag 
ccgctgggta 
tgtccgatta 
caaatatgta 
aaatacttta 
atcatttatt 
ttcatcaggc 
ttgatcagcg 
tccacgttgc 
gcgtgtcgtc 
gctgtcgaaa 
cttatcccca 
gcgacaattc 
aacaaggggt 
actaggctcg 
tgctgcttca 
atgggctcca 
gggcgttgca 
atattcttgt 
gcatggaata 
agaatttcaa 
ggtgacatgc 
ctagtatggg 
aaataagtgt 
cacgagactt 
acgtttttat 
tttcttacca 
ggcctactgt 
tttattgcaa 
tcaaatatac 
tcaacaaact 
accatttctt 
gccctttctt 
tctctgaagt 
ataataaaat 
acaaaatact 
tcagccacgt 
ctcgaacatt 
aggctagcca 
tcagcctgtt 
ttgatgccgt 
tttacgaaca 
gttgtcaatg 
ttgttgttga 
aaactacttg 
aattcccttt 
tcttcaagca 
cgactgttga 
tccctccggt 
ttttttaata 
gctcacttgg 
agtcagggag 
atccattttt 



tggcttgtcc 
cccagatatc 
ttctccatct 
cgctggctcg 
ttacagagag 
ccctttccag 
ggagcatggt 
ctctttcacc 
tttagagggt 
gggcccatgt 
ttaaaaattt 
ccacagtcaa 
tttgcataca 
aggaactggt 
actgccattc 
tcaggtctta 
gtactcaaca 
ctgatgatat 
cgcacattag 
gaaattatgt 
agtattggcc 
tcttcttctt 
tatcctctct 
gttacactta 
actagtgcat 
ccatcgctcg 
attttttagc 
cgttttcatt 
tacctactct 
ctttcgcaga 
aatatttgtt 
gtttgagcca 
ttacacacaa 
gtgatacgat 
taatctccct 
taatcatcca 
cattttttag 
tgtaaagggt 
ctactgccat 
cttctatcga 
cgttggtcca 
cagttgacac 
tgccattgga 
tgtatccgaa 
aatcagaggt 
tttcaacgta 
aaacaccgtc 
cctttgaagt 
gcgtctcgcg 
aaaagtcgtc 
cgttgataaa 
taaagaagta 
aaggaacagg 
ctgtgaaaac 
cctgacggta 
gtttatccat 
tcagttcagg 
taacaaaaaa 
acatcattaa 
atcctagaaa 
ggaccccaaa 



27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
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tgaggttctc 
gaggagttgt 
ccaagaagaa 
ggttatacga 
ttggtgctga 
ccaatatact 
ggaatcctat 
atgtggctaa 
gcgtgccctt 
gtctaaaacc 
ttgaggtgtt 
taatggctcc 
aaccacacag 
ttgttactca 
tatataagag 
actagctctc 
gccaatatgg 
tttgaaagtc 
ggaaagtgtg 
aatgcagaag 
ttattccttc 
tgtacttttc 
ctagccatga 
actgtagaaa 
gtagtaaagg 
aggtgcaagt 
gatttgacat 
aatctttttt 
gtatgtagat 
agaagatatg 
gttacaggtt 
attttatagg 
tgatgtagat 
acctgacggt 
tagagttgca 
ggcaatgaat 
ttcagagcca 
aaactttatc 
tgaagtgaaa 
taacatccca 
gtactctgaa 
attttataaa 
tgaatctgaa 
tgttgtggac 
ctttaaagta 
tatagtggcc 
ctatggcact 
cactgaattt 
ctttatgagt 
caaggcagta 
cagaaatact 
gggcctctcc 
tggaaaaaat 
aggaagaaag 
ttaaagaaaa 
gggaatttat 
ttgaagatac 
caactactcc 
tcgaagaggt 
ggggcaggta 
aaagacccaa 



tcttcgagac 
ggaagaactc 
gatacacatc 
tagagcagag 
atatgatacc 
aaagaccctg 
ggatcttcac 
tggagaattg 
caatattgca 
gggagagttt 
aaagaagcag 
agaaaaaatt 
tggaaacttg 
tttattccta 
cccgtgttag 
tctctccact 
acttgacggc 
tactccccaa 
ggaattttga 
atatggagaa 
aagagtttat 
cccaatacat 
aatggatggc 
atgcggcatc 
tgattgttga 
acccaggccc 
gtgatgagat 
attcaacacc 
gtctactttt 
gattctaata 
ttagggatta 
gtaggtaaac 
aatggggtag 
ggagatggat 
aacgtcatct 
gattcaactg 
ttcagagaag 
ctttcaactc 
aatcaattag 
gagactgatt 
aaggtattag 
ggtaaaaaat 
acaataaaag 
gaacagacct 
aataaggaga 
aatattactc 
agcacagcaa 
gtttgtggtc 
gatactgttt 
gaaattttag 
cagtgagatg 
tagttatggg 
taggggtaaa 
agaggacact 
tgtaaccgaa 
gcatgatttt 
accgggtttt 
aaaatacgta 
ttataatcct 
cgaagaaaat 
taatgtagct 



gacactattc 
ttgtggttca 
tggaacgcta 
ggagatttgg 
tgttcttccg 
agagaaaatc 
cttatggctc 
tcgtgccagt 
tcatactctc 
atcctcactc 
ttgtgccgcg 
gaggacttta 
cagatgaaaa 
gaaatggtgt 
ctcctcgatt 
caagatggcc 
tggcacaaca 
ctctggtatt 
agcaagtgga 
tctcaacagt 
cagcaactgg 
ggacggtggg 
tagggatgtg 
catatggatg 
caatgcatca 
agtgtcatac 
ttctggattc 
catgatttta 
cgatagctcc 
cttctatttt 
taataacggt 
gcaaatatta 
aaggaggagg 
acgtagatct 
ttcaagaagt 
ataagataat 
caaccgtaga 
tagatttaac 
aaaattttga 
atttatggac 
ggttctcaga 
atgaggtaac 
agaaggcagg 
acaaatcttt 
aaagggaagt 
ctctaactac 
aggaaaagac 
tgagtaacaa 
tcccttcatt 
gtaaaagact 
gaaggggtta 
tctattgatg 
ataaatgaag 
ataagaagta 
aaaattagag 
ggaaaaatt t 
gaaagtgctt 
tctatgaaca 
tcacctatca 
gcaggttcga 
gaaaatctca 



cagttctcac 
tcaggggcaa 
atgggtcgcg 
gacccgtata 
attatactgg 
cagatgatag 
ttcctccatg 
tgtatcagcg 
ttctgactca 
ttggtgacgc 
tccctagacc 
ctatcgacat 
tggctgtttg 
aatcgctgtt 
cagtcacaag 
tttaattttg 
acagacccta 
gaggtgatga 
ggtgctatgt 
ggttcccatg 
attgaagaga 
gatggttcac 
actttctttg 
taccaaaaac 
aacccaatgt 
gttattgaag 
tttgtataat 
gtttttatag 
ttcacctata 
accgccaagc 
agcattaata 
cccttcttca 
aggaacaaca 
ttctcctcaa 
gtcaaaggat 
ggaagaaact 
acgtgaagtg 
agaggaacaa 
ctatgaagac 
aactcatttc 
tataaatagt 
tactggaaat 
aaatagtctc 
cttccctgca 
tactgtatcc 
gccagttgga 
ctatctattt 
gtcaactcct 
tgacgaagca 
aggtgtagga 
ttttggataa 
tgggtacggc 
aaaccacaat 
tagtggctaa 
caatgacaga 
caactggaga 
taaaggcaga 
gtttacgtat 
tgggtattag 
aagaattggt 
ttattagagt 



taccaagaaa 
tacagacgcc 
ggaatttttg 
cggattccaa 
aaagggtatt 
aaggatgatt 
ccacatgact 
aagcggagat 
tctgatggcc 
acacatttat 
attccctaag 
gttttatctt 
aatcatgtta 
gtgggcggag 
agcgcacaca 
aagactctac 
cccgccccaa 
agaggcgtct 
cgtatttctg 
tcaagacaaa 
ctgatcgaca 
gtgggggata 
tgtttgttga 
tactagcaat 
tttctgtatg 
gccatggagt 
aaaaccccat 
tatataaaat 
tagaggaggt 
aaacggccag 
gcttccgttt 
tcctcttcct 
acgacaccaa 
aagaaggctg 
caaggagtgg 
gaggcgagaa 
tttaaggatg 
tttaaggaca 
atgacccgtc 
gatccgaaaa 
atagaaagaa 
gtagctgtcc 
atccgtaatg 
ttcaatcaag 
atcaataatg 
gcagcttccg 
gtaatagaca 
ctcatggctc 
gaaagacctc 
agatacacaa 
gatagaaaca 
tattttgcgc 
ggagaagatt 
tgttatcaaa 
taaggaatta 
tggaggaacc 
atataaaaac 
cgatgcgatt 
agaatacggc 
ttttatgacc 
tgccaaccag 



attttctgga 

aaagaattgg 

gacagtagag 

tggcgtcatt 

gatcaattgg 

atgacggcat 

gctcaatttt 

gtcgggttgg 

agtatggtgg 

aatacccaca 

ttgaggattt 

gaggggtatc 

aggaatttcc 

catatttgtg 

cacgcttata 

aaatctcttt 

tatcatattc 

cgtacggcaa 

gctcgaagat 

ctgcttggca 

tggacagtac 

ttttacttcg 

taggaataat 

tggtgcaaag 

taatgcgtgt 

gggtcattct 

aagaaacaat 

caagaataaa 

ggataaaaac 

ggttaaatct 

catcctttat 

cagaattatc 

ctcaaccttc 

aactaagaac 

cctttagacg 

tcaataactt 

acacagacaa 

ttgttatggc 

tcatctttga 

aatatgacac 

tatcctctac 

tcgttgattt 

tcgagtttat 

ttttcttctc 

gatgtgtagg 

gacactacat 

agtacgatac 

taaatattct 

tgacggatgc 

acgccaacat 

attgcaaaaa 

cgtcaattca 

atgggcacaa 

gagaatactg 

aatgacaata 

ttccatctct 

gttccaggag 

aatggaaaaa 

accattcgca 

aagattgaaa 

cagtataatg 



31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 

34860 



WO 01/38351 



11 



PCT7US00/28888 



ttatgaggat 
tgtttatacc 
cattcgtaag 
tttattgagg 
atagggaatg 
cacgataagc 
agacaaattg 
aatgtcacaa 
aaaaaaaata 
aatgacatga 
gaaagtataa 
tatactgtat 
gcaacctatt 
gaaataccag 
acctttaata 
gtgacaggtt 
aactgcaaca 
acagaggcat 
tatgaaatat 
gtagttttat 
atcagagcta 
agatgtcctc 
acctgggttc 
agaaggtaca 
tgtggatgcg 
ttcaagtgcg 
aacaccataa 
ttttctttcc 
ccctacagcc 
agggggcaac 
atatgaacta 
aggggcggcc 
ccttttcata 
gaaaaactta 
tgcactcaaa 
tgctgagcag 
ataatcaacc 
cttatgtttc 
attcttccct 
gctgctgccg 
aaaacaaggg 
taaaaggcag 
tttatagatt 
tcaacaatga 
acttaatgag 
gcttcacctt 
ggaacaggtt 
ctctaagagc 
tattatcgag 
aaatattaga 
tactactact 
cttcttcgtc 
aggttgaggt 
agtcaggcaa 
tactaatacc 
caggaagtga 
cctcatcatc 
tattatcacc 
tgggagctac 
tacccattaa 
cctgtctaga 



ggtgtttttt 
atataatgtt 
gaaaattcct 
aaataataat 
tttctgctgc 
taaacacatt 
tggataaaat 
ctaatgggac 
taggtgataa 
agagtctagc 
caggaacaaa 
ttgtcaataa 
ataacacccc 
attcgtacag 
taaaatggag 
atcttgcttc 
actgttttat 
catttaccct 
gtatagtagt 
ttaactctta 
tagtacagaa 
cgtgaagtgt 
aaattaacat 
ggactagcat 
ggagggggca 
ttagacacaa 
tcttccatga 
tcagggtcat 
aagtcaggca 
accttgacag 
ggtttatgga 
ttacagcctc 
ccttcagaca 
taaaaggact 
acatccccta 
gtactttcta 
ttcttacagc 
ttcatcattt 
tttattgtct 
ctgctgctgc 
ggtaacacat 
tagaaaattt 
gggcagggaa 
actgtgtaca 
acaggaacca 
caatttctta 
acctctagat 
agaaagttca 
tctacacata 
catcctgtcc 
acttcctgag 
atcagagtct 

aggggcggat 

agacattgtg 
actaatagta 
ctctgagtca 
ttcaatgaaa 
ttcatcatca 
atacttaagc 
agctgcctca 
tagagtttca 



atagactacg 
cagaaaacaa 
gatgaaccag 
aataataata 
cctacgagag 
cctggacaga 
acgatcccaa 
tgcattttcc 
ccttcagagt 
tactggagcc 
ggactggctc 
ccttgcaaga 
tatatatggc 
caaggtcgaa 
aggcgatacc 
catctctgaa 
agaggcagat 
caacttaccg 
agtttgatgg 
atcttatata 
actggtgctg 
tgcgtctgcc 
ttggtgcatc 
catagtccct 
tattagccgt 
acctctcagt 
gggcgtcacg 
cagagaagag 
caaaaactgg 
gagcaggagt 
actcctggca 
caaaaacctt 
acatcatgaa 
tttcattctt 
tctccccttc 
tagatgctct 
tggcacactt 
ctgatgaaga 
ttcctttctt 
tgctgatgat 
gtaataaaat 
gtacatcatt 
gatatttgat 
tgtgcaattt 
attagctgct 
gagcacaaac 
atacaatctc 
gccagatacc 
tacctcttga 
tcgtcacgcc 
gaagtagagg 
gaatcaacat 
gaggtagggg 
tcatcatcat 
tcaaaggtaa 
tcattatcat 
ttgaaggatg 
tcatcatcat 
cattcacctt 
tgattatctt 
agcatgaaac 



aaacaaagaa 
aggctcttaa 
aagggagtat 
ataatggcat 
ttgatgtact 
aatgttgaat 
acaacatctg 
cttttcgaag 
ggggacttta 
atcttatcta 
aaggctataa 
tcaacactcg 
gggtattgta 
gcggaacata 
ataaacaacc 
gacgcagatg 
atgagtagcc 
gaaatcgaag 
aggacaaata 
catttcatgt 
gagctgatta 
gtctctgacc 
aaagggggtg 
cacaatcctt 
gtctagcaac 
gtcttctgct 
ccttcctgcc 
tttgtgaacc 
cttatttttt 
ggggggtaaa 
aggtatt ttg 
cttagctatt 
cccattctta 
tgcaagaaga 
cccatacttt 
cagagcatca 
cttaaccata 
ggaagttttg 
gctgacgata 
gtcagttccg 
gcattataat 
tacaggcagg 
cagcttccga 
cgtcatccaa 
tattagcact 
tcttaccaat 
caagcttat t 
tattggcaat 
catctacaca 
cataccttgg 
gctgattcac 
taaaacaaag 
cggatgacgt 
catcatcatc 
tattgttact 
ctacctctgg 
tcacaccaga 
tactcaagca 
catattcata 
caaaggtctc 
tcaacttggt 



gggggtgtcc 
agggcgtagt 
cataatacac 
tacaggaaaa 
cacccacaca 
catcttcaga 
acatatctga 
ataccttaga 
ttgatggccg 
gacagcgaga 
tgggttgtgg 
ataatgatga 
aaatggctat 
cagttgaagg 
taataacaat 
tgcaggcgcc 
tctacatgga 
gagctgatgc 
aaaaattgga 
acatttcatt 
atcctcctgg 
ctatcaacca 
tgaagaggtg 
gttcttcccc 
tccaacatct 
tcattgtcat 
tgatgcctct 
cagcatggtt 
acattcttat 
ggagtggtat 
ccctccctgt 
ttagtatgca 
tcaaactgaa 
catcccgcca 
atttcaggaa 
ttttccaagg 
tcctttattt 
attgatgtct 
caccctccct 
gccattttta 
gtaacattct 
catatacaaa 
agggtcagta 
cctagcatta 
ccactttgaa 
tacatcattg 
ctcagtttcc 
ctcattatac 
cagtctctta 
cctctttcta 
tgatgacgaa 
gttaacatta 
agaagcacta 
attaacatca 
tctaggggga 
ggggacaaac 
caacaaacta 
ttgagtaatg 
aggggggata 
ttcactgaat 
ataagccctc 



aaggaggaaa 
acttactttt 
gcactagggt 
ggatataact 
tatgcagcat 
agagaaaata 
aacagtcaat 
aggtatggtg 
taaaaagctc 
ttttgttgca 
tattataagg 
tgacaaggca 
aaaggactat 
aagaaagatg 
catcccttca 
attactttta 
tgagaaaaaa 
gaatgcagtc 
tacatttcat 
aaaaatcact 
ccatgacggc 
tcctacctga 
aagaataagg 
ttctaacagg 
gtgcctggta 
tattaggaat 
caaagagccg 
tcttagcctc 
agggaagaag 
attcaccctt 
cctcttttag 
ttttctcaat 
aacccctcaa 
ttagcctaat 
acctattgag 
ttgatgagct 
ccattctttt 
taaagacgat 
tcttgctgct 
atgactctaa 
tttattttaa 
aacacatcac 
tcaaacccat 
ttagaggctg 
gcgtacttac 
acagttgtat 
tgtctccttt 
tcctcatcaa 
ctctcagagg 
ctgttactac 
gaaggtatta 
atgttaagag 
ggcatattca 
taaagtgaac 
acagtcaagt 
actacagtat 
ggggagacat 
tcaatagtat 
tccttgaatt 
gcatgcattt 
tccatataaa 



34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
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catcctcatt 
ttgacatctt 
aaaatgagtg 
cagggggtat 
cttcaaatct 
ctccatttga 
aggtaggcac 
taaagaggga 
acagtcccct 
cgattttgat 
gacatcaaat 
ctggaagcca 
aagaactcgg 
tcatcgttgt 
ccagatatgc 
tttgcaactc 
agtttttcaa 
actgtatcaa 
tacaacaagc 
cacctaatga 
ggacacacac 
ttaacacact 
cccctattca 
atgaaagtgt 
cgagcagtct 
gtgctagtct 
gccatataca 
agaatgtgtc 
aagccaactt 
acaattgacc 
caactcttct 
tgcctcttac 
ggaggagagg 
aggttgatga 
ggaacagttg 
acaggccgtt 
gtattgatgg 
aaatacacga 
aatagcccat 
tattggtact 
gttttactgt 
agtccactat 
taagctcggg 
ctctgctgag 
tcacccccac 
atctggctac 
actaaatggg 
ggatctggat 
accacctcac 
ctatattccc 
cccataacca 
cccaggcgct 
aagccagcag 
caatactcaa 
gttggactct 
tagctctcgc 
gatcttccaa 
tttctatgtt 
tttactcttt 
cgcctcacct 
ccgaaggcgc 



ctcattaaca 
attaagggta 
aaaagaggca 
ataggcccta 
ccaggtgaaa 
ccagttatag 
ctcatcaacg 
gaaaagccaa 
taaaagggaa 
agaaatcagt 
taaaggcagg 
ttacactgaa 
gaggtgtgtg 
aacaataacc 
tatctataac 
ttgtaattac 
acatacttct 
ccccctttac 
gtgcgtccct 
catcatcaat 
cagtcacccc 
caaaaacaaa 
caatgtaata 
taaagtcctt 
tacggaagtt 
tttgtctaat 
ttatgtccat 
tagtgtcaaa 
tagggtcctt 
tatggccaat 
caatgccttt 
actgcctaat 
gagcaacttc 
cgtcatccca 
tggcttcgga 
ttttattccc 
tagtagtaat 
ttaattcaac 
aatatttaac 
tacttttgat 
tgggcactac 
tgtcactgca 
ttggctgcta 
actgatgctt 
ttccagaaat 
gggttcactc 
gaggacctaa 
gacccacctt 
cccatatctc 
ccaccttatt 
gatctggacc 
agcggtgtat 
aagagcaaca 
gctacaagaa 
aagttactta 
cggtctgata 
ttcatacaag 
atgctgaata 
gacagaaatc 
gaacgctgca 
ccaaacttcg 



cctttggcat 
tcctttacac 
tacttgtaca 
cacatgggac 
agacactcca 
caatccgtgt 
gcgacggttt 
cgctatagca 
tatgtgcttc 
gtctgttccg 
gccttccatg 
actgtacgtc 
aatgtactgc 
tagagggtat 
aaagttatca 
attacaagtc 
tacaaccttt 
ctctggtaaa 
tgactcataa 
actgatacct 
aactaccccc 
ctcttcatag 
gaccccatat 
cacatcttca 
ttcccagttc 
tcgctcacag 
aatctgggag 
aaatctcatc 
tccatagaaa 
tctcaactgt 
aaagtagtaa 
caccttggac 
agacttcttg 
agtcttgata 
ggttgaacca 
ctgatggtaa 
tacagtgggt 
gcccggcaca 
gttacgttaa 
gggtttggag 
actgcactgg 
ctgcactagt 
gagctggagc 
ctggcctgcc 
ttgccgtcga 
ctggccatac 
gggtgttata 
gtccttctta 
tcaccccatc 
cactcgctcc 
ccagtcttcc 
ataaggcggc 
caacaagcac 
tggcctctcc 
gttctgagga 
tgaagaaaca 
gtaagcacaa 
ttttgttggt 
tccaaccagt 
gccgatctct 
tccccggctg 



tggggacaaa 
cttcatcatt 
cctcttcata 
ataacaggtg 
ggtggacact 
ccgaatcgta 
cagccatctt 
actcatttct 
atggcaaggg 
ctttgcaaca 
ttagaacaag 
attgaatagg 
ctgattagtt 
gagagtccca 
gcctttatat 
tcagccaaaa 
cgggcaatga 
cgtcccatcc 
tactccccac 
gcaaatggca 
ttaatattct 
ttcatatcct 
ttaatctcaa 
ggcacttcga 
tcacactggt 
gcacttgaag 
gacggaatat 
tcttcatcgc 
caggtatcat 
tgacaggtat 
ggggaatcac 
gcacacacgt 
aaacgtttag 
gtgatgttgt 
gacgcgtagt 
acatactgct 
ccccccattg 
ctaataaagt 
tcaaaaaaaa 
cggagactga 
tgacgtagta 
agtagtagta 
tggaactgga 
ccggctcgct 
acgccagttc 
cctgtttagg 
acaacatgta 
tcctatcctt 
tcacctatac 
agtttcaaca 
cttttatccc 
gcggccaggc 
tctctctcct 
agcccccgcc 
actaaaggaa 
cttgctccat 
gttctcacta 
tgaagtgaaa 
acggagacta 
tggataaggt 
cttgtgttgg 



gaaagtaggt 
aacatagtca 
cactttacgg 
attttcattc 
tttcctgcaa 
atcacccata 
gttgttgttg 
ttactttaaa 
catattctgt 
acatcaaaaa 
caacaacatc 
cacctagaca 
cttccgtgtt 
agtcaatcat 
cgacattaat 
caaaaggcat 
aagagtctgt 
tttgaagcat 
catctaaact 
tctctaaaca 
ttctgtatac 
tcataaactt 
aaaaggtctc 
ccaaatcaga 
cagaaacttc 
gcaattggaa 
tacgactaga 
aagaagaggt 
ggacatactt 
cataattgtc 
gttcaatcca 
ttgaaaaggt 
ggggaggggg 
tgttgttgtt 
tgttgttgtt 
ggtggtggtg 
cctatggaag 
gatgtgtttt 
caacaacaat 
agtaaatcca 
gtagaagtag 
gtagtagtaa 
gcttgtatgc 
tatatacaag 
tccaacagag 
ggtctaaatc 
agcgttgggt 
atccttgtcc 
ccccatctca 
ccctgttctt 
taaccggcac 
cagaagcatc 
tctacctaga 
gcaccaagtc 
ctaacttcat 
ctattcgagg 
tacactttgg 
aatattctaa 
tggatgtttc 
gatgtccgga 
tgcgaaggtt 



gctagaatag 
gcagacaaca 
aagaaccaca 
accatatgct 
cacggcatcc 
cagatgggac 
ttgtttactt 
caggttcatt 
gattgaaggg 
gtgcatgttt 
ttccagtata 
gtgtccccta 
cctaaaatat 
tttaggttgg 
aattccctca 
gttcaccttt 
tattggtgcg 
gacaacccca 
atccttgaca 
taacccctga 
agaaaggcca 
gacaacagtc 
ccagggacag 
caaacaacca 
cttaattagc 
cacctctaca 
caactcctta 
gggccaatag 
gctagactta 
ctcaagacta 
ctcattatac 
gcgagtagtg 
cgtgatgcta 
gttgtaagcg 
atattcttcc 
gtctccacca 
aaacaacaac 
ttatataaaa 
aatagtatat 
gaaaacaaca 
tagtggtaat 
gagcgttgga 
ttggctgtgc 
ttgtcccccc 
tggggtccag 
cacccccggc 
aagaagatct 
cctgtctacc 
cccatcacct 
gccgagccaa 
catttatgcc 
agttctctgc 
agagacctgc 
cttacaccat 
acgtctcgac 
agcacgagaa 
actttgaaat 
gtccaattcc 
acaatggccc 
cctctatctc 
gtgaaggcac 



38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
40740 
40800 
40860 
40920 
40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42180 
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tggtgagctt 
ccatggagtt 
gccccgtgac 
tgtctgcaaa 
gacaagaagt 
gtttgcttga 
tcttagtttg 
atgtgaacgt 
caagtaataa 
ctaataatag 
gaagatcaaa 
atcatggcat 
aagttgaaac 
atttgagtaa 
tgttcatctc 
tgactgcatg 
atatcctcac 
aggggactgt 
atctggacaa 
tggcattcat 
ttgcaaagca 
caaaggttgc 
acgaaaattg 
acgccaaaca 
gtgcattccc 
gtgacacaat 
ttgaggacga 
aagccgagaa 
aggaagaatt 
ctaatgtgga 
tagaaaggga 
tggaggagga 
agaaccaaga 
atgcagatag 
gtagtagtag 
agaagaaaga 
catcagaagc 
tatcaccatc 
cgcctgagcc 
gatttagtta 
ctactgaacc 
ttttggacat 
tgaccgcaaa 
gtttcgatga 
cctccccgat 
ctattaagag 
aggttgaaaa 
accgtggttc 
aggatgaaag 
aacattcccc 
atagtacaac 
ataaaagcca 
aatgctcaat 
gaagtt ttag 
cagttaggcc 
ggtctgcctg 
acaagaagaa 
aaaaccaatg 
gtgtaaggaa 
agcttctcct 
acaaaaaaca 



ttgccagaag 
ccaaggagac 
tgagaggcag 
cttctctcca 
cgactcatca 
agagtgtggt 
caccctgttt 
acttaaccag 
taatagtgtc 
taacaacatc 
gaaagttaca 
gtttgccggt 
tcttacaagt 
aatgtgccgt 
tgagattaac 
ggataagttt 
tcgctcaaac 
ttattataat 
gagctggggt 
gggtattgac 
attccaacat 
cgcaggattg 
tcagtttatc 
acttctccgc 
atttgcaaat 
cattgaattg 
aaacatcatc 
gaccctgtat 
cttcaggcct 
ggaagaggaa 
agaaggatct 
gaaggaagaa 
agaagaacaa 
tgacagcgac 
tagcagcagt 
agaggaagta 
tgcttcctct 
attggacata 
tatcgttgct 
ctcttcattc 
agcagctgtg 
gttggacggt 
attctttgag 
tcttcttaaa 
tcacccttct 
gtctagggca 
ggaagcccct 
ctctggttct 
ggatatgaaa 
taaatatgat 
atgcatgtcg 
ctctcgtaaa 
caatactcaa 
tcctaagaaa 
ctataacaag 
accgactcaa 
ctatattttt 
tattatatgt 
agaaaaaaaa 
ttctttctgt 
acacaaattc 



acccgcttca 
gattttaact 
atgttcgccc 
gtctctgatg 
acttccgctc 
agtggctcta 
ataatccaag 
ttgtttggac 
ccatcaactg 
agtaacaagc 
gccacagcca 
gcccctgttg 
aagagcacca 
aaaactgccc 
ccctccaaat 
gtagaggggg 
tatgcctcag 
gacaagtcaa 
aacaagaatg 
attgtaaagg 
ccggaagtgg 
aagtcgggga 
aagtatgaca 
acaggactta 
attgtgacgg 
atgtacaaga 
aaggtgaatc 
tttgagattg 
acatctgttg 
gaggaagaag 
gataaggaag 
gaacaacaac 
caacaacaac 
agtgatagca 
agtagcagca 
cctgccaaga 
cccaagagaa 
ctccagactg 
acaacctcac 
gtaagaggag 
cccgctgctg 
ttacctggcg 
ggcatcaccc 
gccaccgaga 
agcaactcag 
ggttcacttt 
aaacgtaaga 
gcttcttctt 
tcaacatcaa 
gctgtaaata 
tcatcagata 
cactcttctt 
aatgtcaaga 
gatgagcttg 
aagcgtgata 
atgatactca 
ataatatttt 
ttaaaatcaa 
catgattatt 
gtctttgcag 
tctactgctc 



ccaccaacat 
actctgcctt 
ttatgaagag 
acgtcatcaa 
aaatttctgg 
atagtaatag 
tgttcaagtt 
atgttgtttt 
ttgttaacaa 
gtgttggtgg 
aaaatccctt 
atgttaattt 
tccctaaaga 
ttacccccct 
atgaccgttc 
atactgctgg 
ccgccagggc 
agtttctgtt 
ggaagaaacc 
tgtgcgctaa 
aagaacttgt 
ctttgtttga 
cagaagggtg 
atcaatacaa 
taacatccgc 
caaaggatgg 
cagcagaaga 
attccgatga 
ttgctgcccc 
agcagatgga 
atgacggaga 
aacagccaga 
aacaaccaga 
gcagcagcag 
gcagtagtag 
ttcagaagag 
tgagagtaga 
cagttgatga 
ccaaggcagc 
atgacctttc 
ccacttgcac 
atatagtaat 
taccagatgg 
ctgataacat 
cccccagaaa 
ttgacactga 
agcacttgaa 
ctcattgtat 
aggttcacaa 
gtgatgtaaa 
gtgatgcaga 
cctccacaag 
agactgttgt 
gtgatttctt 
atgttaacac 
atcaatgtac 
tacatgtctt 
ccccatttgc 
tctgccatta 
attcgggagt 
tccctcacca 



tgtgatgaga 
gtgtgcaagt 
tgaggacgaa 
cccttcaagc 
tatgtttcaa 
ttcccctgtc 
tttggtgact 
tggatcactt 
caacaacaaa 
tagtaataac 
taataatgta 
ggatgacttt 
agaggtaaat 
agaaattcat 
aatgttttgc 
cgttaaacgc 
cgtgtatgaa 
ccacgataat 
tagactccca 
ggggattcaa 
gcctcctatg 
tgactgggac 
caaaaagcac 
taaactggaa 
ctctagtgat 
cgtaaaggga 
aaagaagaat 
tgaggtgtgt 
gacaacaccc 
agaagaggag 
cgcaccagca 
agaagaaagc 
aagagaagag 
tagtagcagc 
cagtgaaaat 
aaagaggcta 
agaagaacaa 
gatgatggaa 
gacacttgca 
agtagctggt 
ttccgatgtt 
gcaacctggc 
tactgataat 
tataaccacc 
ggatattgat 
tgatgatagt 
aaagaggcgt 
gagtagtgat 
gtcaccaaaa 
taactcatac 
agcacagcct 
tgataagaaa 
acagtctcca 
gtcacgcaag 
cactaataat 
aataataatc 
aataacaaac 
atgattaaac 
aaacaacaac 
ctttctaaaa 
atccccctat 



gaagttaaag 
atgccccaac 
gaaatgggag 
ctcccctctg 
aacgtgtgga 
tctaggacag 
aaagtgtcta 
gatgtggctc 
ccctcgacct 
agtggcggcg 
gatggggaca 
gtttttcccc 
gtagatgaag 
acctttaatg 
aagggatttt 
ttccgtaact 
gcgtcaatta 
gttaaccctg 
gctaacttga 
aagtatatgt 
gctgtatacg 
ctgcctgaat 
agtgagttat 
gagggacaga 
gatattcacg 
gtctcaaaaa 
aatagagtac 
gagagaacag 
ctcgtacctt 
gaagaggaag 
caggaagaaa 
aatggtaatg 
gagaataagg 
agtagcagca 
gaagctgaaa 
agtgaaaggc 
caacaacaac 
gaaattcctg 
ctcaagacag 
aatacttccc 
ggaaatgact 
gaatgcgacg 
gaatgcacag 
acatgcttta 
aattgcagtt 
gaaacaaatg 
aacaagtccc 
gaagaatcag 
gctcatgtta 
aacaatgtta 
aaaagccata 
cagaaccaac 
cctagtttta 
cacacaaagc 
gtagtacaga 
ttagtactta 
aaaataaaag 
taactatagt 
aaaaattcta 
ccacagcaat 
cgcccttatg 



42240 
42300 
42360 
42420 
42480 
42540 
42600 
42660 
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42780 
42840 
42900 
42960 
43020 
43080 
43140 
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43260 
43320 
43380 
43440 
43500 
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43620 
43680 
43740 
43800 
43860 
43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
44520 
44580 
44640 
44700 
44760 
44820 
44880 
44940 
45000 
45060 
45120 
45180 
45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
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ctccttctcc 
ttcaacaaca 
cgacctctcc 
atgtcaagca 
ttagtgataa 
atatgtttgg 
ctacaacaac 
acgaagaagg 
tcgggatctc 
aactcaaaaa 
tcgcagcaga 
atatgtgcgg 
tgtatacttc 
ccaagaaatg 
tatgcatggc 
tcttggctga 
acaaagcttt 
atactaacct 
gaatacatgg 
acaagaggga 
acactagtaa 
ctgatatgca 
tgggtgtgaa 
cgaatgatgt 
attctagaat 
gcttcacagg 
tagaatgtga 
tctttcatcc 
acaagggaca 
agcgtagagg 
tggatgtatg 
gtaatagatg 
gcagagacgc 
actgattgat 
actaatgttt 
gatgtatttt 
cacaggggag 
cacacgcaag 
ataactgggg 
gacaagaaaa 
acacgtgtga 
tttcctctcc 
ctctccagat 
ccagatatct 
actgaattag 
taccagtgtg 
cctttcatgt 
tttagtagtc 
ctggaacctc 
tacacctgtc 
ctcagaagaa 
ttttgaagag 
gtttgaataa 
actacaaaat 
tactggtatc 
actccggcag 
gcagcttcac 
gtgcgtgtaa 
ttatcgcgct 
gtcatagcca 
gcgtaggcca 



tctctatact 
atgccaccaa 
atcgtctttg 
agaagtctcg 
cgatgaggac 
tgattcagat 
accacttcct 
agcagcagca 
tgaagctgaa 
gatgatgaaa 
ttataaaaca 
tgtcgcccct 
ttcctatgta 
tcaattcctt 
ccctcccggt 
acatggagat 
ccccaatctg 
ttgttggata 
cgatgcgtca 
agaacaaaag 
tatgggaaca 
agaattccaa 
taccaaccca 
accagaattg 
gagtgcaaat 
aggtgcgaat 
cgaagatgat 
attttatgag 
gacttgtaat 
aacagtcccc 
tactgccatg 
tatcgtaagg 
ttcatgtgat 
tggttgatat 
tatatatgat 
tgaaactaca 
tcttagagag 
taaaaacacc 
aatttcctct 
aaattagtga 
gcgcttctgg 
agatatctgg 
atctgggtaa 
ggctgtacac 
ctctctaaac 
catataagaa 
ttatatttaa 
ggtacctttt 
atggtagctt 
aatctggatc 
gcccgtttat 
tatgaagaag 
aatatagaga 
ttgtacacat 
tgatttcttg 
cagaagttgt 
ggcgttgttg 
agatgcctct 
tctcaaaaaa 
taaaatcaaa 
tacagctagc 



tcctccccat 
tacaattact 
tgggatgatg 
aaccgtcccc 
gataacaagg 
aacatgccat 
actcctcgat 
gcatcagcac 
ttgaaaaaaa 
gatcctctcc 
agggcaaaca 
cctcaatttt 
tttcctcctc 
aaggggagaa 
aaaaggggag 
gaatacaagg 
attgctcgtg 
gttaatggag 
gagtatgcaa 
accaatggga 
agcactgatg 
tcaatgggaa 
cttcccaacc 
gacatgatgt 
acaggaacat 
gtagtagtgc 
ccaagtattc 
tatatggcca 
gttgtccagg 
aaattctgta 
tgcaaatggg 
aggtcatgta 
gttcccaagt 
gtgacatttt 
tctatttttt 
ctctggcaga 
gggtgtgaat 
acttcctaga 
ccagatatct 
tatcataact 
gcgcgacaag 
gtaagaaaaa 
gaaaaaaaat 
atgtgagtgt 
ttttcctcat 
ctggagagat 
taaatataat 
ttctgaccac 
catttgttgg 
ttgtcggaaa 
tgaggattgg 
aagaagatag 
tacatttata 
acattgagag 
atattcgaga 
agggataggg 
ttgttccagg 
gttgtattgg 
cttggccatt 
t tggtcagag 
ggcacgttga 



tcacttctga 
accccacacc 
acgatgatga 
ccattttttc 
gagaagaaga 
caacttctac 
ccatcatgga 
cgtctattgc 
tggaaaagaa 
ctcacctata 
tttcccttta 
gcgctgaatt 
ccacacctgc 
aggctttgag 
gtgtattttt 
tcccaaggat 
cagacacact 
ttgtcgtatg 
caggagaaaa 
gtgccagtaa 
tgcaagaatt 
caaataccaa 
ctcccccaag 
ggctttattc 
ctcccctgtc 
ctaatggatt 
ccaattctta 
aatatctatc 
agtggttcaa 
gtaacatttc 
cgaagactgt 
catccatgct 
gcagggaaag 
ctgtatattg 
taaaaacctt 
atcagaccag 
ctggctaggg 
aagggaggga 
ggctgtacac 
ggggaatttc 
aaaaaaatta 
aaattagtga 
tagtgatatc 
tctaatctca 
ttcttactcc 
gggagtttta 
caccatggat 
ttgtgcatct 
atttctagga 
aagcttcgta 
aaacgtatta 
tggtattgaa 
ttgttttatt 
aaaaaattga 
gagttagtag 
gcggtggcac 
ataatctttg 
accaccttat 
tctccacaac 
gtgatattat 
atattggagt 



ctatcaattt 
ccctccatct 
tgacgaagaa 
gtacatggaa 
atgttttgga 
tgcccctttc 
tactgattcg 
cgcctcttct 
aaagaggaag 
tgtaggagga 
taaagttgac 
gcccacccca 
catgcataat 
gaaatggatt 
ggctcacttg 
gtttgtttca 
gtgcagtgat 
ctttgataaa 
ttttgatact 
gaagaggcgt 
ccaaacgatg 
ccccatagag 
attggtaatt 
gccttccaga 
taacacccca 
tgtccctccc 
caattacgaa 
ccctcttgtt 
gggatccttc 
ccacgctttc 
aattagacat 
cgcatatcac 
ggttcgcaac 
tgtaaataag 
taaaaatata 
acccctgacc 
gttcatccct 
aggggtgact 
gtgtgagcgc 
ctctccagat 
gtgatatcat 
tatcataact 
ataactgggg 
ttttttatat 
tctttgtttg 
aataacaacc 
atttcaaata 
tgcagtccaa 
cataaactac 
ttctctgcaa 
agggactata 
gaataataat 
tgcattatat 
tacaatttct 
tagcagaaga 
cggtggcagc 
cgcgtttatt 
tcctatattc 
gttgacgggc 
ggttgtttag 
acttgctgtt 



tctgattcat 
atgtcctcga 
gacgaaaaag 
actgtatctt 
tcaaactttg 
cctcctccct 
gatgaatgtg 
tctatccctg 
gaaattaaga 
gaacctcctg 
cctagtatcg 
tccatagatg 
aagaaagggt 
cacgagaatg 
gaacaaagat 
agagtattga 
atgacattct 
gatgatggag 
gtagtgttcc 
ctcacgcctg 
ggaacaaata 
acttcatcag 
actcctttaa 
ggaggtggaa 
attcctacct 
acgttcccct 
gaggataaag 
ccatcatata 
tctcttgcaa 
ttttgtaata 
ggacaatatt 
tacattgttt 
gacatggatg 
ataccaataa 
catataaaat 
taaagcagac 
caatgttaat 
tggtgatatc 
ttctgggcgc 
atctggctgt 
aactggggaa 
ggggaatttc 
aatttcctct 
agaaaatata 
tatgtcctac 
cttcttaatc 
agacattatt 
ctcaaatcgt 
ttaaaaacat 
gtttaaccat 
atggcaacaa 
aataataatg 
ataaaaaagc 
tccttttttt 
agtagcacca 
ggcagcggca 
catttcagtg 
tccgatgtcc 
tgcccatgca 
aaggtccaca 
tcttggttta 



45900 
45960 
46020 
46080 
46140 
46200 
46260 
4 6320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 
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acggtatgga tcaccttact tccatcattc aagaattctg cctcttctct gtgcagaaga 49560 
gatgtgcctc tgctaactgt cccgtcgtcg aggacaaaag caggtccagg tatgggtgtt 4 9620 
tcgcaatctc cgtaagaagt gtcagcaacc gtgaagtggg aaggatatag tccattatct 49680 
ccaactccag aagtgagtgc agcgctataa acttctccgg tgatttggcg agtgactgga 49740 
ttacttagcc ctgcgtcaat ttcagacata tttccttctt cgggtgtggg aaggaagggc 49800 
ctaggtgggg cgagagtgtc gttgtttctt tcagccgtag tagtagcaag agatgtgacc 49860 
caagaatatt tcttgttctt agactgtctt ccagaaccgt cgtcgataac aagagatggt 49920 
tcgtcctcgt cactgctacg gttgccattc tggttcttct tcctctttga atccttcttc 49980 
ttagacttgg acttgttctt ggggccagga acaggtataa catccacatt gggtgtagaa 5004 0 
gacatcctcg ttccagtaga ttcagtcaga gatgtaagtg ctggcacggg aggcatggcc 50100 
gtaacagaga taccttgtgg tactcggaac attgcagaag aagacgtaga attagagttg 50160 
atacttgtaa cagcagaagc agaagtccga gaggcgatgg cagcagctct ggcggcagca 50220 
atcatagggg ctgtagtggt gcaaatcggc agagtgttgg atgaagtcgt ggcagcagag 50280 
gtagagttga tgtgagacat cttgatttct tggttggttg gttggttgtg gactattgtg 50340 
ttctcagaga ggacttgtag tcgactgtgc tctacccaga tccacagccc ctttcatatc 504 00 
cgattttcgg gggttgccag atactaatat actcctccac gccctcttct tgcgccatat 504 60 
ctataagggt cgaaagttgt ataccttgag ttgatccggt tataaccgcc attgttgggt 50520 
ctttctgaga atcgagagtt gcgatcacgg catggcctcg tgggggaatt ttcattgacg 50580 
taagcagtaa cgtcagaaga attgttggag cggggttggt taaagtgcct ctggtgttgg 5064 0 
tggcggtgtt ggtggtggcg gtggtggtga gcagcagcag gggcgtcgta gtgttcgtat 50700 
tgctcacaat aatagtcctt gggcacttgc attcttcttg gatgaatata tctttctctt 507 60 
tcttcgtatc tgggttcgtc ctctcttttt atgtgttcag tttttatgtg agttactctg 50820 
tccatctcca ttttctcaga atttcttccc cctccattga cgtcatccga ataactcaac 50880 
tggcgccggt gatactgatg acgtcttatt cttcggacct tattatcccc tccttctgaa 50940 
tcatcagatg attcagagtc agatgaatca gacgaatcgg atgagtcaga ggaagaagaa 51000 
gaagaatcgg atgagtcaga agagtccatg gagccactcc caattccact actaaattca 51060 
gcttccaatt ctctgacgac atctatattt ttagtatctt tttcttccac atcttgattc 51120 
gtattagggt atgtcacacc agtttcgttg ctttgttctg cgaggatatt ttcgaaacct 51180 
tcaagaaaag gaatgtcaat aaaaccttcc atttctacga tctccccttc ttccaattct 51240 
tcctttttgt tctcttcttg ttggtttctt gagacggtgg tttcagtaaa atttgggtta 51300 
tctgtagatc caaatccacc agttccacgc acgcttcttg cttgattttt gttgtccttt 51360 
ttagaactag agtcaataat cgttctctct cccgtggttt cattaatata cacaatctgt 51420 
tcttcctcga catcacaata tcttaagaaa atcaactggg caatgctggt tccctttctg 51480 
attggcacac ttttttctgc actatggttg cgcagaatca ctttcaattc tcccctataa 51540 
tccacatcaa tcgttccagt aggtacacta gtgttgttct tccatgtcat cccagaacgt 51600 
gacacaattt gtccataaca gccgtcggga aacttgtcta ttccatatcc agtagagatc 51660 
ttggccagtc ccataggttc gatatccatt tcttcagagg gaaataggtc gtaggcgaca 51720 
gaaccgggcg tggcacgtct gggcggaagt gcagtttcct ccccgggagg ggcgaatctc 51780 
ataaacacga cagatgcaga tgagtccatg gctggataca gggagtacac agtggagacg 51840 
ggatgtaata agtgttcaac gtctggagga caagtgagta ccttcctcgg ttcctcccac 51900 
ccttttataa tcgtgaaacc cccacacccc cgctaaggca gtaaaaaaaa ttgacccacc 51960 
atcacccctt gtccttgaag gaggaaacta gaagataact tttatcatgg caacatttac 52020 
tgaacaggat cataaaaatg cgtttttata tgctaatgag aagctgaggc aggaaagaat 52080 
atacagactt aaaatgtctg agccttcagt ttatgctttt attgacataa aagaaataga 5214 0 
aaatggttgg gaaaaagaat tcgggctttt agtacaacca ggacagaaat tagctccttt 52200 
cagggatatt tcttatgact caagcaaact tgattgtgac gcattttctt gcataccttc 52260 
agatatactt cattctgata atgaaaaaag agtaggagag tgcaactttg ccgaacacac 52320 
ctctgtctca tttcctgtca agaaccctga gggaaaaaca ttgcgccatt tcacggcatg 52380 
tggtccaggg tgttaccgga ggtacaagca aagagacccc catactggtt tgccagtagc 524 40 
cagaggcgtc ctgatgcaag atcacgttga ccatgaaact ggaaataaaa tgtgtgaata 52500 
tctgaaccag agcttagtca tgtgggctgc tgtcccttgg atacgacctg gagaccttac 52560 
tgaaggttac aacacaacac acgtgcctgg cttcgcattc aaggaggatg acgaaaggga 52620 
ttcaaaaaga gttaaatatg aaaatgtggt catttcaaag gcttattgtg atttctttaa 52680 
acagtattat gacgcagact ctggctcatg ttatcgatct ggatggatga aatttgtcca 527 40 
tttaatgttt gggcagtatt ttactaatct ttcttacaat ttagctaacc caaaacctta 52800 
caatttaact ggaaatacat ggtctgatgt agtttctgta ttgacagata accctattgt 52860 
agatgcgggt gctgctccgt ccagatctga aatggatgaa attatcacca agaagaagtt 52920 
caacgtattc ccttctgagc agacatctgc tcgccagaaa gcagagaata taatacgttc 52980 
tcagtacgga gatggtgtag aaatagaccc atcttctgtg gatgctttaa tgcagtttgt 53040 
taatagggaa ggtgtagtag gaacagagaa aaaatccgac cgcctcatgc gagtggcaga 53100 
cgctgttatg gatgcggcta tgcgccttca agtcatgggt ctggacgaca gtcaatctag 53160 
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acgattattg 
tttttccagt 
taaaggcgct 
aagcagatgg 
tatcactgaa 
caattcatct 
ttcttcgacc 
agtagtgaga 
ggacgataat 
ggccagtaat 
ctttatagac 
gttctccgta 
agcgatacag 
aattgttata 
ttggtacgat 
agagtttgca 
aatcgtcgct 
aggagcacga 
gggaaagaaa 
aacaatgaac 
caccacacaa 
tctagaaaca 
tctagattac 
caacactact 
atttctggtt 
caatattggt 
atcatatata 
tgctctagtt 
aggtgtcggg 
cgagagtgac 
tcaaaccaaa 
aggtcccaaa 
actcctcgtg 
tgtggaaaat 
tgttttcatc 
tgtgcaagaa 
acacagatca 
ctcaccttcc 
tgaagatgat 
tggagaagat 
gaatgaagag 
tgccgccgct 
aaaccacgca 
actgctccag 
ctccacaccc 
cgagctcttt 
agacatctta 
ggaaactccc 
taaaactagc 
tatacttcaa 
agacgattct 
aaactttcga 
cgatgacacg 
tgtatcaaaa 
agatttgttc 
ccacccaaaa 
ctctaaaata 
ttcagacgcg 
taacaacaat 
taatataagc 
cacaaacaat 



ttaaaaaata 
tctctcaaat 
tcagctaaaa 
tcacccgaga 
gacccaaaac 
atcaagagcg 
agaaaactgg 
cagttcttgg 
attcctctga 
atgtgcgcta 
tcggcgagtg 
ttcttggagg 
gcatttgccg 
ttcgtaatac 
cacattttta 
aaggcaggaa 
atcaacgttt 
aaatcaaaga 
agtaagtttg 
gcttttgggc 
gagggaagga 
ggtcaaactc 
gtgggagagg 
agtgataacc 
ctaggaatag 
ctagcttttg 
aacatgaatg 
ggagatatca 
gctaaataca 
taaaaagtga 
ttcataatca 
aatattcctc 
aatgaccgtc 
actcttaacc 
aaagaaatgc 
atattcgaag 
gtttcaccaa 
cctcctccat 
gaagaaatat 
tcagaagagg 
ggagaaaaca 
gccgccgccg 
gttagtgcgc 
caacagcaac 
aacaaattca 
gtttgtttcg 
cccattgttg 
ccagtgaaac 
tttgaaaatg 
aggcctagag 
gtagacatta 
gtgtccgaat 
tcttcaaaca 
ttaaatattg 
gagacgggca 
gctagagttc 
actcgcaaag 
atcttttctc 
aataacaata 
agaaaaatga 
tgcactaacg 



tgattaaaat 
taattggagt 
gaaaggaaac 
cggtgacaga 
accctgcgcc 
gttcttcttc 
aagaaaaggc 
acattttaga 
tgataggcgt 
gatttgcctc 
ctatcacttc 
agtctgttat 
atctggcaac 
aagtacttgg 
gcccagagga 
atgtagatgt 
tccttcaaac 
ttgattttct 
tctacataca 
agcgtataat 
gggatgacga 
ttaaggatta 
aatgggtaag 
tattcaagaa 
gtgtattggt 
cgttcgcagg 
ctatgggtgt 
aaacagaccc 
acatgattac 
gggttataac 
ttgtgaagac 
aatcaaaaaa 
ttggaaatca 
ttaacagtct 
agtcctattc 
atggactaat 
gttcccctat 
cgatgccttc 
acgaaacaga 
aagaagaaga 
agtacgttct 
ccgcagctgc 
atgactacac 
agcaacaaca 
acaagttttt 
atgtggataa 
ctgaatacat 
cgtgcaggag 
atgctgtgga 
atttccctat 
aaaacccctg 
attcgcgtca 
cttgctttat 
aatttgaggt 
ttttgagcga 
gaaatgttgc 
aaactataaa 
ccccaagtga 
ataccagtgt 
ccatcacaaa 
gtaactatcc 



gagtagaaac 
aactttggcc 
ggccattaat 
agaagatgca 
ctttgtggac 
ctcaatatgg 
gagtgttttc 
aggtaaactg 
ggaccaaata 
ctcagctctg 
taggttagcc 
agaatttgta 
tcttgccgcg 
gttgattttg 
tttaaagaag 
gggtgtagct 
tgaagaaaac 
tcaaaaatac 
agaagcagct 
aacagctgct 
aacagtgact 
ctcgtctgct 
aaatactgcc 
aactgtttct 
agcgtcccat 
tctcctagca 
agtgaattcg 
cagaagagta 
agatttcgtc 
ccccccccca 
atcgacggtc 
taccatggat 
tcgaacaaac 
agaaagggca 
tggatgtata 
tactttcgaa 
acctctttct 
tattaaagaa 
tgaaaatgtg 
agatataatt 
cgcattctct 
tgctgctgct 
actgtccgcc 
ccagcagcgc 
actgccgagt 
aattgcgcaa 
tatcaacggc 
aaaggaagtg 
agataaacat 
ccctaaaaaa 
gggttcgtgt 
ttttaatgaa 
atactctcaa 
aatgatggag 
ttcttcatta 
attgttttat 
atgttcagag 
taatcctaat 
gaatatcgag 
ctaccaatgt 
cgatcgtggt 



aacccagaat 
ataaaaagat 
aatggcgaac 
cttttatttg 
at tttacact 
aacgatatac 
gttaaaaatc 
ttttcggacg 
ttgagagaag 
gagtctagtt 
gtacaactgg 
gttgccgcaa 
tcagccttaa 
gatcttgcgc 
caggttctag 
caaccagtca 
ggggaagaaa 
ttccattcta 
caagaatact 
gatgatagtg 
aagaaaatga 
gttaactata 
ctaaaagaag 
cttgctagta 
attacacttt 
tttattgcac 
gacgcaatat 
ggaatggtcc 
tctccaatgt 
acaaaggctt 
gaggaaggtc 
tcgttgacta 
aaaccaatca 
agtcttttga 
cccaagaaca 
tggagagatg 
acaaaaaaat 
gaagaatttg 
gaagatttca 
gttgatgacg 
aatcatctga 
gacattgaga 
ctccagcaac 
tcctcatctg 
aatggcttct 
tataatggac 
cttggtctga 
aaagatgtgt 
ctcgcattcg 
atcaccgcct 
ccgcttttga 
ttctcaggag 
aaaaacccca 
ggaattataa 
gctacggcta 
ttttctgtat 
acggataagg 
ataagtgctc 
gacagaccta 
atggcatgca 
aaccagcact 



atgcaagaca 
ctgttttctc 
agcatagaag 
caagagaaaa 
caccagatat 
tttcaagaat 
ttgtggtgaa 
gatacgaatg 
tcattaaggc 
tggttactgg 
cagccagaac 
gccttcggct 
ccgttattgg 
tagggttagg 
tgtttaggag 
cccctgaaga 
agaaggaaga 
ctcctttgat 
tigggaggaag 
acaccaccac 
ggagtattat 
acgcctcccg 
agacaaggag 
tggccggcgc 
tacgtttcac 
ttatgagtat 
acaggtctac 
agcgccacgt 
tagacgagat 
taaattagac 
cgtgctcaaa 
atacagtaac 
ccgaacaaga 
agttatattc 
agtacacgaa 
gaacaaaagt 
cgcctcggtc 
aagaagaatt 
taaatggtga 
aagaagaaga 
ggcgccagac 
agaaggacaa 
agcaacaaaa 
agaaggtcac 
cagaacagac 
tcgtggagct 
aatgcagtat 
ggtgtcagcc 
cagaatcgcc 
atttttgttt 
aaagtggatc 
ttaaaaatga 
acattgaaat 
cccatagaaa 
tggcattttg 
atttaccatt 
tacatattgg 
accagaacaa 
tccgaaataa 
aggaaagatg 
tgtcacatag 
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54420 
54480 
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54600 
54660 
54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
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55380 
55440 
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55560 
55620 
55680 
55740 
55800 
55860 
55920 
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tgtaaaaggg gaagatttct ttaagatttt aaataatagt aaagtagatt cattgaaaaa 56880 
attgagcaga gtactgattc ccgctcctcc ctctggaaat tatacatcta agttttgtga 56940 
tagaagctct atgtgccata gcttcttttg tagagggatc gaaccagtgt ctacttcttt 57000 
ctcatcggac agttttgaaa agactaaact tgttttgtat ggtaaagtcg ttgacgttat 57060 
caatagttat tctgccataa aaacttccca taataataga atcagggtct tttttaactc 57120 
tgaggaaaaa gataataaga ctatcccctc tagagccgaa agtgcaaaaa atgcattcaa 57180 
ggatatactt gttcacgaat gtaataaaga acgagctgtt tcatattttg agcaaaacaa 57240 
attatcctct aaagatgggc atctatctaa caagtggtgg atcgaactta atgacttgaa 57300 
cattatgttt gagaaacacg tggaagattt ttacaagaaa tgttctaaag taaatgatgc 57360 
agaatcttta aaggatattt ttaatgattt tgaaaaaact tgtgataaat acaaaactgc 57420 
caagagggca attattggag cacaagaccc ttctacttct actccctcta aaaaggagaa 57 480 
tggtatcact aggattatta gtacattatc cgaatttcat tcaaaagatg aagctacagt 57540 
aagtgccctt ctcgacaaaa caatgctctt gggatcgagg acaataatgt ctggtgttag 57600 
atgtgttata cgtaacaata gtgtgttttc gggctttgaa aataagaaca ctaataataa 57660 
ttgggaactt gagattagac actatgtcat ctctatggga ggtgctgcag tgacaaagat 57720 
ttccgatgaa gatttggaac aattcacgcc tgtaagaggt gctgtctctg tcactacagc 57780 
acctaatgat aagctacctg taggggcaca tcagacatgg aaggatgaac aaacactaaa 57840 
aacaaacact aaacgtaata gtctatatga ctcttacaat tcaaaaagga ataataggga 57 900 
taataataaa ataaaaaatc gttcattaaa actatcagat tttaattgga gaacacccaa 57 960 
tatctctatt caagaattta atgcaaataa agatgatgtt aacaagaaga ggtacgcaga 58020 
agtcgtggcg tcagctgctc caaagtcacc ttcaccaacg agcagcagca gcagcaacag 58080 
caacagcagc agccctcctc tttcaccgct ctcaccaaca gtgaagaata gtaataataa 58140 
accattgtat attcctcccc ataaaagaat gaccactact gctgtttgac caatttatgg 58200 
gttattttat aatgttgtgt gatattttat acaataaaaa tatataagaa aacaaccaaa 582 60 
tgattttgta tatccttgtt ctgtgttggg cggaaattcc tccgcgccca ttatgatgaa 58320 
aaaattgcta gaggccttga atcctggtca cgacaaggtt aacaacaaaa aaaattacct 58380 
gtatgtgcaa gtcatatcct ccagtcccac gcatgagaac ctgttattgc accacgattc 58440 
gtgtaacaaa gtcgtatctg cacatgtgca taaattcctt cacgctaaaa tgaacgacaa 58500 
aattgccggg agcaatgggc gtgagagccg gttcctggaa aaccccacct ccgttccacg 58560 
cactacctca ctattgcacc attttttggt gtataaaaag gcacggagag gtagctaaaa 58620 
catccactcg cagagacatc accgtcaaga tgacgtctcc tactagtgct ctccgcgttg 58680 
ctactactgc cctgctgttt gctgtggtgg ttacggctac tgtgatttct gaagctgggg 58740 
gaactaagat agacgaccgc tgcgccttcc cgccatgtga ctacttcccg gaacctgaat 58800 
gacgtggaag aagatgtgtt agagctgctc gaagggggaa caacaacaat aacaacggaa 58860 
cccgtgatgg agccaaagtc tttggcgaaa tcgcaaatgg actgattggg cgtaagaggc 58920 
gtgaagcagt atctctaaca ccagtccatg aagatatgcc agatttcttc ccctaccctc 58980 
accctgagca tcccattgga gggcgtctac cacgtgaagc agctcctcca cttaaaggtg 59040 
cacttggacg taaaaggcgt gaagcagctc ctccacttaa aggtgcgctt ggacgtaaga 59100' 
ggcgcgaagc agaatccttg gaggaagaac ttgtgtctgc tgaagaagaa cgtgaaaagc 59160 
gcgaagcagc tcccccactt aaaggtgcac ttggacgtga aaagcgcgaa gcagctcccc 59220 
cacttaaagg tgcacttgga cgtaagaggc gcgaagcagc tcccccactt aaaggtgcgc 59280 
ttggacgtga aaagcgcgaa gcagctcccc cacttaaagg tgcacttgga cgtaagaggc 59340 
gcgaagcagc tcccccactt aaaggtgcgc ttggacgtaa gaggcgcgaa gcagaatcct 59400 
tggaggaaga acttgtgtct gctgaagaag aacgtgaaaa gcgcgaagca gctcccccac 594 60 
ttaaaggtgc acttggacgt aaaaggcgtg aagcagctcc tccacttaaa ggtgcgcttg 59520 
gacgtaagag gcgcgaagca gctcccccac ttaaaggtgc acttggacgt aagaggcgcg 59580 
aagcagaatc cttggaggaa gaacttgtgt ctgttgaaga agaacgtgaa aagcgcgaag 5964 0 
cagctccccc acttaaaggt gctcttggac gtaagaggcg cgaagcagct cccccactta 59700 
aaggtgctct tggacgtaag aggcgcgaag cagctccccc acttaaaggt gctcttggac 59760 
gtaagaggcg cgaagcagaa tccttggagg aagaacttgt gtctgctgaa gaagaacgtg 59820 
aaaagcgcga agcagctccc ccacttaaag gtgctcttgg acgtaagagg cgcgaagcag 59880 
ctcccccact taaaggtgca cttggacgta agaggcgcga agcagcagca gcagctatgc 59940 
ctccccctga agacgatctc gacttctttt acgcacctgt tgctttgcct ctacatggag 60000 
tatggaaagc accagaacct acaggttaaa atgtggctga acatcacata tacaagttaa 60060 
ttggataatt gaaacaattt gactttttag tacttaatgt ttattaattg aaacaattgg 60120 
actctttcct acctaaggtt tattaattgg tacattgtta acacttgctc tttgctcata 60180 
gcagacatac ctgctatttt taacgctttg tttaacgtgc tttttacaat tgatattggt 60240 
acatgtttgt gacctcttga ataaattttg ttagcatcaa tatttatttt atttttatcc 60300 
ttttcttgtg tgttagaatt agaaggagat gtggaggttg ttgatgtagt agatgctgtg 60360 
cagagttgct cctggtgttg ctcctcttct tgattcacat tgtcctcttc aacatactcc 60420 
tctaatccaa agcgctctcg aattttctcc tttacaccct caccacaatc tcctatcgaa 60480 
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gactgtgcta 
ttgttacttc 
tctttgtcct 
ggcaatgtac 
ccctcttgcc 
ttctgagcca 
aggggcaaaa 
tggcataggt 
fcgtcactaga 
ttcttccccc 
tcctccttct 
tagtctcatt 
tgtaatatca 
attagtatca 
tgcaccagcc 
atacttttca 
ccggcaattc 
agtttcaatt 
ctgtcctcct 
gagactactc 
ttcagtgatt 
gtttaattct 
agaaaagtga 
cgtgctaagc 
tcctacaaca 
gtaatcttcc 
tctcctcgca 
atttaaaaga 
taaagaggct 
tttcttccac 
aaggcgtctt 
tggaagaaga 
ttcctttata 
ttgttctaga 
ggatgtggta 
tgccatttgg 
agaggatatt 
tggtgaccaa 
atcaaccatt 
gattccttta 
atttgttaac 
ggaacgtctt 
taaaccaacg 
gttcggcagc 
aggcatggac 
attatactga 
ttttaaagag 
tagaggatga 
gagccaatat 
tatctgataa 
ccaagaataa 
tgaaacaccc 
gtatcggagc 
tcaccatgtc 
acgttctcag 
cactcttgtt 
ccactattgt 
ttaacaacaa 
caaggactgt 
tgtataatag 
atttcatcat 



atataatttt 
tgcccatagc 
tttctataca 
acctatagaa 
ccatagtttc 
ggtgttctgc 
ggccccccat 
gacaataatg 
gattctagat 
gacgtggacg 
ttattatttt 
ttcttattgt 
gatgagtagt 
gaattgccat 
gtgaaagtat 
tcgatagttt 
gcacatgcat 
aaagctgata 
ttctctagtt 
ttgagaaaag 
gggtcgaacc 
tccgtgtcag 
atatttttaa 
attacatcta 
ttttctggtg 
tcagcaacag 
gtctcattat 
tctaggtagc 
tgtctgaaga 
ggtctgctct 
tgtgcggatg 
ggaacaggag 
ggaactgtct 
ctacaaaaaa 
tatccaagtt 
aataaacaac 
ggttataagg 
caagaacaaa 
gaactgatag 
agagaaactg 
cacaaggact 
attgattttt 
gacaagtaga 
taagcaagca 
gacttcagca 
tgatgatatt 
atatgcaaaa 
tgtcattata 
ctatttcgcc 
agtgctgtgt 
gtggacgtat 
cgactacaag 
atcggaaaaa 
tatatacagc 
taataggagg 
cttaatcaga 
aaagaagact 
aaagat ttcc 
tagttctctc 
gacagattat 
tagttaacat 



tgtagtagaa 
tggtagtatt 
aaaggataat 
cagtgttatt 
gtatctccat 
ccataatttc 
cttccagtgc 
aagggagtgg 
ccgggggagg 
gttgctgttg 
ctgcttctag 
tattttttat 
atttgcctgg 
gtgcattatc 
ttgccagaag 
tattaaataa 
tattattctt 
tttggcccat 
gaattgatcc 
agtgtaaaag 
cgaatttgat 
aggaagggaa 
gtacatttcc 
atgataagaa 
tcaagagatc 
tttggttgtt 
ctgcatcatt 
tactcatctg 
cgcgagtgat 
ccaaggccct 
attctagaaa 
agagcaatac 
catctattgg 
cgtcagacag 
tgaagaaaac 
ttccaaagtt 
attatgttgt 
tgattattga 
gcatgtataa 
cgttgcacgc 
ctatacctct 
ctgagagaca 
gaggatcttt 
ctggctattg 
gcagcagcag 
acacaagtgt 
cttatagata 
cacactagag 
ataattgacg 
ggttctctca 
ggatgtaacc 
gaagagacta 
ctgaacatta 
ccagaaatgt 
gctgtaatat 
ctagatatag 
agtcccattt 
cacattgaag 
cagagagatc 
atgaaaaaat 
ctttttaaat 



gtagagaata 
tttgaaggca 
cgttcatccc 
tcttctatat 
tttactattc 
tgatgtttta 
actattccca 
tgaaaacaca 
aggtgtaggc 
ttctactcct 
ttccttcatt 
gcgttcttta 
acaaataagg 
ttttgttata 
agccgaggct 
tgtaactaaa 
ctttttatta 
gattttaagg 
ccattcatta 
tgtttgagta 
ctgatttctt 
agaaattata 
gtctgttgaa 
aggggtaact 
gtctgtttct 
gttatagaaa 
agctcctcct 
ggagggaagg 
cacaacttca 
tctcccagat 
taagggagag 
tgcgtgcctt 
tacaattatc 
ttatgcggca 
aactgaagac 
gtatgaaaaa 
ttcaatngaa 
tgaagaatct 
caataacaag 
acaatctctg 
attttatgaa 
actacaacaa 
tcaacacgta 
aaaaacacgc 
cttcttctaa 
ctattgcaaa 
atttagcaat 
atgcctccaa 
gtgatttatg 
accgggaaaa 
taaactttga 
catctacaaa 
cccactattt 
cagccatgat 
tgatcttttt 
atatcaaata 
ttaaggaacc 
atggaggaga 
ctcgtgtact 
aaaagtacca 
ccacaatcag 



atggtacaac 
tgccttaaaa 
catgaaaaag 
cttccagtat 
ctgtctctct 
tcaacaacaa 
ttagaaggaa 
ggataatcat 
aacataattt 
tctacctcta 
accatttcca 
gcttgctttg 
ttatcactca 
aactctttgg 
gatgaagagt 
ctggccgaac 
tccattgagg 
ctcgcacaca 
aatttttctt 
gttatacaat 
acactatcca 
tcattagctg 
gaagggtgct 
ctttctcctg 
gtcactccca 
gacccataag 
cctcctcctc 
ggagggttaa 
acgatggacg 
gaaaaacatc 
gatggctgct 
gaacaactaa 
aattctaatc 
ttatcccatt 
gtattgcctc 
actcttcaac 
gacgacgaca 
tataaaacta 
tttggtaatg 
aggtacgaca 
aacagcacgt 
ctaaaacaag 
tacagggaat 
agcagaaaga 
ttttaataat 
cagtgttttg 
atcttcttta 
ctctacagtc 
tgtataccct 
ggcactgttc 
tatcgttgac 
acatatacgt 
aaactacttt 
gaatgagaat 
cataggtgtt 
tgggaatgtg 
tatccagaca 
agaaagtgat 
tttaaccagt 
aataccatat 
agctaaaaca 



gttattgatg 
aagggatata 
tgttagatgg 
ttataccaga 
ggtgaaccca 
tatcagctga 
gtggactagc 
ccacaccctc 
ccatctgcat 
cttctactct 
cgcctgttgc 
ttacttcata 
tggctgtata 
gtggtttatt 
tttgcttaac 
acttactatc 
cctcctttgc 
gtgccatttc 
gtagagtcag 
ttttcaacgt 
gtaaatcctt 
gaattgattc 
ttagatcaca 
tatcgagatt 
ccaatctctt 
tctgaaaaat 
ctcctgccac 
gggttgtata 
tcgagttcgg 
aacccgttat 
gctcgttctg 
tagacgtttg 
tttcaaccag 
ctagttttct 
attcttztacg 
ccatagaaga 
atgttgatga 
ttggagaaaa 
aatttataag 
ctgaagctaa 
gcacatgtaa 
atggaatgga 
gcagtaatac 
agaggagaaa 
gtacaacaag 
aataacccct 
cctcctgata 
agagtagatg 
aaacaatata 
tataatagct 
gctgccatca 
aaaatattgg 
atccaataaa 
gaagaaaaaa 
gtcattttat 
ggcgagagag 
gccatcaact 
gttatagctt 
tcatttggat 
caattgtttt 
tagtgcagtt 



60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
61740 
61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
62280 
62340 
62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 
62880 
62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
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atacaataca 
tacttttcaa 
tttctccagt 
gggtttatac 
aacacatctc 
cctcctattt 
tggacatcat 
ttgatatcta 
gattctggaa 
tctgggacaa 
gtttgtggat 
ccttgctcgt 
tccagtagta 
aaataatcca 
tctgtggtgc 
caggacgagg 
gctzggaggta 
cgagcactgg 
ttccctcttt 
tgtctttaaa 
tgattctggg 
cagacataaa 
cactatgtat 
tgttcatata 
atctttcatc 
cttctctctt 
tgtacgttat 
tttcttcatc 
gttcatcctc 
ttacagcttc 
agtcgtcagt 
cctcctcctc 
tgctgttatt 
acgatactat 
aattattacc 
caccttcagt 
ttttagtaac 
tttgagggtc 
cagattctac 
tactgatatc 
tcaaatatga 
tcaaactaac 
ggtctcgtac 
gtactagttt 
taccaacact 
tctcccatcc 
cccatgtgtc 
gcataatcaa 
tcgtatttgg 
cacccgaaga 
tttcagagtc 
tcaggttcca 
gattttcatc 
cttttctgtt 
gggagaagtg 
catcctctgc 
ttagaacagg 
gcatcatcat 
tttcattcct 
1 tcgtcatcat 
taaggttttt 



ataatacatt 
tcaaagccga 
acatgggacg 
aacttcttga 
ttataataga 
tagatttatt 
ttttacctaa 
tataatgtac 
cagggatctt 
cacccatgga 
taaaagtctt 
caccaattac 
cttcctttcc 
cactctcctc 
acatattatt 
agaattggga 
gaatgaagac 
catgaattgc 
attgagagtt 
atcgtccaat 
agtctttttt 
attagttcta 
tatgggcatt 
cacgataact 
gtcgatgcct 
attggcgtcc 
attgtcaaga 
catcacgggt 
ttctatatta 
taaaaagtca 
ttctagatct 
tacatcatcc 
ttgttcagtt 
tcctcctcct 
aaaaacatca 
gtggctctca 
aggtgcactt 
attaccactt 
atccgaccct 
tgataaggtt 
catgacattt 
gtcagtttcc 
aaaatctata 
cttatcacat 
atcatcacag 
gcccacacaa 
gtactcacag 
tagtgtactt 
ctttgttctt 
attactgttc 
cttaaattgt 
tttcaccagc 
gttggtgtgt 
accagacata 
tttcccagga 
ttcggtgaat 
gatcctcatt 
ccgctcgcac 
tgattttgcc 
cttgatggct 
agacttgagg 



tctccctcca 
gtcttgttcc 
aaacactaac 
tgtctttaca 
aacagactca 
aaagggtgca 
tacacttaaa 
atcttcaaat 
ctttgcagtc 
tagggcgata 
cttctttcca 
acgtattgtg 
ttcaacaaga 
taaagacaag 
attcaaacct 
ggagttatgt 
cacagcgtga 
ccctgagagg 
gccaaaagtt 
acttttggaa 
agttcaggca 
tctgtgcata 
aattgctggt 
aaaaggttca 
tgaactggtt 
attaaaactc 
gaaatgcatt 
tgtttattgt 
tttttagcga 
gaatctttat 
gcctcctctt 
atttcattga 
tttaagattg 
cctccatttt 
atcctttctt 
aaaaatccac 
tttctggttg 
ttacgtttat 
gtatttttac 
gttattgtac 
gccgctgcat 
aaatcaatca 
get tgagaca 
tttctacaat 
tctttattat 
gtacagtacg 
ttactaacaa 
tcaggtgaaa 
cttttagctg 
atgcacctaa 
gtaccagtaa 
acatcagaat 
tgcttcaaaa 
ttataagaat 
gtctccagat 
gatgeattag 
atgtccagta 
gagtagaata 
ttttccttac 
ggagecttte 
agatttttca 



atgatacaat 
ttggataatt 
ctcatactag 
tcctcagtag 
gaatcatact 
aagaaagggg 
cctatctgtt 
ttttggatat 
gtcatcattc 
geattggeca 
aaacgaagaa 
tcccttttgt 
ggtattccaa 
gaagaggatg 
tgagaagagg 
tagttagtga 
ggggegacca 
gtagccattt 
tattatagtt 
tcctctccca 
tatactgeaa 
ttttatcctc 
acacaatagt 
tggtttttac 
catagagcac 
cacttagacg 
tgtcgaattc 
taagttttga 
gcgatgacgt 
actcgaacgc 
cctctttttc 
attttttaaa 
acgtctgctc 
cttctcccaa 
tagtcaatat 
gaattgattc 
cattaacctt 
tattagaaaa 
tctcattatt 
caaaaacacg 
ttacagccag 
agtgtgttgg 
gatagaaaaa 
tacttttaga 
tcttcctagc 
tgttttcaaa 
ttgeactatt 
geatgagegg 
caaaaccacc 
agaacttgac 
gacgacacac 
cgtgggagaa 
cattcaataa 
cgtactccaa 
tcaaaaggca 
aaaggtacat 
atgcagatgc 
gtagatcgtt 
ttctttcttg 
catcegtaaa 
teacteggat 



gttgatgttc 
tatccatgac 
gattegtate 
gcatttcacc 
ttaaatcttc 
tttcttcctt 
ctcttacaaa 
tgegattaac 
tggaacacgt 
gtttcactag 
agaagacget 
ttaatatttt 
gttcttcaaa 
tgcatttggg 
acatttttct 
tacgtcttgc 
tggcttagat 
agtgttttca 
gtctgtgcat 
gttcttgcgt 
cattgtgtat 
ttctgtttta 
attatcctct 
actataagga 
caegttacta 
tgataataaa 
gttaacaatt 
tgatacagaa 
catttegtte 
tttccttgcc 
accttcctcc 
gaaatctaaa 
agtagataga 
tgcttctgga 
tcccctatca 
gaaaactgac 
acttttctta 
tgggataaga 
attagcaccg 
ttgattaaag 
ggctctatga 
aggaaaacac 
tttgattgtc 
caaattcccc 
ttcttgttca 
tactgeaact 
gtagtctgat 
cgattctgcc 
caactcgttt 
atggccaatc 
gtttttatga 
aagattcata 
atcttcccac 
tatcttcctc 
tgatgttaaa 
geatttgetc 
catcagacct 
ggtcttccta 
tctgatctgt 
gagtacagta 
catcctaaaa 



tcctcccata 
cagcttagaa 
attgtgggaa 
caataaatta 
tatgeattea 
cttcttcttt 
ggactgaaaa 
atttttgtat 
gcaggcaggt 
ccacattttt 
acataegtet 
gtaggcaaat 
atccttttct 
cagcaagatt 
ctttactgtt 
egctgetgat 
atggccacgc 
cteggtattt 
gtcgatacct 
tttatttget 
tctagtgcag 
ccacaaaatt 
agaccgcaca 
acaccagaga 
ttgttcccgt 
ctattgtcat 
tctgcccact 
catcttgaat 
gtggctgcag 
tcatctaacc 
attttttcct 
aagtcactcg 
cctggtattg 
ttegtatcta 
ttaattatac 
gatgataatt 
egggtagatt 
tcttcagtat 
gaaagatgta 
gaccccattg 
agtgcgtgtt 
aactgaagcg 
cagaaaggaa 
atagaaatat 
ctcaaaaaaa 
tctttcctta 
cctttagtta 
aaattatata 
gcttgatttt 
tttaagaacg 
tgtccaacca 
gaaagtgttt 
ctcttggatt 
ttttctagta 
cgtacgatat 
cccgtagtct 
tgaggaagac 
ttgaggcaat 
cgtttggctc 
tacctccctt 
gegtagtaca 



64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
66720 
66780 
66840 
66900 
66960 
67020 
67080 
67140 
67200 
67260 
67320 
67380 
67440 
67500 
67560 
67620 
67680 
67740 
67800 
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agatgacccc 
ctgtcgggga 
aaaacttcct 
agtcgtccat 
cagttagcca 
tagccttaat 
catccctgca 
tatcttcttc 
tagtattgat 
caatttgcct 
cgagtgtgta 
ggaagttctt 
aatcttcttc 
cagttttctc 
tggcttctgg 
atgctcactg 
tcctagaaga 
gcagctctct 
gggtactttc 
agtttctaga 
acacctcgtt 
ccacgtaatc 
tacgaatctg 
cactttactg 
aaaaaataac 
gctacccaga 
ccatattatg 
aggggtaaat 
cctaaagaaa 
gagtttgaac 
aatttggcag 
acaatgacaa 
cagcagacat 
gctgcagaaa 
gaagaagatg 
gtatcacaac 
cccgctcctg 
ccattaccac 
gatttcttag 
aatattcctc 
gcttcaattg 
gaaagagaag 
caacgggaag 
gctgctgctg 
gaacaaagag 
aaaagggaaa 
gcccaacaac 
gaactagcag 
cgtcaagaaa 
gaaatcacgt 
cggtccgcag 
caggcagtcg 
tacgatctca 
gtgaacaaaa 
gccattgaaa 
agaataataa 
ctcggggtct 
aaagaatcaa 
gatctcaata 
ttatacacta 
gaaataagta 



ctccatctgg 
agaattgaat 
cttcttcata 
tttgtgatta 
atcaacttcc 
tttctcgtgt 
gtaaaaattt 
cctgataaca 
aaacaggatc 
gcccaacatg 
gtcacaggta 
ggagttgatg 
tcctgaacaa 
cttgatgact 
gacacccatt 
atgggcgccg 
acttcgtgga 
acagagactt 
ttcctcttcc 
tcgaagaaca 
gaacacttct 
aaattcggat 
gaacacctgt 
atgttttata 
gttatacacc 
aaaggagacc 
ccaactttgg 
ttgtattaga 
aggaggaaag 
agaaagtgct 
agaaaggatt 
aaaaaataat 
ctgacaaagg 
gagaacaacc 
acgaagatat 
ccgttcaaca 
atattcccat 
ttgatgatta 
tggcaccacc 
ctccacctcc 
cagccttgga 
tagaggaaca 
cagacgaaaa 
ccgctgctgc 
cggccgctgc 
gagaagtaga 
ttcaagaaat 
ataaggagga 
tattagctaa 
ccaaaagagc 
ccagtgcaca 
aacctcaagg 
ataagaaatt 
caaacgaaca 
aattaactag 
cagagaggga 
attttacatt 
gggatctgga 
caatactgga 
aatttacaga 
acagatcaag 



ctagcaaaaa 
gaagtagtag 
acatctgtgt 
gaaactggag 
tcttgggatg 
ttatattttc 
tcataaatat 
ttaaaaccgt 
atctgaccca 
accagatctg 
ggtgaagaag 
ttggcaagag 
taatcctcgt 
ctcccattct 
tctgttgatg 
taatttttac 
gctgctttaa 
ctcggagact 
tgttgttgtt 
gtgtagtctt 
agggccaatt 
tcactcaata 
ctcctccttg 
ccctctcccg 
aaaaatgagc 
tggagatgaa 
cgatgacgcc 
gccacctaaa 
ggaacaacgt 
acaagatcga 
gcaagaacgg 
caggtttcgt 
tgcagcaacc 
agtagaaatt 
gggagaggat 
aatcgttgtc 
acaagaagat 
tgaggacgaa 
aacagcagca 
tcctgttgtt 
aaaggaggcc 
aagagcggcc 
aagggaaaga 
ccaacgggaa 
tgctgctgct 
ggaacaaaga 
gaaagaacaa 
agaaaaacgt 
agaagagcaa 
actcgaacaa 
gttagcgatc 
acagttagtt 
agaagtacag 
gaaccaattg 
tcaaaaacat 
tttttcttta 
aaaaagggac 
ggtgcgtcta 
aaggttggat 
ggcagacacg 
atctgcctta 



atgcatcttt 
gcacgttcct 
gtttgttgat 
gagccatcat 
aagggattat 
tagaaaagaa 
gtttcttgtt 
catgtaaaac 
tatttttctg 
a'tacattata 
aggaagattt 
aagtggcttt 
cgtcttcgtc 
gtgcaccgta 
ttactcccac 
acactctgcg 
aggttgtagt 
tgccagaagc 
tgtagagtgt 
cttcctcttt 
cccacacacc 
actgtaagga 
actgttaact 
acggaagatg 
gatacagggc 
gaagaggaag 
acgtactcca 
gaaagaagtg 
tctaatgttc 
tcgcgagaac 
caaaagaaat 
gaaggaggaa 
aatgttcttg 
acaggagata 
gaattaacta 
agttctccta 
atagtgggga 
gaagacgaac 
gcagcagctt 
gctgttgcag 
gaggaacaaa 
gctgctgctg 
gaagcagagg 
gcagacgaaa 
gccgctgctg 
gcggccgctg 
atgcgcataa 
cgagaactag 
cttgaaaaat 
atgttagaag 
caagcaatag 
cctatggata 
aataatacat 
gttgctcaat 
cttcctgtgg 
aagaatctgg 
ctttttaaat 
acagatgtat 
gtatccgtta 
gcattagcag 
ttgccatttt 



gaatctggat 
gaacaagaaa 
ccagtaatcg 
catgtcttcg 
aataggcact 
ttcagaaata 
ctgcatccgt 
gtctattatc 
tcccttcaac 
agtgggcaaa 
cttgcctctt 
gagaagtgag 
atcgtaacaa 
cgcattttcg 
gggatgttag 
tagtctggtg 
gcttggaatc 
gagtttcctc 
tatagattca 
cacagtctca 
ttgttcctta 
ggaagaatct 
gatgtctgac 
tatacgtcac 
agatggaaga 
aaactggtag 
tgtacactgg 
tacaaagggt 
ggacacggag 
ggtcagaaaa 
atactccaaa 
gaaaattcaa 
aaagggaaga 
ctatattagg 
tacaacattc 
taccgccaaa 
aaaatattag 
atttgtacga 
ccacaagacc 
acgaaacctt 
gagcggccgc 
ctgctgctgc 
aacaaagagc 
aaaaggaaag 
cccaacggga 
ctgctgaaag 
aggaagagga 
cagccaagga 
tgaatttcca 
aagagaaggc 
aatatgagga 
cggat ttgta 
taacttctgc 
cccttgaaaa 
atgatcctgc 
gaaatgttta 
cgaaggcatt 
cgacagatct 
acatacgctc 
atcaagttcc 
catctgcagg 



ttcaaaactg 
ctgttagtca 
agaccctcat 
attggaggtt 
ctttcaattc 
accttctttt 
gaatgcatag 
tgttgatttg 
atggtggttg 
tcaaccatgt 
ccaaaatcca 
ctagttgtgc 
tcttcatatc 
tacacttttt 
acgataatga 
tttttacccc 
tgcagcaaac 
ttcttaatag 
gtagggaagt 
atagaattgg 
agtgtctcga 
ccatccatga 
cccttgtaac 
aatttttttt 
aaataggcct 
tagtaatgtt 
agaaggaaaa 
gcaaaaacca 
acctggtcaa 
acttgggcaa 
ggtagcacaa 
ggcgccgcaa 
aattgagatg 
tgggctagga 
atctatggct 
gcccactagg 
ccagttacca 
agaagtgaat 
tcccaggcct 
gaagaacttg 
agttgaaagg 
tgctgccgcc 
ggccgctgct 
agaagtagag 
agcagacgaa 
ggaaatactt 
gaggcggaaa 
ggaagaaaag 
gttgggtaca 
ctcacgctca 
tgaacttcct 
cggaaaaatg 
atttgaagac 
atccgctaaa 
ttttatgcag 
caaaagagtt 
aattacagat 
cagggctaat 
tggtggaaca 
ttcgaggatt 
tttggatact 
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aattttacta 
gaggctatga 
gtcaccaatc 
tctgatgtgt 
atatcccctt 
atttcctctc 
atcaatacaa 
tctccagact 
tctcaattta 
aaaaccactg 
cataaaaatg 
atggaagaaa 
gattctagcg 
cagcaacaac 
aatcaacaac 
caacagcagc 
cagcagcaaa 
agtgaccagt 
caacaaggaa 
ggggctgaaa 
gctgacttga 
gcaaaattgg 
ttgagagagg 
aatagtatta 
aatttaatgg 
gtgaagaata 
caacaacaac 
ccttctacta 
ctagaaaagg 
aaatccgctt 
gggaaaacat 
gaccaacaaa 
gaagaaaact 
aagaagaacg 
gtgcagtctt 
gaagatactg 
gaacatgagc 
cacatcaagg 
gcgttgaaca 
tcaagaacag 
gctcaaggac 
tgtagtaacc 
aaattagcgg 
tgtcatctca 
tatacgttaa 
acaactaaag 
ttttcagtat 
aagggtttca 
tctctctcca 
gttat tgaaa 
aaggaaactg 
tacagtaagg 
gttaaggctg 
tctgatattt 
attgtggtgt 
tttagacaga 
caaaaacatc 
acaaaggcag 
gatcttttat 
ggatgggccc 
gaagtaaacg 



atagttccga 
atattttgaa 
tattaacagt 
cccgaatgga 
attttaaagg 
agataaaatc 
cctcaagaat 
ttttaaatgc 
cttatgatgt 
aaaaggttac 
ctttgagttt 
gtggttcatt 
ctgaacttag 
aacagcaaca 
aattattaca 
aaagggacca 
gggaacaaca 
ttcgacaaca 
gaagaagagg 
aggatgattg 
ctaccttatt 
gaacccttgt 
aatattccac 
tgcgtatttt 
gaaaatactt 
tattgtctgg 
aacaacaagc 
ctcctcctgt 
ctgccatagt 
cagctccttc 
ctatccgtat 
aatggatcga 
ttgtagaaac 
atccgacccg 
tcccatatta 
tacatcctga 
gtgaagatac 
actactattc 
ctctattgta 
cctttattta 
ctgttcggtc 
ttcttcgatc 
ccgctgctgc 
tgtacaattt 
atgtattgag 
ggcatacatc 
tgtacaagat 
atgcatggga 
tctcagatgc 
aacatgaaaa 
cactcacact 
ataccgatat 
ttctcagatt 
tcaagctacc 
tcactttgaa 
aagacggggg 
aacaaactgc 
cacacgtgtt 
ggggtgcttc 
aacaatacaa 
cacctgcctc 



caagtacaat 
agaaaatatt 
ttcaagctct 
ttctgaaata 
cgattggacg 
aaacgataaa 
acgtagtaat 
tgttaacgac 
cctttcaggc 
agaattgtgc 
aaatttgcct 
ggctgttgac 
tcgcacactg 
attccaacag 
acaacaaata 
acaacaacag 
acaacaaagg 
attattgcaa 
gggtgacgat 
tgtgcgtaag 
ccaacgagaa 
ctttgccacc 
attcacaacc 
ccccgagagg 
gtacattata 
tattttcaat 
tgcttcagct 
tacaagcatg 
agaggcaatt 
cacggcggct 
ggaaaaagtg 
cgaaagtact 
tgcacataat 
tgatgccaat 
cgtactccgc 
atatttccgc 
attaaaggca 
tcccagtgtc 
caacatattt 
ccgtaaattt 
tcaactgtgt 
gtctccctta 
tgtagacacg 
tatagtggct 
agcgtcgggt 
ttcttcccac 
tcttcaacta 
atcgtgtgtt 
cgaccaatcc 
cgatgcagaa 
catggcaaag 
ggaaaagttg 
.aacaaataaa 
tgtgattcca 
taatgtaatt 
agtttcttca 
ttccatttta 
tatttcagga 
aatgaaattc 
taacgaaagc 
tggactctta 



gaaatagtga 
gtcccaacat 
cgtcaatatg 
agaaaattcc 
gatgaaagac 
attaaagaga 
cccctcctgc 
tttagaaatt 
caaaatattg 
ctcgaattat 
gcaattactt 
attagacaag 
tcagaggcgt 
cagctgttac 
gaagaacaac 
gaacaacagc 
gaacaacaac 
caacagcaac 
ggtgatgaag 
gttgcagaat 
gaaaataact 
cctccttcac 
cagtgttttt 
attgtagaag 
actaccgcac 
caaattgaag 
tcttctacta 
caagtttgtg 
actctggcca 
gagcgagaaa 
gatctaagtt 
tccaaacaag 
gaaatggata 
cttaggctcg 
acatggctag 
caatacattg 
ctgggtgttt 
aaaaatgatg 
gctatcgacg 
ttaaggcaat 
gaagcgacaa 
gccgataaag 
tcaaccggag 
tatgtgaacc 
ctggcaaaca 
cggtttggat 
caaaaacaga 
gccgccatgg 
atactcttcc 
aaaaatgtcg 
gaactaaatt 
gcgagagtgt 
gccgagagtt 
attgatgata 
aaaccatgga 
gcctatttct 
gatgcatggg 
tatgaaaatc 
cctgccgacg 
gtattggaag 
ataccgccag 



accaactaag 
tgaaccaaat 
ctattgaaga 
tcgctataat 
aacgctctat 
gtgttgctac 
acaaatcctc 
ttctcgatat 
atgacctttc 
ccataatttt 
accccgcagg 
aaattggtaa 
tacaaatttt 
aacaacaaca 
aacgggttca 
aaagggaaca 
agcagcaaca 
aatttcagca 
aaagagagga 
cagtagcgac 
tccaatctaa 
ctatcatgaa 
caaaactaac 
tatgcaagag 
aaacagaaat 
agttttcaaa 
atcctcctcc 
agttggatga 
atgctgtact 
ttgctctaaa 
caggagctac 
aattggaaga 
ttggattaat 
tgaaacctca 
gagaaacaga 
atcgcaattg 
ctttatcaga 
catcaaaatc 
gaggaatgat 
ctatgacaga 
tagcgtctct 
tagaaccacg 
acatgtttcg 
tatgcaataa 
aaaaagtcgt 
cttatgatgt 
atatctcttt 
cagccttcac 
cactagaagg 
atatggttca 
catactacaa 
gcaggatgat 
tagtagacac 
ccaaaactct 
tggtttcgtt 
cgtttcaaaa 
cttgcgcgcc 
atatcaaatt 
gacgaggcac 
attttaccga 
atcctttact 



cagtataaat 
caaaattgat 
aagggtgtat 
gaacagtaaa 
tgctgacagt 
actacacgat 
agttttatca 
ccaaggaggt 
actggcatca 
agacgtgatc 
ggaaacatct 
gaacatatca 
tcagcaacaa 
ggaccaacaa 
ggaacagcag 
acagcagcaa 
gcagcaacag 
attactacaa 
aagagaagaa 
aaaatatact 
aatagcatca 
cttgacaaaa 
agctgaaaat 
taagaatctc 
ggaagatcga 
caatgtaaaa 
tccttctact 
tcaacgtacc 
tcaaactaca 
gctagagaat 
tggtgtttcc 
tttcattgca 
tttggatgcc 
tggaataaat 
tatattggat 
gaaggtggaa 
tacgttggca 
agtaccattt 
ttctagcctt 
taaagaggtt 
tttcacggca 
ccttcaagaa 
ta tacgagtt 
tcgtataaac 
ggccggtaaa 
cacatatgat 
acttttagag 
tgccgaccct 
gggagagatt 
agaattgtgg 
ctggttcata 
tatcggtata 
taatgctctt 
ggcaataaat 
caagcaaatg 
tatccaacaa 
cggtaagtta 
aaaaaaagat 
tgtggtcgaa 
cttttcaatt 
ttcttccatg 
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ttcggtaaag 
aagggaggat 
acgtcgtcgg 
ccttatagtg 
gaggatagtc 
atagggcaac 
cactacagtt 
ggacaatctt 
tctctggagc 
aagagcgaaa 
aataaagcta 
aagaggtatc 
cttatgaata 
aataaaattg 
tttggcaata 
ggagtaagag 
actggtggca 
gttgggaata 
aattaggaat 
ataaaggtta 
cccttccaaa 
aattgcacgg 
gagtgatgtg 
aaatgtgttc 
atacgagaat 
tagctggtgc 
ttcgcaaata 
cgtgtttctc 
agagagcaaa 
ttctattgtt 
tgtacatttg 
tttggtggcc 
ctgggtccac 
aacgaatttc 
atctcagaat 
attccccatc 
tttctctatg 
tgggcgtaga 
gccggatatt 
ctccattttc 
ttgtgatggt 
gggggaggga 
cagaaatggc 
gacattggcc 
caccccctaa 
taaaattctc 
agaaacgcct 
ctggtgtaat 
gagctagcgg 
gatagaggaa 
tcgaccagtc 
aaccaacaat 
tgacattcgg 
aaaagttttt 
cacatccttt 
aatgtacgtt 
gtccaccctc 
ccctaaaaaa 
cagaaggtta 
atacccataa 
ggcccgaccc 



gaaatggtgg 
taattttaaa 
atactaaaaa 
taattaaggc 
cagaagattt 
gcatgactta 
cacctaaaat 
gggctccttc 
ttttatatag 
gggaagaaga 
tcggaactat 
agcaacactc 
gtaaagatgc 
cagcagcagc 
gattggatga 
gaggacaaac 
atagtagttg 
atttagttgt 
aaggaattat 
tgatcataca 
aatgaatttt 
agcgaatctg 
aggattcagt 
cattattcgt 
gttgttgttc 
aaaacaactc 
gaaaacagaa 
agataataat 
gaacattttc 
gtgtttgatt 
cttgactgcg 
attatctggg 
aaacttgaaa 
gtctgcgtct 
cccattcaag 
tacataaagt 
agaaaatatt 
ggccaagagt 
gtctaccatg 
tggagagggt 
ttgaatgagg 
gggggagagg 
cataagatac 
cgacccagag 
actggagcgc 
tagtgaaaac 
gttccagaaa 
ctgacattgg 
tccaccccct 
gagtaaaatt 
tccagaaacg 
ttctggtgta 
tcgagctagc 
gagattgaag 
ctaggcacgg 
tctggagcta 
cgaacttgac 
tttttgaaaa 
taccccacct 
gtcctttcct 
agaggtccac 



aagcagcagc 
ccgccaagtt 
aataagacgt 
aagtaaagga 
cgcccttaaa 
tacaagacct 
tattctcgaa 
ttcatcatcc 
agaacttgcc 
taaaggacaa 
ccaacaacaa 
tgctgatcaa 
tacttccatg 
agcaagggat 
agcaataaac 
accccagagt 
gtggactact 
gcttgtaccc 
acacatttat 
taatcaactg 
gtacctccga 
ctcacccatt 
ctgttgtcag 
ctacgcaaat 
atcccaaaca 
ccaaaatctg 
atggtcacat 
tttcccgttt 
atttcttcac 
gaaaattgtt 
catcttcttt 
tggtatccat 
gcctctctat 
tctgccttgt 
gtgtgttttt 
ccagtaacaa 
gccttgttta 
agagccttgg 
ttagacatag 
ttgttgaagg 
tgtggtacag 
gttgccatca 
ttcctctatt 
gtccaccctc 
cctaaaaaat 
agaagggtat 
cacacaaaag 
cccgacccag 
aaactggagt 
ctctagtgaa 
cctgttccag 
atctgacatt 
ggtccacccc 
aggagtaaaa 
ccatgtccag 
ttacatttct 
attcggtcga 
gtttttgaga 
tttctggatg 
ctatttctgg 
cctcggaact 



agcagcagca 
gttggacaag 
gatgcaaata 
gtatcaattt 
acttccatca 
atatttgatc 
ggatcagatt 
ttgactctgg 
acaaaacaag 
aaacttaatg 
catcaatatt 
gctagtaatg 
agaaaggcaa 
ggggaaaatt 
cctggagcac 
tcaatgctaa 
aatacacccc 
aacctattgg 
ttgtatgttt 
gttgttttat 
atacaatttc 
caccaacaac 
cgagttcaac 
cgagtgcatc 
actcaaaatt 
ttttcatcag 
gtggaggcaa 
ctgaatagaa 
ctttaaaatc 
tcaccacttc 
gcacctgtgg 
tcaatatgtt 
taaaccctct 
ccattaacat 
ctggatagag 
caatctttaa 
ctttggtgga 
aggaaaattt 
aagtattatc 
atagatgttc 
tcattatata 
agcaatatct 
tctggagcta 
ggaacttgac 
ttttgaaaag 
accctctcat 
ttaatgtacg 
cggtccaccc 
gccctgaaaa 
aacagaaggt 
aaacacacaa 
ggcccgaccc 
ctaaactcga 
aactcactat 
aaacgcctgt 
ggtgtaatct 
gctagcggtc 

tggagtaaga 

cgactagatc 
aaccaaccat 
tgacattcgg 



aggataatac 
aacaagcacc 
ttgaaccaat 
cagtactgga 
tcaatgatgc 
atcaaacaca 
taaagaatgg 
cctccgattg 
tagagaagga 
aaaaattatc 
ctgaaagggg 
gtggcataga 
aactggcatt 
catcagctaa 
ttttattacg 
caatgttccg 
ttattcaacg 
attcacaccc 
tacatgtatt 
tacatagttt 
ttccctcggt 
accctcatta 
atttctccct 
atctccaaca 
atccccagtg 
agtattgtat 
ctgagaaaat 
aggaagtaac 
aacttgattt 
aaacatgcca 
ttgagtagat 
gtagccggca 
ccctagagga 
catatttaga 
gcaaataaca 
cgctttagat 
gagttgttcg 
cttgtccata 
actaaagtca 
ttccatggtg 
cacgtgggcc 
tcgtccagaa 
ttacatttct 
attcggtcga 
tttttgagat 
ttctgggtcg 
tttctggagc 
tcggaacttg 
aatttttgaa 
tataccctct 
aagttaatgt 
agaggtccac 
gtgccctgaa 
atgaaaggtg 
tccagaaaca 
gacattggcc 
caccccctaa 
gtaaaattct 
cagaaacgta 
ttctggtgta 
tcgagctagc 



aattattgga 
accaataaac 
aatagggacg 
tgatttcaat 
catacgagaa 
gaaaaatata 
acaacgttca 
gaatctacct 
agaagaagaa 
atttgtcgtg 
aggaggaatg 
tgatatagaa 
agccgttact 
accgtcaaac 
tagaggagga 
tcctggacaa 
cacaactagt 
tcctacattt 
ttattcaaca 
ccattactga 
ttacaaagga 
atcttcccat 
attgcatctg 
catactttgg 
tcgattttga 
tcttccatat 
gatttggaga 
tcttcgctgt 
atagtgaaga 
ttgatttgtt 
gaaggggaga 
caagggataa 
ataagtccac 
tgcacaacat 
tcatccacat 
aaaacaatag 
tcactgggaa 
ggaagtctgt 
aagaagacta 
gtggtggtgg 
ctttttaaaa 
acacctggtc 
ggtgtaatct 
gctagcggtc 
ggaggaagag 
accagtctcc 
tattacattt 
acattcggtc 
aagtttttga 
catttctggg 
acgtttctgg 
cctcggaact 
aaaatttctg 
tgtagaacac 
cgcaaaagtt 
cgacccagag 
actcgagcga 
ctagtgaaaa 
tgttccagaa 
atctgacatt 
ggtccacccc 
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ctaaactcga 
tttcttgcct 
ccatcattaa 
tcaccagcac 
tcttgtgata 
ctaccagttc 
agcattcctt 
tggaagggga 
acaaggctat 
attccattga 
agattgaaag 
aggatttaga 
gttcaagaat 
catttgagaa 
aaacaaccta 
ttttcttgac 
tgaatgcagg 
tgtgcatgtt 
agacgcagtc 
tggagaggaa 
actacagtgg 
ctagttggtg 
caattagtga 
atgaagaaca 
atgctagtcc 
cctctccaac 
aagaagagga 
ttgttatgaa 
aacctgcact 
atgatggtga 
cagtgaaaaa 
ctatgcaaat 
tattcaccaa 
caaccatttg 
tcacatcaat 
gaatgtttaa 
tgtatggaaa 
ataatgcact 
tattctcctt 
gtgcttcatc 
aaggggaaaa 
aacaacaagc 
tgccacgcag 
tttgtggact 
ttgaaaatga 
ctcgcttgta 
ttttgtctca 
aagaagtact 
aaaacaagta 
accatgcgtt 
aaggagacaa 
taatgaccag 
acgaagaaac 
aagaaagtat 
aaaaattcaa 
attcgatggt 
agaaaatccc 
caaggtttgt 
atgaaactgt 
tacctgttga 
acaccataga 



gcgaccctaa 
gaaaggtgtc 
taaaaaatgt 
accacagaca 
ttattcaaga 
taccccctca 
tgccaatctt 
gaggaagaaa 
gttggacgat 
atcgaccatc 
tgaagccaag 
aatctactcc 
tggaggcaag 
caacaacaac 
tcctcttgtt 
tggtgttggg 
tattcaaacc 
tggacgtaca 
ttttgacgaa 
gaagcgcatg 
agacgggaaa 
caacgcaaac 
tatgaatgac 
tgaagaacgt 
ttcccccatc 
agttcctcct 
agaaattgat 
gaaattagcc 
caaaattact 
tgatgacgat 
ttccttcatg 
ttctcttgct 
caaactccaa 
gactgaagaa 
tgtggcacat 
cgccaactct 
caagcacatt 
atttgcgtgg 
caaaatggca 
atctaacgat 
ggttgcacaa 
tggtcatgac 
taatgtcaat 
gtctgatgct 
aatcatgagg 
tgagaaatgc 
ccaaggggaa 
ggagaatatt 
tgcaaaggta 
tagcaagttt 
ctatacccat 
gaacaggatt 
cgatattgtg 
ggacgtagaa 
cagtattggg 
agtttctatt 
tcccggtttc 
gtttagcgat 
ttttggagtc 
tttttcaagc 
cggaaagggc 



aaaatttttg 
cagaagtgta 
cccctcccgc 
tcattctcaa 
aaacctactg 
aaaatgtccc 
catgacaagg 
atgtctaccc 
attatcaacg 
aaggaaattg 
aacagtgtaa 
aaggggcgag 
tatgcagtgt 
caagttttct 
gggttcgaaa 
ctagaaagat 
ctaaacatga 
gaatatgaca 
gaggaaaatg 
aatgacgcac 
gaacaattgc 
ggtgtggtca 
ttgtgctgtt 
tctatgaata 
aagacggaag 
cctacaccat 
gaagacgaaa 
aagctttcta 
tctgggggta 
gccgttgacg 
gccccaaacg 
aaaatttgta 
gagtgtttga 
agtcaacaat 
cagatggctc 
accaacatta 
tcttcctcgt 
gtgaaatcga 
tcagaaaagc 
gaacaccaag 
gaatcatcat 
aaggaaacta 
ggctcggctt 
tccagcacat 
agctcattca 
aggtctcaag 
ttgt ttgccg 
aagagttgga 
ctggtgagcg 
gtaccaatgt 
gacgttcact 
tgcgatgtgc 
gaggaagagg 
caacagaagc 
gatcaagtca 
gcaattaata 
ttgaaggatt 
gtcaaaatca 
agtacgcgtg 
gcaggactag 
attttgacaa 



aaaagttttt 
aacacacaca 
tcccccctca 
gacatttcaa 
aaatcgtccc 
agttgtattc 
ctactcttgc 
caaagtcctc 
agtatcaggc 
aaaatgtact 
cttccagccc 
tgtgcaaagg 
caatgagtat 
ctgaagaacc 
cttctacaga 
ctctacctaa 
ctggtttgag 
acttggaaga 
atgccagaat 
cagcgattac 
tctatatgat 
catctgacag 
ttgctgactg 
ttgtggtcga 
aagatggaga 
acgaaggtaa 
gtagcaagta 
ctatgaaaca 
acaatagtag 
ctactgcatt 
acgagagaac 
ataatccatc 
ataccatgga 
ttgctaaggg 
aagatatttg 
agggtaaata 
gcttcaaaac 
aactccattc 
cctcaaaaat 
aaccatcaac 
ctccttcttc 
tcaatttaat 
catatttgtc 
ttaagagaat 
ctagactgac 
ctgtagatat 
agttcttgga 
gcctagaaaa 
ttaatgctat 
tcttgttcaa 
tttcttccac 
tccagtacat 
aggaaggaga 
aagttcgcaa 
ttagaaaatt 
gtttgatctc 
ctagcacaat 
ataggaaaat 
tggattcaca 
ataaggcctc 
tttcacccaa 



gagatggagg 
cgcattctcg 
aaatcgggta 
gtactgagaa 
taaaatggat 
atcagtgaag 
atcaaaggtt 
ttctgatgga 
cgttaagagc 
tgaaagtgtg 
agaaaaagtg 
tctcaagtta 
caaaaaacac 
cagggattgt 
agatggaaat 
atatgtacca 
gatggccaag 
tttttacatc 
gaggtgtcac 
acctcatgtg 
aaccgagtat 
tggattttct 
catcgatgtt 
atctgacagg 
aaattcatcg 
cgcagttgtg 
tgaaggttca 
aatgagaagg 
cagtagtatc 
atgcccccaa 
tgaaaatatt 
atctatgagc 
cgatagtatc 
tttgttgttt 
caagtctgaa 
tgaaggacaa 
caacacggaa 
tggcacagtc 
gaagcgcaag 
aaaaatgatg 
atcgtctaca 
tcccctcagt 
tgaaatattc 
gtgcaagact 
tagatatgag 
tgaggaaaat 
ggacccgatc 
cgttaacacc 
tcgtaggaca 
tctgattaag 
ttgcctgtgg 
caacaacaac 
aggagaggag 
gggagggaga 
tgtgaaaagt 
tggaataagc 
caataccctt 
caatggaaca 
tattgtaggc 
atgtggcaaa 
atatgattca 



aagagtaaaa 
caccttgtca 
taaatagagc 
catcctctcc 
tttgaaggaa 
aaagttgcag 
attaaggacc 
caaaaactgg 
actgcagata 
cgcagaacca 
ttttctgtcg 
aacgccaact 
aacgtctcct 
tttatgcttg 
acatatgcag 
gttttcgaca 
cttcctgttc 
acttcaattg 
accgaagatt 
gccgtgtacg 
gaaaacacgg 
aacgaatgtg 
actgttaata 
cgtctttttg 
tcatcgtctt 
gagggggagg 
gaagatgctc 
gttaagaatg 
aataacgaag 
actgaagcta 
ttgtatgaaa 
agttaccgtg 
cgtcgccgtc 
gatgaggttg 
atatttggag 
aagaagagtc 
tctaatgtga 
atacctaacg 
cgtacctcta 
aaaaatgatg 
ccagaacaac 
ttcataaaaa 
ggtcaacgtc 
tttgaagatc 
agggaggtaa 
gaaatggatg 
gcttactttg 
cctaagcgca 
tatgaagaat 
agagaattgg 
tacctgactg 
aataatgata 
gataaaatgg 
aagaagggtc 
ttgtgtgaaa 
tggatgaaca 
gatgaggtct 
gatgataaat 
ccctttagta 
ttgtacgtta 
ttaaacgatg 
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aggatgttga 
atgacacctt 
gctcttctct 
tgagcaataa 
tctcgtctat 
caaagaaaaa 
gtggatttgt 
aacttgtagc 
ggaagattag 
agtctacata 
gtatttatac 
accaggacgg 
gtatcaaatc 
atgatggttc 
actttatgca 
agaaatcttt 
aaaattatat 
atcgtctaac 
gccatagatg 
ttgacgaata 
cgtttccttg 
agcgggagga 
atcagtagcg 
gggtagaact 
ggaaataggt 
gagttgaacc 
ttgttgttga 
ctgctgctgc 
gtactgttgt 
aggatttaag 
aagaggaaag 
ttgttgttgt 
cactgcactc 
tgcacatttc 
caaaagatcc 
acgaggtatt 
attcttatca 
agcaagttgt 
actgctcatt 
tgaacttaat 
ccaacaacta 
agtgtttgtt 
tgaagtagga 
tctgtactgt 
gtattgtggc 
caatggtcga 
gtcacttctc 
cttcgataca 
ctctccatct 
ctccccttcc 
accttcttga 
aacacaatcg 
cttactagaa 
ggattcggcg 
agtattaggt 
aaaaatattc 
ttcaaaacca 
tttggacaaa 
caccttttct 
tagtatatgg 
cccatctctt 



ttctactaca 
tttcaatatt 
cactgaaaag 
taatggcatg 
tgtagctcct 
gagcatctgg 
tgtaccaccc 
tgaaactgca 
atttaacggt 
tttgctcaat 
ttcatcttta 
aaatgctttg 
ctgctcgtcc 
tactacttct 
tgtaatgatt 
ggcaaagaaa 
gttttacatg 
cccatatccc 
gattcattga 
attggtgcaa 
acagcttgac 
agatcaaaat 
ttttcttctt 
ggtgtaggtg 
ggagaaggag 
ggaaagggct 
ggagtagggg 
tgctgcagct 
tgtaattgtt 
tatccttggt 
ggaggaggtt 
ttgggcacaa 
tttatcattt 
tccagcatgg 
atagtcgatg 
gataggccac 
aagttcttgt 
tcaaagtata 
aaattgttta 
tcttgccact 
ttagccagaa 
ctttttcctc 
aaagccttgg 
atagccttct 
gcattttcac 
gacacggtca 
tttctttttt 
gatgtagtag 
ttttcttctt 
ccttcctctt 
tcagtgtctc 
ccactccttg 
gttggcgcgt 
tcatcctcct 
ttatgtttag 
aagacattat 
tcttccatta 
taactatcac 
aaagcagcct 
tcagggttag 
ctaccgctac 



acagacaagc 
aataagaata 
aaaaagacaa 
tgtgtacaaa 
tcatcttctg 
aacgagaaca 
aaactttgca 
cctaaaacaa 
ctaaagaagg 
aagaatatga 
aatgacccat 
gatgactatg 
aaagatgaaa 
acctcttctt 
aagaaaattg 
acaaagaagc 
ttgtgtttta 
agtcaaacgc 
catctaaaag 
taaatgcctc 
gttgcgcttt 
caatactgcc 
ctgctcttgc 
gaggttgagg 
cagtattagg 
gctgctgctg 
gaggaggaat 
gaggcggcgg 
gaggagcata 
aagtaggttg 
ggaaaggggg 
acactccagt 
tttcaaccgg 
agatagaatg 
acatttttac 
gtacagtggc 
tcggtaatat 
ctcgttcttt 
tctctgttgc 
gaatctctcc 
gagctttcac 
cgaacttttt 
ttttattggc 
ccttgaccac 
tccttacacc 
tcatttttcc 
tagagggagg 
gtgtaacatg 
tctcttcact 
tcccttcctc 
tttttccctc 
acagaatttt 
ccaatttaga 
tttttcgttg 
gtggcatatt 
tttattttgt 
catatatctc 
tacattttct 
ttgtagactt 
ttatttcatc 
aatcataaca 



tagagaagga 
aggttcttcc 
aattcaatag 
ctccttctag 
ttctggctct 
tgtttttgac 
gttttattgt 
agttgtgtag 
tatgcaagag 
ctgcaacttc 
tgtatacttg 
gtgcagtttt 
ccgctgacga 
caactgatac 
atgccatgaa 
attaaatgta 
ttcaaattca 
tgctgctgct 
ttcatctcta 
ctcttcttct 
tattatctct 
aagctcagag 
tgtagacgga 
ggtagtagaa 
tataggagga 
ctgctgggga 
attattgggc 
ctgtgattgg 
tgggttataa 
ttgaacctgg 
aaattgttgt 
cgataaatta 
tcttgctgaa 
ctctgattta 
ccacacgtta 
gttagagtta 
acgtgtacca 
cacagttcgc 
atttccatcg 
tgaccgtata 
caattcagca 
cttcccttgt 
ctttaaaagg 
tgcctcgtct 
taaaatagag 
agcaggcact 
aaatatgtgc 
ttcgtcttca 
ctgttcttcc 
ttcttcttca 
atcattttct 
actagtacga 
gacctcctga 
ttggttccta 
tgtaaggtgg 
aggcaccgct 
aatctcttct 
taaagctaga 
acaatcagaa 
t tcacatatc 
agtcatggta 



tattttgcat 
attctataat 
gaagaagatc 
ttcaaattca 
atcttgctcc 
atctaggaac 
taaccataga 
gaatattatt 
tgttagcgcc 
acctagtgat 
taaattgacc 
tgtaaactat 
caatgctgct 
tgatgctgct 
agacattaga 
tacctttatt 
tatggttcat 
gctgctggtt 
aaatttcctg 
tcttcttctc 
tcaacgttcc 
tctatatcag 
ggaggaagag 
cgagagtcaa 
ggaggagggg 
ggaggaatat 
ggctgtggtg 
ttgcctaaaa 
tactggttat 
aaaggtgcct 
tgttgttgct 
aatgcttgac 
gctcttgcta 
ctcctcatcc 
ttatttatag 
tcagaagcta 
aacaggttat 
tgtatacgat 
ttaagagatt 
tctttcctat 
cccacatttg 
tcagatgaca 
ttagctaaaa 
gccaagtatg 
cccgaagtta 
gctaatgcct 
tttttcattt 
acattctcct 
ttttcttccc 
ttacctcctc 
aaatctgaac 
atcttacgcc 
cccatgtttc 
attatatgct 
gataaacatt 
gttttagtat 
tcttctcctc 
caacatagag 
tacaatagtt 
caacaaaaaa 
atacatccac 



ttgtctaagc 
atttctccta 
tcatctggta 
gtctcttccg 
ctttcttcta 
atgtggaggt 
cacgctgtaa 
gataggaata 
tttaccggcg 
ttgaacctat 
catgaagagt 
acatttaaga 
gctgctgatg 
gcaatccaag 
ggaaagtaca 
tttaccaata 
ccccatttaa 
caatgtctga 
gtgtacgcgg 
ctctgactcc 
tccctggagt 
taaaggacat 
gagtaggctt 
atcttacacg 
gactagaaat 
tactgggtga 
gttgctgctg 
agtacagcgg 
attgataata 
gtggtggagg 
gacgttgctg 
ttgtttcctc 
aaacaccagc 
ctcctccaac 
ctctaaaata 
cagatacaaa 
tacacacagt 
cctccacgat 
tcaatatttt 
gggaaattgc 
tgacaacaaa 
ataaagatga 
gagaccttat 
acacgtaatc 
tagcacctct 
ttgattctgt 
gttttgcccg 
cttcttcctc 
gctgctgttc 
ctcttccacc 
aatcgccagc 
tttgcctatt 
gttggaatct 
tttttaaagc 
acaagaaaac 
gtacatcaaa 
ctcctctttg 
tacaattata 
tatggtcaat 
ctgtagggta 
ctaattttat 
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gttaggtgtc 
cttccattct 
tgtcatgttt 
aggtctgata 
atggaacaca 
tttcgcccat 
tgcaattaga 
aaaaacactg 
gattacagat 
ccagttggct 
cttcttttcg 
atttggaggt 
gagtccggga 
atcttctaca 
tggaggaggt 
gggtagtaaa 
catcttaaag 
cttgtctagt 
ttccacagta 
ggctgcaatg 
tgaaaattcg 
aaaaacgtcc 
ttttatagta 
gtccatgaaa 
gttgaagaat 
ggtattccca 
ttggtacacc 
catgcatttg 
gaaagatcct 
attttctaag 
ctttggctcc 
gtatctgtac 
ttcttggggc 
acccttattg 
ggccatattt 
aacactaaac 
catgtaccac 
ataaatgaca 
gagtaagaaa 
cgcattcttc 
gcgcgtattt 
tggcccatct 
aagagttgta 
tttcttcatt 
tccctccaaa 
aagatcacca 
tctgcctccc 
ggatgttaca 
cttatatacc 
tacatgtatg 
gaacataaaa 
tcttctcgga 
tggagtacat 
taatagagag 
agttcgggtt 
tgaatagcca 
ttggtcgatt 
tgtttagtta 
accacttgga 
aaggat tcag 
attgaattgt 



ttacatttag 
aatgtcctct 
tgacgtacaa 
cccgataagg 
cattctggcc 
aatcctaatg 
ggtatactca 
tttaagtaat 
atgtcgttga 
agcgctagaa 
ggagttcttt 
acaaaatttg 
ccagtatcat 
tcctggaagt 
acatccactg 
cactgggtac 
agtttaacgg 
gcaaagtcct 
tagaagcatt 
tattctgggt 
tgacatctac 
tttagttttt 
caatcagggc 
gcatgtacgg 
tttggtttca 
ctcaatatct 
atcgccgctt 
gcgattgctt 
agttttctgg 
agcgaagcgt 
tctactcttt 
actatatcaa 
acatattcca 
tagtggttgt 
tcccaaatcc 
gttctagcgt 
tccttatgaa 
cttttcatgt 
atgcgtatca 
tcctccactg 
ttcatgacaa 
tgttttttgc 
aatggaactg 
aatagagata 
agttcatccc 
ctttctccca 
attaggtcgg 
ccttccatgt 
cttcagatgt 
tctttattgg 
gagcagcagc 
caggagcaga 
tcttgtcgac 
tcttgatccc 
cttccacaaa 
tgtccaaagt 
tagggatatt 
ctgaattcca 
a tgatcctga 
agttgcccag 
gaacaccata 



ggcattttac 
tttcatccac 
tatttacctg 
cacgcctatc 
gctccaacac 
cactctcatg 
cacctacaac 
ggactttaac 
gaagcccctt 
gaaccttgcc 
ttagtataga 
acgccaagaa 
atcttactag 
aatagtcttc 
ttacaatgtc 
agatcaggga 
gcggtctgaa 
tgtccatctt 
tctctgctcg 
cgacattttc 
ctataaactg 
cactaccagc 
atagttttgc 
ggaaataatc 
tgtccaaatc 
ttgtaacgtg 
gactttctcc 
cttctagatg 
ctgaaggtat 
tcggggagat 
cagtatctgc 
tcatcttggc 
tagcgtcata 
tggtggatgg 
tgtcattgtt 
ctcttatttt 
acacatcctt 
ggcggtgctt 
aatccatatc 
catcgcctgc 
acatgccact 
aactctttaa 
ggtcgtcgtt 
tctctttcaa 
tataagaatt 
ctatacccac 
tgatcaagat 
tgaacgaggg 
gaaaaaaaac 
ctaggaagaa 
aggatctgga 
aaaagctgca 
tgtgaactgg 
cttctcccat 
caagttgagg 
agtacgagga 
aggaagcgtc 
ttctcccagt 
aagtacatta 
gatctgtgca 
cttcataatg 



agtcttaatt 
agtgaacact 
tttaacaatt 
tacacaatga 
catttcccct 
gttcaatttt 
ttgttcaaaa 
cctattctga 
catattctct 
ctccggagac 
aacagcctgc 
accgcaggac 
ctcatcggtg 
cagttgttcg 
ttccatctca 
acatttgcta 
tctattatcc 
acacatctcc 
cttgatgagt 
tcctgctaaa 
taccatatat 
cgagaccttt 
agaacacgtt 
gtaagaggct 
cgcgtagcac 
tgcttgtatt 
ccaagactct 
catattttca 
gccgtgtcta 
tattagacca 
tgccaaaaga 
tatttttgtt 
cacgggcctg 
accacttaaa 
tcttccttca 
tgtccccact 
caaagtgttg 
cttgttctga 
tgatatcatc 
atttccacgt 
ataaatacga 
atgtccaatg 
acttttcatc 
aagaccggtt 
gcggttgcag 
aattcgttca 
tgcaacttct 
aatgaatgcg 
aacatgagga 
cacatttcac 
acaacaatag 
acaggagagg 
acagctctag 
gcatacatag 
gattgagctt 
ttaatttccc 
tgaatagatc 
tttatgagct 
cgattataca 
gtggatgcag 
tccct tctca 



tcctgtcctt 
ggatccttta 
tccatttttg 
aggacaaaag 
ataagattct 
tcatagacat 
tctctatcat 
aacctctccc 
ggcatacgat 
tctgggttcc 
ctacaatttg 
aatctcatgg 
ggagggtcat 
ggaggagggt 
ttaggctggt 
tcaccagttt 
accacagtaa 
atgaattcgg 
tcggtgatga 
atcgccctct 
tctccacaac 
tcgtacacat 
tcttgtcgtg 
ttcatgtttt 
aacccacatg 
acctccactt 
tcttgcagtg 
gcgttttcat 
gataccaggg 
gaaggcgaca 
tagggaacat 
ccaaaaagaa 
gcagcgtaca 
aatgcttctg 
tgcatatcag 
tgcaaagcag 
agagaaacta 
tctcgtttga 
tcggcgtacg 
ataatttcat 
tgaattagat 
aattcttgtt 
ttgatgttca 
ggattaatca 
atgtccagta 
taggtggacc 
gggacagcag 
ataaagtctg 
aaaaagtaaa 
aagcaccagg 
aggacttctt 
gagcttcctt 
ctgcgccctt 
tcattgaccg 
ggtcaacaaa 
atacagtttt 
caccacttgc 
ctctaa tcac 
tgttggacgt 
taggcatggg 
attgctccca 



ctttctcaac 
ttgattcaga 
caccttggtt 
aatgggctct 
tgacaaacat 
aaaaataggt 
caaaaccccc 
tggtgttgtg 
cagtcgagtg 
atcctcgaat 
tcacctcttc 
gccatttatg 
caaattcagc 
acggcaatac 
cacatacatt 
ctccctccac 
attttgaatc 
cctgaagaag 
gaagtttgtt 
cgaataaaga 
atgggcacct 
acattaggtg 
cttcaaaagt 
catgagaaca 
ttttattggt 
cgtcagatgg 
tctggctaac 
gttcgaggat 
acagtggtgt 
ttacggattt 
tgtagataga 
tatggtgtaa 
tgcgaagatg 
ttacggagaa 
aaccaaataa 
agaagggcca 
ccacatcttc 
tggcaacaag 
cctcacacaa 
ccaacatcaa 
ccttgtatgc 
cgttttggtc 
tgtcgagggc 
caactgtttc 
tggcctccag 
tcttattatt 
ctgcaactat 
gggcctccat 
aaattcaata 
gttattgata 
ctcctcttct 
cttgacttct 
tgtgcgtaga 
caccttgctg 
catacctctc 
gaatagttcc 
cataatcctc 
atattcgttc 
taaaggctca 
agcaacaaac 
atcgtgaatt 
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ggcaaagagt 
aaaatccctt 
gcaatttcac 
attagtgccg 
ccctgcacac 
gcatagaaat 
ttcatctccc 
ttgttgactg 
acaatttcag 
ctattgatgg 
atttggtcga 
ttctcgtaca 
gggcacatca 
cagatagctg 
tcaataaagt 
ctcttgtctc 
atgagaccag 
tgaaagtgga 
tcataaatac 
ggtgtgactg 
atgagatcat 
cgcataatca 
ccaatcttga 
tccattttgt 
taggtatcat 
ttctgagttg 
atgaatcttc 
tcggccagaa 
tgagatgcaa 
actggaagac 
gtgccattcc 
attgggatgt 
agatgtgagt 
ggatattttt 
agaaatcaga 
actgttcctc 
gctgtctagg 
tgcaccagaa 
gacgtttctg 
gaattttact 
tagggggtgg 
gccaatgtca 
tgtttctgga 
ctgttttcgc 
gggtcactcc 
gaccgctggg 
attaactttt 
agcttgtagc 
gttttatttg 
agtctgtaaa 
ccctctagtg 
cattctaata 
tccattggat 
gtggtagtga 
ctcccacgca 
acacccgcat 
aaacccccat 
aagtggccct 
attacccacc 
gaaggagcta 
cccgaccacc 



tgaaatatat 
tgctcaaagg 
atgaagcttc 
cttcttcaga 
ccaatcccat 
tgacatcaat 
ggtaatcaac 
cgatagaagc 
tgcacaaatt 
tatccttaaa 
ataatgcacg 
acgccttaaa 
gggaccaatt 
ggaaaagatc 
ccttcacgtc 
ctccctggct 
ggtgggtacc 
tgccgagtcc 
cctcaatgct 
ttccacaatt 
atgtttcaat 
tgtcctgagg 
tcaaataaga 
aatcgataac 
taactactga 
ttgcactaaa 
ctgccagttt 
aatcgtccat 
gttcttgagg 
aggcattttc 
tctttgagat 
tgttatttgg 
gaagaggctt 
gaaggagggg 
aggtgaggtg 
aaaaactttt 
cctgatgtca 
ccgactgttg 
ggcatagctg 
cttcctcact 
accgctgact 
ga t tgcacca 
acaggcgttt 
tagagaattt 
agtttagggg 
tcgggccaat 
gtgtgtttct 
ctagaattta 
agtaatatag 
agatgggcaa 
cacgtatcac 
gaggcagatt 
tgacactgta 
cgcacaaaac 
ctagaggcac 
tcttcacacc 
gtgcattgac 
gggtggcatc 
tctctctgtt 
gtgtgagagt 
gctgccaagg 



gtccctattc 
actaccctca 
caaggcacca 
ttcgaagggg 
tggcctagtt 
caccttgtcg 
atagggcctt 
caaattgcac 
tgaagacttg 
gcacacataa 
tgccttcacc 
ctcttcgcca 
cttcccagcc 
cctcgtcctc 
cagatgccaa 
aacctttttc 
actccatgac 
tccagcagtc 
atcatcttga 
aaacagtgtg 
gacagatttg 
gcgctcaacc 
gtattctagc 
agcatcgaga 
tgctggttta 
actattccac 
tccaaaatca 
ttcttggaag 
gttaattgcg 
aatcctcttg 
gaatgattgt 
agtattttcc 
ggacggtagc 
aagaaatata 
acagctgctg 
taaaaatttt 
agttccgagg 
ctccagaaat 
cgcccagaaa 
ctcacaaaat 
aggcctgatg 
gaaacgactg 
ctgggcatag 
tactctttct 
gtggaccgct 
gtcagattgc 
ggacaagtcg 
gtacaccaag 
tatccaagtt 
acgtacacct 
gttttcgccc 
gtttgtcaaa 
gttgttcgaa 
tcatgcaaga 
agaaaaaatc 
ataaaaggac 
gtcatgggga 
ggttagtaga 
gttcgtcgtg 
gaaacggtgt 
gcgactgtgc 



ttaatatttt 
aacagctcat 
tagtatatag 
attctgagtt 
ttcatattgg 
agatttctgg 
agggaaggga 
actgcagttt 
atgatgccga 
ggtgttccag 
acctctttac 
tggacgtcgg 
ttcactctct 
aaatcttcat 
tcagaaatat 
acagagacgt 
gaaatggggc 
ttggagatga 
aggcccagaa 
ggggaagcat 
atgtctgatc 
aagattctct 
gtcctcagtc 
atttcagcat 
ccagttccag 
tctttggtgt 
gggtggtcaa 
gagatggtag 
ttcttgtcaa 
attatctttt 
tgctggttag 
tatacgtctt 
tgtatttata 
tagtattata 
ctgctgcagt 
tctggatcac 
gtggaccgct 
gtgcattaac 
gggaaggtgc 
tttgaaaaat 
tcaagttcgg 
ttgctccaga 
ctgcgcccag 
ccctctcaca 
gggtcgaccc 
accagaaacg 
tttctggtct 
gataaagaaa 
aaaataacaa 
gagagtattg 
tattaccatt 
ataacactcc 
cggttttcct 
atgtgcgtgg 
aactgctgcg 
atgattcgtg 
attgctattg 
cagaaacaaa 
gctgtagtag 
gctgttagcc 
cgcaggtcta 



ccttccccat 
atgtttctcc 
tttcaaaaat 
tgaagaacaa 
aaatgcgggt 
tcatgatttt 
tgggagaata 
cctccgaatc 
cattttcttg 
tttcgatacg 
ctttgccttc 
aaaggccagg 
ccatgaagag 
taccggcatt 
agatggctgc 
taaagatttg 
ttccttttgc 
ttgccgcctc 
ggaagcacga 
gggtaaaata 
cgtgaatgcc 
tctttctatc 
cgaaacaggt 
tggccataac 
ggtggattgc 
ttttgtggat 
caatttttgt 
cgggaagacg 
gcttgggcac 
caagactaat 
aacccatttt 
ctcggctcga 
ccggcccggt 
cgtagttctt 
attagtagtg 
tcgagtttag 
gggtcgaccc 
ctttgtgtgt 
aacatactgt 
ttttctgggt 
agggtggacc 
aacgtacatt 
aaagggaagg 
aaattttgaa 
aatgtcaagt 
gaatagctcc 
aggaggcaac 
tataacaaat 
tcacgcactt 
tcccgacaat 
gatgcacaaa 
ctatagtaac 
gtataacatt 
gtagtttcat 
tggcagtgac 
ttagtcgtca 
caccacctta 
ccgtcaagat 
cttccgtcgt 
cgtgccccga 
ctcgaggagg 



gtcaaattgg 
cttttctttg 
cctcttgtta 
atctgctagt 
cttgtcaact 
tacaaccctc 
cttcacaaac 
actgtactgg 
gttagacttt 
tgcagaatta 
agcctcatat 
gcactcgtgg 
atcagatacc 
ctttctgcag 
agctcctctc 
gaggaacgcc 
cctcaaatca 
cttaagagta 
ggaaagttgg 
gtgcctcgac 
gacagccaca 
agtgggggaa 
gaagagataa 
attttcatag 
gtgcctcaat 
gttcgaacag 
ctttgcataa 
gtccatgatg 
atactggttg 
ttcttgctta 
atgttattaa 
ggagggtgaa 
gaaggtcgct 
tgatgtagaa 
caggtgcaaa 
ggggtggacc 
aatgtcagat 
ttctggaaca 
tttcgctata 
cactccagtt 
gctgggtcgg 
aacctttgcg 
ctgtatctta 
aaatttttct 
tcggagggtg 
agaaatgtgc 
tagaaatata 
acca tactat 
gtacattaat 
cgtacatgaa 
tttcctcctt 
aaccaggatt 
tcccagctgc 
ttctaagaat 
gcacggccac 
catctctcag 
tggtgtataa 
ggtgtcgtct 
ttttacaact 
cgttattgac 
tgacgacgac 



89820 
89880 
89940 
90000 
90060 
90120 
90180 
90240 
90300 
90360 
90420 
90480 
90540 
90600 
90660 
90720 
90780 
90840 
90900 
90960 
91020 
91080 
91140 
91200 
91260 
91320 
91380 
91440 
91500 
91560 
91620 
91680 
91740 
91800 
91860 
91920 
91980 
92040 
92100 
92160 
92220 
92280 
92340 
92400 
92460 
92520 
92580 
92640 
92700 
92760 
92820 
92880 
92940 
93000 
93060 
93120 
93180 
93240 
93300 
93360 
93420 
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gacgacgatg acgatggagg aactttcgat acagtagggt ctggtatact tggacgcaaa 93480 

aagcgtgccg cacctccacc tgaggatgaa gaagaggatg atttctaccg caaaaagcgt 93540 

gccgcacctc cacctgagga tgaagaagag gatgatttct accgcaaaaa gcgtgccgca 93600 

cctccacctg aggatgaaga agaggatgag ttctaccgca aaaagcgtgc cgcacctcca 93660 

cctgaggatg aagaagagga tgagttctac cgcaaaaagc gtgccgcacc tccacctgag 93720 

gatgaagaag aggatgagtt ctaccgcaaa aagcgtgccg cacctccacc tgaggatgaa 93*780 

gaagaggatg agttctaccg caaaaagcgt gccgcacctc cacctgagga tgaagaagag 9384 0 

gatgagttct accgcaaaaa gcgtgccgca cctccacctg aggatgaaga agaggatgag 93900 

ttctaccgca aaaagcgtgc cgcacctcca cctgaggatg aagaagagga tgatttctac 93960 

cgcaaaaagc gtgccgcacc tccacctgag gatgaagaag aggatgattt ctaccgcaaa 94 020 

aagcgtgccg cacctccacc tgaggatgaa gaagaggatg atttctaccg caaaaagcgt 94 080 

gccgcacctc cacctgagga tgaagaagag gatgatttct accgcaaaaa gcgttaaact 9414 0 

acgcacgaaa gtgacggtgg ttgaagaata gactaatatt gttgatatgt taaccccttt 94 200 

ttttcatgaa atgtgtacac acctgctata tatacgtgca tatttgaata aggaataaag 94260 

tttatctgca tctgtatttt tttattagta tcctttttgt gggataaata atcggtattt 94320 

gtgtccaaag atagtggtgc catgatgcca tatacaaagc acgtgatact ttccaagaag 94 380 

cagagtcgta tgaccattga cgtcaatgcg tgggcagagg cggaggtggt gataaagcgt 94 440 

ttctgagaaa cattgggcgt atgacgtcaa ctacattatt cttcctcctc ctcctcctat 94500 

tgcctctgcc agttcagtat tttatttttc ttcctataca ataaaaagta tcagatgaat 94560 

atttttactg tttttctgtt catccctctt tcctattgta aaaaaaccaa taactaacta 94 620 

atcatggata acttgaaagg ggaatttgtt gcgcttaaaa cagacctcac ccattacaaa 94 680 

acacagttgg atagatctat attggtattt gttgatgttg ttggtagatt atatgttata 94740 

gtaaatagtg aacaaacagc taaaaaggaa ggtctagcaa ctagagtggc aaagcaagcc 94 8O0 

acagagatac aacaattcaa ggacgaaata aacaacaaat ataatgctct aacaaatact 94 860 

ttggatgata tcatctacat ttttgatcat ggagggagtt tcaaaagagc aaaacataag 94920 

gccataattg aagcgaggga atactctaaa ccgctgaggg aattagagtg catgtttacg 94 980 

cgtatagcgg acatgttaac cttgactttt atgactgtgt acaccaatat cattactgaa 95040 

tttagacact ctagtgaaca agccactaat agtataaatg tcaccctcgg acgtcttttc 95100 

ttgtgtgacg acttgtgcaa tcaattacca aaagaagagg aagaagagga agatttgaaa 95160 

cagaaattca ttactttcca tgcgaaccta tacatgctgg acacacgcct aaagaaagat 95220 

ttgataattt tcaaagatgt catacaacaa cttcacgtga ttttgcaaaa ggatacctat 95280 

gctgtaaaag aaggtgtggc cattagatgt gcgaaacaga tgaacgaaat aagtcaatac 95340 

agggacaacc tcaaggataa ttacaataca ttttcaaaca ttttgaatga aattgtctac 95400 

atttttgatc acgggggaca ttttgaagaa gtaaaacaca aagccataac tctgactaga 95460 

aattacttga aaacactcat gggattaaaa tgcatgttca aacgcatatc cgaaatgttg 95520 

tcattgactt ttctaacagt gtacactaat gttatagcag aatttataaa cgctagcaat 95580 

atttctgata gagagatcaa taattatctt gtccaacttg taacatgtaa cgaattgtgc 95640 

aaccaactcc ccaaacctaa acaataccgt cccctcagtt tgatagataa catagcttat 95700 

ttttctcttt ctgtccaaaa acatctgagt gggtttcttt agtatgtgga gtattattgt 95760 

cttgaagctc ataagtattc aacctctcct actggtaaca tctcttcctc tctataatcc 95820 

caatatggat agttgttgcc tgatatcgag gataacacca gaactggctg gcaagttaac 95880 

ctggatcttc ataccagaaa acaacttcaa gattgtccag aactcactcc cagacgacca 95940 

agttatctcc caattcagat atttcgacca tagacattgc tatacgttta tggagatttt 96000 

gatggcaaac attaaaatcc aagacaggaa acaaaacacc acagccatat. gtgaattgac 96060 

aactggaaga gaaggacttt tatgtagaag aaccatacct gtatttttgg gttcagagga 96120 

aaaacgagaa gagttattgg ggaatctccc tgaaggtgca gaaattttca ggcctagaga 96180 

agttatgcaa gtaattggta ctctcttgga caagaaacta gaaattgacg acggtatagc 96240 

ttctgtaaag gctgccctct gtgctggttc atcatcgtta tacctaatca tgagccacat 96300 

agtgaaaatg accttttctg ctatcacaaa catgaaggat ataaacgaag aatatttcgt 96360 

agactttata tttcgtcata aacaattcct caaccctgaa ttcttcaagc accttatatc 96420 

tttgctcaag aattccagga aggaacatgt tgcccatcta gtaagacgtc tagaacactt 96480 

tctcatgcta tggacccttt ccaagatgag gttcacagaa atggaagaaa actacttccc 96540 

aatctccagc gatagtgatt acggcatctg tgaaaaatgt gcacgaaaaa ctcccaaata 96600 

caagctccgt atttttaggg aacgaaaatg ctgcgataga tgttgccgtc tttatcacca 96660 

acaaccgcct ccggaggtgt ataattggga tggaaaaata acccaacaat ccaataaagg 96720 

ctacattaat gcaggcgatg aaattatcgg catgctaaac tcaaatgata agggaaaaac 96780 

attccctcct atacctaaga tggttgtacg aagagtggtg gacggtgtct acgggcaagg 96840 

aactatcctg tcaaagattt tgaagttcag gcaggcaaat atccccacgt gtctattcgt 96900 

gacatgcaat aaatgcaata ggattttcag gctcactatc ttagggccta caagaaacat 96960 

cctttgccca ccttgcagaa agaaaagtgt tgcagtaaat acacaacaga aaggagaaaa 97020 

taaaccttcg tttgtgcaaa aaggaacaaa acgtctacga gtggataccg gtagcaacaa 97080 
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gaacacgtta gaaaaattct gttcctggga aagattcaat actgaagttt tgctcccttg 97140 
gcttggctac actattgagt ctaagtggca gaactgggaa tcttttctgg gttattcgag 97200 
taccagatat aaggaactgt gggcctttgt gaacaaacag gaaatatctt ccatgaaaga 97260 
ctcctacata aaaattgaag acatcgacca gttattgagg agtatcttgc aagaccagaa 97320 
gggtgtattt gagaccgtct gcaaaataaa gagcagagat ggtttgtgaa ttggccacac 97380 
tgattccgat accgactaaa aaggctgatt gatggcaacc cccccccctc cagactcagc 974 40 
cgcatgagta taaatatggc cacttctcac accacagcat cattccctcg tcatcggtcc 97500 
taccgtcaac ttccattatt actccaataa taccaacaac cccagaaatg gagtcaatca 97560 
aactgttcac cgttgctggt ctgaatatgg agcaagccaa ccaagtggct gaagaaatca 97620 
agtcagaata taaaaccgag gaggaaaaga ggattgccca ggaagtgttt gacaaattca 97680 
ccaaaaaact cattatgcaa gtagatacgt ctaaacactt acttacaaga gaaaacccca 97740 
accgttttgt atcccgcccc attgtccatg aagatctctg ggaaatgtac aaaaaagagg 97800 
ttgcctgttt ttggacattg gaagagattg atttcgaaag ggatcctaaa gattgggaga 97860 
aactcactca agatgagaag gatttcattc tccagattct ggcgttcttt gcatcctctg 97920 
acggaattgt aattgaaaat cttacaacac gtcttcgtca agtggcgcag attccagaag 97 980 
cgaggagttt ctttgacttc caagttggaa tggagagtat tcatggcaac gtctacggag 98040 
aactgattga tagactggtg cccgacgaaa aagacaaggc tatcttgttt aacgctgcac 98100 
aacacttccc cgccatcaag aagaaggagc agtgggctat taattggatg caaagcaata 98160 
acgatttggc ggaactaatt gttgcctttg ctgcagttga aggaatcttc tttagtggtg 98220 
cattcgcatc cattttctgg atcaagaaca ggggtatttt gcctggtctc acctcctcca 98280 
atgagttcat ttctagggac gaaggtcttc atcgcgactt tgcatgcatg ctgttgaaaa 98340 
agggttttgt tgatacccca tcaagagaaa ggattcttga aattgtcact gaagccgtcc 98400 
gaattgaaca agaatttctc acagtttccc tgcctgttaa attagtggga atgaactgca 984 60 
agttgatgag ccagtacatt gaatttgtgg cagataaact attggttgaa atgggactag 98520 
aaaagcacta taatgttacc aaccccttcc cattcatgga caatatttcc ctcgagaata 98580 
agaccaactt ttttgaaaag agagtcgccg agtatcaacg tgcccaggtc atggcttcta 98640 
tcaataagat caagaaggac caacaaaccc aagaaactgg ttctcctctc ccaattctga 98700 
ctgcacctcc tccagtctct tcctcatcat ccgaacaaga agatgttgaa gacggcgtcg 987 60 
gggactacat cagttatgac gatttttagt tccactattg tgtcaatagg ttgtgtattg 98820 
tattattatt gttataatat ttttaaaaaa taaatgttct ataagactaa aacaatgaat 98880 
ttacttccaa tattcctgac aacctttttt gttgcggtag atgcatgctc ttgctctacc 9894 0 
atctgccttt tacctgatgg gaagaaacaa cccttggttt ttgattctgt attagaagag 99000 
gtggtatacc ctacagatgt gtgtgggcca aagggagctg gcgaattatt cactggtgtg 99060 
gatcttttga ccctctgtat aggaggtaaa aacaatggag gtgaatggtc aggaaaaggt 99120 
ccttgtccaa ggatcaataa cgctgtcgtt gaacgagatt actcccttga cgaggaggat 99180 
tgtaaagggt ttagaaaggg gttccgaatt cctggcactg accattttca tactgtcttt 9924 0 
tccctttgtt gggtagacag agatatgcac gccaagtggg tgcgcaacaa aataaaccct 99300 
ggtatagtaa ctgatgatga agatttggta gattctggta ttaggactaa atttaaatac 99360 
tcttctaaaa tttttggtaa aggattcaat ccgagacctc tttactccct cgactatcaa 99420 
gagaggatta agatattaaa gtctcatttt aacaagagga cgggtaattt ctttgctcga 994 80 
ggccacttgg ctccggctgg agattttttc ctcgcttcag agagatgggc aacttttgct 9954 0 
ctagagaatg cagtacctca aatacagaac cataacaatg gtgaatggaa agatattgaa 99600 
aatcgtgcaa gaactacgcc aggtgccgcg tgggctgaga ctggaccaat attttaccaa 99660 
cacaagaaga aggaatatct agacaagaag aagaagtaca tccctatccc tcatgccctc 99720 
tacaagattg tgtacgacaa gaataacaag gaattgttcc gtgtacagag tgatatgtct 99780 
tggaaataaa atacataatt aaagaattta tattgtttta ttttgtcatt tatttgatac 99840 
atttgttgta tgttttaaag acaatacaga taattttcct ccatgacgaa acactggtga 99900 
tatgtctgga gctctgatga atgtcttgcc agatgccatc ctagtttttc tcctggggaa 99960 
ggtgacgtcc gtttgagaga gggttcgctg ggtttcaact gcagccaaga aaatatccct 100020 
ttgtccagtg acggtaaatt gtgtaccgtt atttagtgtg atgacattat ccctctctga 100080 
ttgtttcaaa gcttcataat cttcctcctc atcttcatag tcatctgcaa cattctcagt 100140 
gtctaataga acaataacag agttggggga aaaatcagtc gtgctacgtc cagactggcc 100200 
agtattttta ctgctccatg gtgtaggatc atgctttcta ggcttgtctg aatgtagacc 100260 
attatagttg gctctcccca tcatatcata cagtgtacac actagtgtat taccaatgcg 100320 
ttttccactc ttattgttat cacaataatc ctcatcttct ccaacaagag ttgagtacac 100380 
actggaatct gttagaattg tatgagaaac taatgcctct ttaaaaatat cgtcactctc 1004 4 0 
ttctaaccag aaatctgata cactatcatt acatttattt gcagtattat caataaatga 100500 
attgagtaaa gggtgtaaag gttccacgcc aaataaaggt ataggcacaa actttgcctc 100560 
atccgtgcga ggtaaaaatg gcctgtcttc gttgtatgat tttgaaaagg ttgttgcaag 100620 
aacacaattt cttgcatttt ttaatgcctc agctgatata cgtagacgta tactgtttaa 100680 
cttttgctct gttgtctggt tattaaatcc tccacacaac gagtacatgg cattatcgat 100740 
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attctccctc 
gtctttaccg 
ttcacagtgc 
attggaacct 
agaagggaac 
gacgaatgtt 
agctggaaga 
gtgtagagat 
ctcctttgtc 
cttcccatct 
catcagggac 
cgaagagata 
tctgtttgtc 
gtatgtgagc 
attaacatcg 
caaggcataa 
gtctgccatg 
tgctttctcc 
atttttgtca 
gtcattatct 
caattttgag 
gaaaccgagc 
ataagcaaat 
ataattcgtt 
tggaaggatt 
aatatctgac 
gggctttttt 
gtttgccatt 
ttctggagca 
ttcaacgtgt 
ggtgcaggcc 
tattatggaa 
actatttact 
cgaaaacagt 
gtgccgtata 
gagtgccttt 
gaggtacttt 
taaagacgtt 
atttttttaa 
aagtgggagg 
taattatctc 
agaaatcgcg 
ccaataattc 
aatttgatcc 
ttactgggtt 
aagaagacga 
taatgctcga 
tgtctataca 
ttgtcccaac 
aagaaggaac 
ccttaatcat 
aacagtacat 
cccctttttt 
gcacaatgaa 
tgcagaccat 
cacaaattta 
taatatctgt 
accagaagat 
tataggaggt 
tcctgcctat 
tcctcctcca 



gcatgatcct 
agaggaagaa 
acagatctct 
agcctattcc 
aaaaattgaa 
tgacgttgtt 
gggtgttgaa 
tttgtgtatt 
tttaattctt 
tgccctcgtt 
atttcttcca 
aaagacccta 
atccctgaac 
ccacaagcaa 
ttcgtaaaac 
gaagcagaca 
tagccatctg 
actctatctg 
ttagagttta 
tcattggagg 
aagggagtat 
aggtaaagtt 
attaatcccc 
tcaacgttaa 
tcaaaattct 
cttatgtatt 
gcatttttaa 
gcttcatcgg 
attttttcag 
ctagattcta 
aatctctcca 
gcaactccat 
ccatcacgag 
cttctggatg 
actctgtcta 
tctgcttcta 
ggcacgcacg 
ggctgtttta 
ttaacacacg 
agaactttcg 
tactatattt 
agtcagtaaa 
tccctacatg 
caaaaatatg 
ccctaaagat 
agaggaacag 
atctctgaga 
aggttctcct 
ccctaaaacg 
aggagaaaag 
gggctcttgt 
gtacacattt 
agaaaaattg 
aagttgatca 
gtaatgaaat 
ttaaaacaag 
ctgttaaaaa 
gttgcattct 
aggcccaata 
ggttatctca 
tctccagctg 



ctatgaccaa 
tatctgcatt 
ttatgaattt 
aaggaagtga 
gtaatctggt 
ttagtagctg 
agccacagac 
cctcttcgtg 
ccaagaagtc 
tcatcacgtc 
actttttagg 
taacattctc 
caaagcctat 
ctgtcaattt 
tctgtttata 
cctttcgtcc 
attcgtgttc 
ctacaaattt 
ttcttgcata 
atgaattttt 
ccagtacgtt 
gtacacatga 
tgatgtggtt 
agtacctttc 
cactaaacat 
tttcataaaa 
gagtatccga 
gcaataattt 
catataaacg 
ttgtcttcac 
tcttcttgtt 
ccaacatgtg 
cagtgaagaa 
atgttttcag 
cgaattccca 
tagttgtagt 
tgtgcactga 
cacctctcag 
caaaactgac 
cctgtggtcg 
gccatctact 
gaccccttgg 
atgttcaata 
aatgaagagg 
gtggaaaagg 
acgacagata 
ggggataagg 
ggtacaaaaa 
gaaaagtatg 
gcagaacatt 
ggctaccctg 
gaaccaataa 
gaacaacaac 
aacggttaca 
ccatatcaga 
ttgaatatgt 
tcctagaaag 
tcaattctaa 
tgacggtgcg 
cagatgataa 
caagaagatc 



accagccttg 
gaattttgaa 
agaggctaga 
attttgatgt 
ccaattaacc 
aagagacact 
gccaaatctg 
gatttttctt 
ttgtacattg 
gttgttctta 
gtcttcatgg 
aatgcaatga 
tactggaaca 
cttcagaagg 
cttgtgagca 
caaaagttta 
ggcgataggt 
cactgcttta 
ttggagttca 
agttttgttg 
ttctctacat 
aaaggcacct 
gctagggaat 
caatctacga 
acccgacaac 
gtcaaaatcc 
aagatgcacc 
gctcacttca 
ttgccatatt 
atcgacaaga 
gtagtagtct 
gtacagggct 
atcttgaagg 
tctcatgtca 
tgtcaggttt 
agaagttgaa 
ccgagacaca 
aggaatccct 
gctgaaatcg 
tagggatact 
acttttattt 
ctatacgcgt 
caaacattaa 
aacgagcaga 
aacaggtgca 
ctaaccccaa 
ctggattttt 
ctgtctgtaa 
ttccttcaga 
attccttcac 
tgatgaagga 
aggaaaaaaa 
aatgtcgttg 
aacctctgta 
ctacataatt 
tttcgatgaa 
atgcgcacag 
gatgggggaa 
ggttaatttt 
tgatactact 
cccttttttc 



gtcattagaa 
attatatcat 
gatatgaaat 
cttttattat 
gcagctgcag 
tgggttggct 
aacgtcatct 
ctcctttcat 
tec teat teg 
aacaggegeg 
atgtccttct 
actgeatact 
accatgccgc 
atetegtatt 
acccagttca 
gaaatttgaa 
gtcctcatca 
acaatacttt 
caggggtaat 
tegttcttta 
gcactagccg 
ccaaaaatat 
ctaaagttgt 
ggcactttaa 
ttgtttgatt 
gtcatcgtta 
aaaccagttc 
t tgtatgeat 
ttcttatcag 
aagtatcctt 
atcctcttcc 
gattgggtac 
aaactgtata 
gttctctgea 
teatttaegt 
tccataatgt 
acggtgtact 
ctaccagaat 
tttegcaata 
ttcggtggtg 
ccgtccaaag 
agatgetaac 
ggctagaaat 
atttttcaag 
gaagaagaag 
aaataagaaa 
gatagatgag 
gaaaacgacc 
tgaaagattt 
gtctgttaag 
tataagcaaa 
gtaatgtctt 
gctgtgacag 
tatcacaccc 
tetegtcgat 
gaaacaggag 
aaaggaggaa 
gtaacgcgcc 
aaacatggge 
actgttactc 
acacgcactc 



aatggtctgg 
tgacaacacg 
gtctatttaa 
ccctctcgtc 
tttctctctg 
tgtcactact 
ttagattggt 
tgggcacaaa 
egaaegttae 
cagaaagctg 
tcctcttccc 
gtctcaattt 
acttgtggtt 
tttcacttcc 
ctataaataa 
gggagageca 
ttgtgacgga 
ttctgcacgc 
attcttcccc 
tatactgttt 
cagaaagagt 
taccaatgaa 
gctccagact 
tgtttatgtc 
ttcccaatac 
aatcagttgg 
tgtagttgta 
cgacaacttt 
gatctagtag 
caegtatact 
tgaagcaaga 
accactcgtc 
tagagatgaa 
taagtttctc 
attcttttag 
aggaatgagg 
gaacaagggt 
aagtcctttc 
atgggagagg 
ttaaccgcca 
gaagggttaa 
catgtcctct 
acactaacag 
atccattatt 
gaggaagaag 
gtggaatatg 
aataagacca 
ctaccaaagt 
ttcaggatgg 
acaggaagaa 
ggtacaagee 
tgtttatttc 
aagattaegg 
ctctattagg 
tcatgaacta 
cagttatagc 
tatatgatgc 
tatttactat 
agacaaataa 
ctcctgttac 
tcatatccga 



100800 
100860 
100920 
100980 
101040 
101100 
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101760 
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gtcgtcttca gttgaccatt atgtattgat gcatgataac ccaaaaagat cttcatttaa 104460 
ggtgtatgat attcacgcag aaacctttcc ccataaagct ccttctgttc ctaccttccc 104520 
ccctaaaacc tcgtttgaaa tttctgacgt gactctcgat tgttcaatgg agattttttc 104580 
acgagacagg gatgttttag acaatgttca cgactatatt gctaacgacc ccgtaccatt 104 640 
tttagtggat gttgtgcacc gtggatctag tctccgttga gtaagtactt tagttgcagc 104700 
cggtctcatt agacctgaaa aatacccttc atttgttaga aaagtgtgct tagatcaaaa 104760 
tggcatccac ttcttcatca actaagaaaa gagtacatga agaagatgaa aatctcatcc 104 820 
cacaacccaa aaagaaaaaa tcaaagaaag tactaccatt tcctgttgac aagtatagag 104 880 
ctgtggataa aaaggtggta aatctcatac acaagatatt agatcaagaa aaggaccacc 104 940 
tttctagtac cgaactgcaa atgataactg aatgtaatgg tgcgcgagaa gatctgctta 105000 
aacatcttct agacgaagga gaatttaacc ctactataat tgaagtagta tcatccatgc 105060 
ctattgaaac aatatacgaa atactctctt cttctgctga cgacaagaag tttgtacaga 105120 
tatcattatc aatgttgatc cacatacttt tcttcgctga taagggtact atgtgggtat 105180 
ccaacgcgtg cgttcaaaat gttttgggga acgactataa agtggaattt gaaaatatac 105240 
gtaaaaagta tctgatattg gaagacttac tgaacggcgt ttcaaatcat tggtctgaac 105300 
atggtcctct ttctcacatg ctccattctt caatccctat tgtacaagac atgttattga 105360 
acaggctggt gcgttacttt agcacgtatg atggagatgc tcaattcgat atatcattca 105420 
taattaatag tgtcttgtgg ggaattgata aaagtgttct caacgaattg acacaattga 105480 
tatcgagggg tgttttcatt gtgtcgtacg taccgatgcg tgtacgaaca ccttcaaagg 105540 
acagtaatcg gccacaaaat actccttcac aaaatatgtc agcactaggt atgaaactca 105600 
atacattttc atccagaatc tcagtgtaca gaaacaatac ctttaaaaaa ctaaccgagt 105660 
tagtgcataa ctttgattac ggttccaaag atgcatcatc atcatctcct cctcctcctt 105720 
cattatcgga cagcgtcaac acttttgtga ggttgtacac caactatgac atattcttaa 105780 
aggtgatttc cgactggaaa atgccttatg ggttctttaa gaaaactttt gacgtccttt 105840 
attctaaggg gttgatgaca ttatcagtgt ctgaatatac actcaaaaaa gagttggtta 105900 
cgtttttgcg cgccttgaag gaaagggaaa ttttaatcta taaaatggag aagagagaca 105960 
ttatatgtat actgaaaaag tctttgttcg gatttaattt caggtgttta aaacaattac 106020 
tccctctctt caaacacttt ttaaaaattg aagaggttaa acatatagca cgttttgtct 106080 
ttagagatta cagtctcatg tgcaaaactc aaaaagattt gcagagtttc cctgccatac 106140 
agtctgcttc acttttcatg gaagaattcc cttggcttgc aaaaacttgg atcgacgacg 106200 
atgatgatga aggaggaaag ggacataccc tattaacatt tgctatagtg cacagatatc 106260 
ccttaataag ccaacttatt tcacacccaa ttttaaaatc gttagtgaat actacatgta 106320 
gagacaagca ctttactccc ctcatgcacc tcgccaacac gtctataatg taccaatgca 106380 
atacactctt gtgccttata ataaatggag ctaaaccaga gttcataaac aagttcaacg 106440 
agaatgtttt gcatatagcg attgaaaatg ttaactatgg agtcatcact gaattgagag 106500 
gaacattatc cagcgaacaa attgaaaaaa tggtcaatgt aagaagaatg atggataata 106560 
caacaccttt aatgatcgcc ttggcgaggg agaatattgt actcgctcag ctttttgacg 106620 
gtctttacaa gcccaaaata aaggtccgtt tcggttcttc aaagaggcta aggataccag 106680 
agtttgtcct cttaaagggc ctaaaggaat cagttgcata tttggaaacg aggaatatat 106740 
cctacgatat taacatcata aaggatgcag taatggacaa cagtcttttt gaagaggagt 106800 
acgaaatagc agcagcagga ctgcgaggca ataactgcga ccctgaagca gacgagaaga 106860 
ctatgaacac gtggaacttt ttcaccaaaa attcaaccaa atgggcaagc tctattttcc 106920 
aaaagaatag gcagaaattt gtaaagattg tggatggtat gaataggaca tatgaagact 106980 
ctgaatgtgc aatatgcttg gatagtctgg acggggatct tccttcaggg agaacaacgt 107040 
gcggtcattg cttccacaac gtctgttggt tatccttgat aaggatgagc gggccgaata 107100 
atggcagccg cgcaagagga ggaggaataa aatgcccgtc ctgcagacaa gtcacctgcc 107160 
tcggaaaaag actaggggtt gccgactatg atattgaaac agaggaagaa cgtgacacga 107220 
aaaatgtcgt gccttcggta gaagaaggaa gaagggaatg gaggaagatt ggtgttgaca 107280 
gatatgaatt tcttgtaggt ggagtgtgga caaatgaaat aaaactataa aatgaactag 107340 
aatattggta ttatttttac accttacaca tgttccagaa acgtctgttc cagaaatgtg 107400 
ttttgagatt tctggacaag ccatttctgg agtaatctga cattgggtcg acccagcggt 1074 60 
ccaccctcgg aacttgacat tgggtcgacc gagcggtcca ccctctaaac tcgagcgacc 107520 
cagaaaaatt tttaaaaaat ttttgagatg gagaatgagt aaaatctctt ccctgaaagg 107580 
gaggtctgaa gaggctatga cggttctagg tgcccgccct ctccagaaac gtctgttcca 107640 
gaaatgtgtt ttgagatttc tggacactcc atttctggag taatctgaca ttgggtcgac 107700 
ccagcggtcc accctcggaa cttgacatca ggccgaccca gcggtccacc ctctaaactc 107760 
gagtgaccca gaaaaatttt taaaaatttt ttgagatgga gaatgagtaa aatctcttcc 107820 
ctgaaaggga gatctgaaga ggctataccg gttctaggtg cccaccctct ccagaaacgt 107880 
ctgttccaga aacttactaa agttaaagta cgtttctgga caagccattt ctggagtaat 107940 
ctgacattgg gtcgacccag cggtccaccc tcggaacttg acatcaggcc gaccgagcgg 108000 
tccaccctct aaactcgagc gacacagaaa aatttttaaa aaatttttga gatgtagaac 108060 
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gagtaaaaca 
cccagaaaca 
tctggagtaa 
cgacccagag 
agatgtagaa 
gcgcagctat 
gagatgtgcg 
ggaaaatgct 
tgagtaacca 
aacaagtacc 
aaacgtcacg 
tataaacaga 
caactcgggt 
acattcttat 
aaagagtatg 
aaaagattgc 
atgaagaagt 
gattgggaga 
ataatagtgt 
caccaccatc 
atgtcttcca 
agatggacaa 
ccccaacaac 
cctccattcc 
gaccagtgca 
agggaaagag 
agatcttccc 
aaccaacaca 
cttgtgaatt 
gctcgcgccc 
gttgtcatcg 
ccagatattc 
gattcttcta 
gtttattctg 
atgaaattta 
agtgcgcctt 
ctctttggtg 
tcaccctctg 
tcatatccaa 
aagtagaggg 
gttgtttttg 
cctcttgtac 
ggtgtcaata 
tcatagaaac 
ctaacgcgga 
tcttccaaat 
cttcctccta 
gttggttcaa 
tctatgtctg 
tagggggaaa 
aagtccacta 
atgtggaaaa 
aaaatgtcat 
tcgaacgtac 
taaggagtgt 
ttggaagata 
gttttagttt 
gctgcagaga 
ctatttcctc 
actttgggac 
attttatcga 



ctctagtgga 
tctgttccag 
tctgacattg 
gtccaccctc 
cgagtaaaac 
acccagaaac 
cacgtatgca 
atttaataaa 
tgtctagcgg 
ttctagatgg 
ccggttccct 
gacggggaaa 
cagtatcagt 
ttgctcagta 
ttgagagctg 
gcttgtatta 
acaacaagat 
agtaccatgc 
ctcaaaggat 
gaggcgggta 
accgattcag 
ctgtcaagaa 
aacaacaaca 
ccatcattag 
gaaattgcga 
ttcgctcatt 
ccttccttaa 
tgtaccccct 
ggcgagaccc 
ctacacccac 
aggaagaaac 
ccgcaataac 
cagtatttct 
cattatttgc 
aatctttatt 
ctggcaacac 
ggaatggtcc 
aaacagaaac 
aatggtgatg 
ttgaaagatc 
ttccactaag 
aagcactctg 
caggaagtac 
cgaccatgtc 
gattttctgt 
ccacataatc 
acatttccat 
tgcttgcgcc 
taaagcaagg 
atgttcccac 
tccttctagc 
tttcagcctc 
tagatacaat 
ccttcctttc 
aagccgtact 
tatcccatat 
ctccgatagt 
gatgaacatc 
tgctaggtgt 
agaataaacc 
atatgttttt 



ttagggtgtt 
aaacttaata 
ggtcgaccca 
taaactcgag 
tctctagtgg 
atctgttcca 
tgcatgcata 
acaaccacca 
aaaagtaacc 
aggtgcagca 
ccacgataac 
aacaaacagc 
tgtccaacac 
caatatgggt 
caagaagcac 
ctttaacaag 
aatcttccct 
ttctattcga 
gtaattaaaa 
taaataaggg 
tcagctgagg 
caggaggaac 
tcaaaaaaga 
ttggggagcc 
tgaaactggc 
aaacaacact 
taagaagaag 
caacaagtac 
ttcacaagaa 
tccccaagaa 
tgcaccagaa 
tgcaactact 
cagaaatgtt 
aaaatgtatt 
gtttctttcc 
tcccatgtcc 
tctatctgat 
agttttttct 
gtgtgcaaat 
atcaaatggt 
atctgaaggt 
agcagctgta 
atggactctt 
tttttgtcct 
agcctttcct 
tgaaatagat 
attgctgtta 
ttcatttatt 
cgttgcatac 
gttaaatgta 
ctttgcaccc 
cgcaggaggc 
atctgtcata 
gtccaaaaca 
cctacgcatg 
attttctgtt 
atttagtatt 
gtttttaggg 
tgccgctata 
gttggtgcaa 
actattagga 



aacagcctac 
aagttaaagt 
gcggtccacc 
cgacacagaa 
attagggtgt 
gaaacttact 
cgtacctatc 
ttctccgctc 
tacgaaatcg 
atctgtctgc 
ctcttcaaga 
aaaaatctgg 
ccgacagaac 
aatggaagaa 
gaaagggtcc 
tgtcttcacg 
gccagaattg 
gatttctcca 
aataaaaccg 
gcgctggcac 
ggcaacgagg 
agacagcaac 
acatcaacca 
ctcggcagct 
tattacaatt 
ccaagtgaag 
gacgttgacg 
agtttccgtg 
aaacaggaca 
aaggaaccag 
ccagaaccag 
actactacta 
attctgagta 
agatctaaga 
aattactcca 
acaagtgcta 
tgacaaacaa 
cttgtataat 
ttccttagat 
tctgggcctt 
ctcatgaaat 
ggtgcgcaag 
ctcatgactc 
ccctctacat 
tgcttattga 
aatctcctct 
actacttcca 
gtcgcagaat 
aggctcacta 
gaggcagtaa 
tcttgacccg 
acgcccggca 
ccaacatccc 
gcagcaggta 
gcacaaaatt 
gcagctttta 
ctagcgtctc 
agggtagtca 
gggtcagttt 
ttacagttct 
tcttgca tct 



cctttctggt 
acgtttctgg 
ctcggaactt 
aaatttttaa 
taacagccta 
aaagttaaag 
tgtatttttt 
actatacagc 
ttgaaggggg 
agtctaattg 
tgctaggatt 
ccataattga 
cagaaaggtt 
aatgttactt 
acaaatcttc 
cgatcgccaa 
ggtgcgcggc 
caatcattga 
tcgctggcaa 
atgggtggca 
agatggttgg 
agtattccca 
attctcctcc 
actcaatgta 
tccactctta 
gcatgtggcg 
aagctccacc 
aatatactcc 
agatcttaca 
aagtagaaac 
aaccagcccc 
cagttgcaac 
tcgtgttttg 
aggaataaat 
ttgaaattgt 
aaggttgtat 
aggggaatct 
aatatctgct 
taggtccccc 
taactcttac 
ctgcataggt 
attcgaatcc 
tagggactct 
actttgatga 
ttgtgaaaac 
tctttgtact 
ccacaacttc 
cggccgatat 
tatccttgac 
ctccatgaga 
tttttgctgc 
agcgttcata 
ctaacaattc 
ccttttcttg 
tattaggggc 
ggtttatctt 
cggattgagc 
tgaggggaaa 
tgttcctacc 
ccttttctcc 
caaaaatact 



cgcagctaga 
acaagccatt 
gacatcaggc 
aaaatttttg 
ccctttctgg 
tacatttctg 
taaaagtagg 
tgaccttcgt 
attgttgaac 
tgttgcaaga 
tggcgacccc 
agatagacct 
ttgctccatg 
ccctaacgac 
cacagaaatg 
atcacctgca 
agctggagga 
taaggaagtg 
cccccgccac 
cactcgcatc 
ggactattca 
tagtttccgt 
tgctccacct 
tcgactggat 
tgatagaaag 
gcgcacaagt 
tcctcaatca 
ttcatcaaag 
agaggaagaa 
taaagatgat 
agttccagac 
acgtcacgac 
gtttctgggt 
aaaatggtat 
cctccttgtg 
agttccatcc 
atccttcaaa 
cattaatcct 
tacgcttcta 
cccctccaat 
tatttccaca 
tccatggtag 
tttactcgct 
tgatgatgtt 
ttccgcatct 
accggcagtg 
actcccatca 
tgttggcgaa 
accctcaacg 
agactcgaaa 
atcccttata 
atcctccat t 
tgcgttccta 
tcttgtggaa 
caacataaca 
tcttccatta 
tctcaggagt 
acgtgaatgc 
ccctccattc 
caattccccg 
cttttctcca 
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atattggcag 
aacgaagaat 
agggcacccg 
gagtacacgc 
aaccgattag 
agtagcgact 
ttgacttcac 
tgcatggacc 
ctggaaggta 
gccgcagaca 
gaattgaagg 
gttactcgat 
ctcgcgttga 
atttccatag 
aggattgctc 
tcgcgcatac 
tcattagaag 
tcccccactt 
gaatttgtag 
tgcgctgtaa 
tcattgtata 
aagttgggga 
tttgtaacca 
gtcaaaagag 
ggtctatagt 
ttttccttta 
tcgtgagatt 
tcagaatctt 
gtctttactc 
tgggacgcta 
ctatggggta 
ccagctccaa 
cttcctgttc 
tgttcgtcag 
cgcaataatc 
ccaaaaagag 
ctagggaggc 
agcaataatt 
agatctatac 
tttgttccag 
tccttttcct 
ctctgctggg 
tcgaaatcga 
cccactcctt 
ggatctttat 
tttttggtga 
tagtgcaaat 
gtagcaacat 
agagggacac 
tctctactgg 
tcgttgcctc 
gaactaattt 
actccagcca 
ataattcccc 
gtatctacaa 
attataaatt 
tgttgtctag 
gccgtggaag 
caatgaagaa 
gaaagaagtc 
atttaattga 



gaagtagccg 
gtgattgcgg 
agcacgggta 
cgttcaagta 
ggatcaattc 
caaagagagc 
attgttgcca 
cgaccgtgcc 
tgggtaaagg 
ctcccgcagg 
aggttttaat 
cataagtgtt 
ccggcgatct 
gtcttacaag 
ctgcaaaaat 
ccgtaccttc 
ctatgcgcca 
tcttaaacac 
acaatatttt 
caggttgcag 
ttgatcttgt 
aaaaatagtc 
tcatagaatt 
agtgcttgaa 
gtaccaaggg 
ttactctatg 
ttctcagttt 
taaggaagga 
ctgaaggatc 
ttatgcccac 
aaggcacgac 
ccagcccatt 
cactgtaggg 
aatcttcaat 
ctcccgtaat 
aagcggcata 
acgcgtcctt 
ttgcaccctc 
cggccacttc 
tatctcctac 
gtataccacc 
agtagccagg 
tatcgtccac 
ttcctccaca 
tgagaaggga 
tataagtcgt 
aagcctcctc 
catcgtcgtg 
aaactccgtc 
atttgacgca 
ctacgcgaca 
cccctggtaa 
taccgaaggg 
cattttcttt 
aagagctcgc 
gggaagaccg 
gaacaaagaa 
caagaagagg 
gcgtgcagga 
tggagcccgc 
ttttttttct 



tgtttgttcc 
gcccaggggt 
tccgaaagat 
gtggccggcc 
tggcctaata 
cataggagga 
taattcatcc 
gttcgagact 
aaatccccct 
aggagcgaga 
gtctgttgtc 
gttgatggct 
gcgttcgaac 
aggtgaagat 
atcgggcttt 
tattccatta 
cgcagttgga 
tgcagatgaa 
ctcactgaat 
aataatacct 
attgatgggc 
cagaagtaaa 
taatgtactg 
aattgtctcg 
caatatctta 
gagtgaagga 
ctgtaagggg 
tgtttctatg 
gcctgctctc 
agcatgatct 
cctgagcgat 
atcgtttagt 
cactacacct 
tcttgagtgc 
cagatggcct 
agaatagccg 
agtactacca 
tacaaagggt 
ttccagtctg 
aaccccatac 
ttctttcttc 
ttcgcacaaa 
atcatcaata 
cagtacaata 
aggatccttt 
tccctttttt 
gacagtagta 
cttgtcgaac 
atattcatca 
agttcctccc 
ttttttagga 
tagagattct 
ggcgtccgta 
taatcctttg 
ttaaccacag 
atacagtctt 
acattaaaaa 
tccaccactg 
aagaagagct 
aagtcaaggc 
tctgaataat 



tctacataac 
ttggggacca 
tggccagttt 
gtgtgggggc 
aacgtttcag 
gggatgaaat 
ccttcatgga 
gattcttgga 
aaaactccca 
ggcttagaaa 
gaatttagaa 
gtcaagttgt 
caggtgtgtt 
gcaccattca 
tttgataacg 
tttagtcctc 
gggtgagcaa 
ccagccttgt 
cttgcaaagg 
ccaatttctt 
aggccagacc 
tttgtaacgt 
ttcgcaaaag 
atgggagtgt 
tttactgttt 
acgacgtatg 
ttatctttgt 
gtcgtatcgt 
atgtgaaccg 
atgctatcga 
gcgtttggat 
gcatctccgt 
cctacagatt 
caattgccgg 
ccgttaaacg 
agagccgaca 
cctaaagttt 
tgagcataat 
atttgtgtac 
ctgaacatga 
tcacattttg 
caatccatag 
attgagaatg 
tctgtgcacc 
cgcaccacag 
cctcctccag 
aaagggttga 
gcagaagagg 
tcatcttcta 
ccgagagaac 
gttttaaaac 
acaaaacctt 
tcgacatata 
acgcgtaaag 
ttgcactcac 
tcatgacaaa 
tggttgcccg 
ctggacgcat 
ccactgtccg 
gttaattctt 
tggaaataat 



tctttataga 
tcattgtagt 
cttccctgac 
tcagacccca 
aggaaagagt 
tgagggctcc 
attttagagg 
cactcttttg 
aaaatgatgt 
agggtcgtgc 
tagctctaat 
cggccgaacc 
cggggagaga 
ttaaaactcc 
tttttgcgac 
ttcccgagaa 
acatttcata 
aatgatcaat 
tggaactttt 
tgaatttggc 
tctcttttcc 
cgtcaaaact 
attggtctct 
gggccattcg 
tatcattttt 
ctgttgcgca 
acgaatcgta 
tgtgcgcatc 
tctcggggaa 
ttcctaaacc 
gttcagtaaa 
tccagatgtg 
cactgactgt 
cggcgtccat 
ctgtaatttc 
tcctcgtatc 
cgtacctttc 
ctgtttttgc 
tttcatcaca 
cagggcagcc 
gggcgttggt 
tagaagtcat 
ttgtagggtg 
caagaatggg 
acgaatctac 
ttatgactcc 
cgtttctgta 
ctgaaaaaat 
taccgtattc 
gacagctaat 
agtcaaatgt 
tccctatgat 
aggggtcgaa 
agtttcgaga 
ccgggtacaa 
ataccccgag 
aagctccaag 
ctccaagcgg 
tcgccgttcc 
ccctgtacaa 
aaaacatcca 



aaatttagat 
gtactgagag 
acaaggcggg 
atctgctctg 
cctaaccctg 
ttcagtttcc 
gactattcca 
aatgagaaaa 
agcttgagca 
tggagcgtac 
gtctcccttt 
agggtctcct 
agatgcagga 
atctccagtg 
cctttcggcc 
aaatgtccct 
tgcttctttt 
tattttggga 
cttagcaccc 
attgtttggt 
ttcgccgttc 
attactagaa 
ttcttgtgcc 
cttagcggta 
agcagaaggt 
tatgccagag 
ccctgccttg 
ccttctgtca 
gattttaccc 
gggaatattg 
atttccccct 
agcggccaca 
gagttttcct 
ggcagattcg 
aggtgctctt 
gtttcctggt 
tgttatttgc 
cgcatcggga 
aagacagccc 
tagaccttgt 
ggcagggtct 
actgcttatg 
tataggtcgc 
ctctaatctt 
acaccctctt 
tccagattcg 
ggctgcatct 
agtatcggcg 
agttttcaat 
gcattcaaaa 
gttataatgg 
gtctgacttg 
agggttggga 
catggctcct 
taagacaaaa 
aacaaaagat 
accaaatccc 
aggagcccat 
tcaaagagcg 
caactatgtt 
ttgaaactta 
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tgcagtattt 
agggtacaga 
aagaagtgtc 
caaataccag 
cattttgagg 
agagacggcg 
aatgcttctt 
acgacaccac 
tgtttgcacc 
ctagacgtac 
tggctagccg 
cgcgtaggcg 
tatttgctcc 
ctttatttac 
ttgaagtgtg 
tagcgtacac 
gacgttcttg 
gagtatctgt 
gccgagggta 
gtagggggac 
gaggaagtgt 
cccttacagg 
cagacatgtc 
ctttatcgtc 
cgtcaataaa 
tagaaccaga 
catcatcatc 
acgtgatact 
aatctaaaac 
gctgttcacc 
cagtcaatgg 
cggcatattc 
tatcagtttt 
tgggtccagg 
agcacccgac 
cagtacttat 
tattgtaccc 
ccaacatttt 
gagctgggtt 
cctcggattc 
ctggtgtgtt 
ctgaaagttg 
gggaaagacg 
gagtgtattc 
catttctaca 
aatcgaataa 
cagtgttgaa 
ccacgtgttt 
cattggtgat 
gaccactatc 
cttttatctc 
gtatcctcac 
tcattcgaat 
gagaaatatc 
gtaacaattt 
caatattttt 
ttttcgggtc 
cagctttttc 
taaaattgtt 
gaagcgtgaa 
caaacattgt 



ttattcaatt 
tatattataa 
tatgaatttc 
agggtctact 
gattttaaca 
atgtagtgct 
tccttcattt 
cttgtcttga 
attattattg 
cagaaattgt 
agtctgtaga 
gccagtgttt 
aaaaacaaca 
cagaagattt 
cacaatttgg 
tttctccccc 
ttccttatcg 
attgacaatg 
ctttatttct 
aagactagtc 
agtgcgacag 
tgccgtaaag 
tgaaaactgt 
acctccaaat 
tatagggcgg 
cacgggatcg 
atcattgacg 
gtgaatgggt 
agggaaaaca 
tcccagtaat 
accaactgaa 
tgtggcgggg 
caagatgata 
ccaatctggt 
tctcgaaggt 
aaccgagaac 
agaagatgtg 
ccattgcgct 
agatggattt 
atttccctcc 
aacagtgttt 
attttcgcga 
gggaaaatta 
cacattatca 
agcaaaaact 
cacggtcttg 
aatacttccc 
gaatacggct 
aggaggccaa 
gctgctgcca 
tgccttaaag 
gatggtatct 
tggcagcaat 
atcaccaata 
tggatcctgc 
agcgagtttt 
ttttctaaca 
tattttcgca 
ctggacaatt 
ttcaaaggca 
tattgctgtt 



ttttaaccaa 
gatgatgtaa 
ttgtagaatt 
agtacacctg 
cataatacaa 
gaatttctat 
ctagctacaa 
ttcccaacta 
tagaacccgt 
ccatcggcta 
ggggcaactg 
ctccagttcg 
ataccccatt 
tcctttttgg 
tctaatgttt 
tgtgtatacg 
atttgttccc 
tttgtagtat 
cccccgtaat 
tgtttcatga 
tagactatgc 
gttgagggat 
ccttcaaaat 
acttccatct 
ctggttattc 
gagactgtga 
ataacatcag 
tggaatgggt 
gcgtacagtt 
gcagtttgaa 
acgtttaaag 
gaagaaagca 
catggtctta 
gcagttgcag 
gtattatgta 
aatttatcag 
gttaaaacgt 
tctatttcac 
ctaaggtggg 
atattcttta 
gttttcgagt 
ggtagtttag 
gcagggtaat 
tagtttattg 
tccgttattt 
tttttatcct 
tttgttgata 
tgttgtttat 
cctgctggga 
cccccgtctt 
ggaagaggga 
tttccacctt 
gatgatgatg 
tcataaccga 
acatcagtga 
tttgacgccg 
gcagcggaat 
ggttttctta 
ttcataccaa 
ccccaccacg 
cttaacggca 



ctataaatcc 
cagatgcacg 
cttccctgtc 
agctagtgcc 
tatcatcatg 
ccactaaaac 
ttttcatgta 
tagcaaatag 
ttgcactacc 
accgagtaac 
gttgtctcaa 
tgtatctgtc 
ttcctaaaat 
ttccagactc 
cttcagaaac 
cttcactagg 
tcaaccagtt 
cattcgttct 
cagcgaatgc 
aagggtccct 
ccttgttggt 
cattccatat 
gcccgtccat 
ttacactgaa 
tcgaagaaag 
tgtggtagct 
atacggtcaa 
tatcactcgc 
tagtaggttc 
cgggggtaat 
cacgaggcag 
cagccgttat 
gttggtagga 
catctactct 
caacaacttt 
attgagtaac 
cagacgacgt 
tttcaggtac 
ttatctctgt 
acgtttcat t 
acacgaacga 
ctgctaaatt 
ctgaaacata 
ctttaggca t 
ttggtatggg 
ctgttacaat 
gtgttatacc 
tgccattgtt 
caaattcagg 
ctccttcata 
cattatctgg 
tttctaataa 
atgaaccatc 
tctgccgtgc 
cacgttttaa 
gttctaaccc 
catcgatagc 
tgtgtatcac 
tataggaaat 
agccaaaaa t 
gggttggttg 



acatgtggta 
gtcaataaca 
tgtacttgac 
gggttctatt 
gtcgaatgtc 
tagaccatcg 
gaatcttact 
taacgaggaa 
aattcctcca 
agggtcgccg 
ttgagaataa 
tgcaggttcg 
agccgcaagt 
gtctaccacg 
gccgtctccc 
tgctggcaat 
ttctattgaa 
tatggaagaa 
ccattcaggt 
tatccctcta 
tatgtcggtg 
agccaattcg 
cgtaaagggc 
aaccctatcg 
aacgttttct 
tttatcttca 
gacagtagtt 
tttcactagg 
tgcgggtgcc 
aatatctcca 
tagagatgtt 
tcgaggttct 
gaagaaatct 
aaagtttacg 
gttcgtgtta 
ttttgatgct 
tttgaagccc 
agacacacat 
ggcgtttaga 
gtatagtact 
cactgaaggt 
ggacgctatt 
tttttcattc 
atcaagcctc 
tgaagatatt 
ggaaattttc 
gtcagctaga 
gcgagagaac 
gaaagtttcg 
tttttcatcc 
taatattccg 
ttcttctgta 
tctatcagga 
agtgcgtttc 
ttcctcagag 
attcttcaag 
cgctgctata 
accagatctt 
aatagaaata 
acccgtgact 
ttcggcaaat 



taaaagttta 
acagggcctg 
gctacattgg 
ttagacatga 
gtcctgttcc 
tcatttttca 
ctttctgcat 
gtaaattcat 
catctagggt 
ctaggccatg 
gaacccactg 
tcagggtatt 
ttcttgtctt 
tatgtagtat 
atgtacattt 
aactgtcgat 
aattcagttt 
atgaaaatgt 
atgcttgtag 
ttagaaatgg 
gcgggtgtgt 
ttagatggta 
atctgtatgc 
tctcgtgcac 
tttgcaagta 
aggacaacat 
ttgttccttg 
ccgttatgcg 
ttaaatgtag 
cctcgtactt 
gtgcgttcac 
ctattatcat 
ggcccgttcg 
ctagaaccga 
gcattcacat 
ggaaatacaa 
acaatccagc 
ataaaggtcc 
atcttgtaac 
ttgtttaata 
ttgttaatat 
tcagcttgtc 
aatgtatcag 
atgagggcta 
gttacctgag 
agtctattcc 
gaatttactt 
gcattcgggt 
aatagagacg 
actaaatcaa 
tcaagaatat 
cggatattaa 
gccgattgag 
attatactgt 
gatatccttt 
ttccttttct 
ttcccgaata 
tcatcgggcg 
aacacaacga 
gctgtttctc 
caaggaaaca 
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aaaaagttac 
atgtatatca 
aatgactcgc 
ggaatacttg 
atatgcggca 
agaaagtata 
tggaaaaaaa 
tctatcgtca 
cagattgaat 
attcttacgt 
tgctactaca 
tgacataata 
tatagactgg 
ttctgtagat 
tatggcaaaa 
ggaaatttta 
atggtgcttc 
tgtaattttc 
aagacttctg 
caagactaat 
tgtgacagaa 
tattttcgac 
tgcagaattc 
aaaaatgcta 
ctcttcattt 
cgataacgat 
tatccgtaat 
agtcgagctg 
tctcctattt 
accggccata 
aaaaactata 
ccgttttaaa 
tacgataagc 
gattttgtgg 
aaaggggaaa 
cctagtcaat 
attcgaagag 
ccaaaacaag 
attccaagtt 
cacccacaat 
ccagagcctg 
atactacagg 
gtccgtaaaa 
gtgcatgatc 
agaagttgga 
tagccaacac 
gaaaacacct 
ctccctctac 
actcaagtct 
acccaacatt 
tgtggaagaa 
caagcat tac 
a tgccccatg 
atttagatgg 
aaagcaagaa 
tcgcaggaca 
aattgccacc 
tgttctcatt 
tgaagaggag 
aatacagatg 
caatgataat 



ccgggcatgg 
gacgtggaac 
ctagccaact 
agatacgcgt 
acagttgaca 
ctgaattcta 
ccttctgaac 
gagttggcaa 
tcagcaatag 
gaaaatgctg 
gatacaggga 
aagcgttcag 
cgccccaccc 
gatactgttc 
tttatcctgt 
tacggtaaca 
tggttgaacc 
ctccactcaa 
gtggtagttg 
ataacgtcat 
gacaccttaa 
gaaattaact 
tgcacaaaga 
gtaaaatcag 
ggtttattac 
gatgagacta 
aggacatatg 
aagcgagaga 
ccgtatgtgt 
attaaaagtg 
aatgaaaagg 
aataaaaaaa 
aacagggagc 
aaacaagtcg 
gtactcaaca 
cgcagcttca 
gatgtaaaga 
gatattgtta 
ctatacccaa 
aatgcccctc 
ctgacgtacg 
cgccccatca 
gttctggaca 
aacaagaaga 
agcaagatga 
ctatccgacg 
gcccatctag 
ggcgcaccct 
aacaagtgtg 
aacaacaatc 
gatgaaaatg 
atccattgcg 
tgccgtgagg 
ttaaaactgg 
ttctataagc 
tt tctcaccc 
aacgccatcg 
gacgggagcg 
gcgcttgctt 
cgccagtttg 
cctcatcgac 



cagggaatag 
aaggcatgga 
taaaattggg 
ctacgcccct 
tcaacctaat 
tacggagagt 
caatctcacg 
atgcaataag 
tgacagcggc 
taaactacat 
atgtagtcta 
aaatattaga 
ctgacaattc 
ttatagatct 
tcatatgtaa 
tttcttcttc 
tgtacaatag 
aaaaggagtc 
tggagagttt 
ggcctgttat 
tgagactagc 
ccttgacaac 
ttttgctcgg 
tcaaagagac 
tgaagaacac 
cagatgtggc 
cgtttaaaaa 
gtgacgccat 
ccgactggag 
tgtggttaca 
tacaatctct 
ttgctgccga 
aagaagcagc 
ttcagaacac 
tgttcactca 
ttgacgttca 
atccggaatc 
caatagttgg 
tcaagaaggt 
tctgggtaca 
agattgtgaa 
actatcaaac 
ttttaatgag 
acaacaccaa 
tgctccacat 
ctagtagttt 
caatccaaga 
ggtttcaaga 
tcaaggtact 
aactagaacc 
aagggaaaac 
aatgcctgat 
atgtgggcga 
ctgagagatc 
agatggaagc 
cagccagaag 
ctgaagccac 
gagaagaata 
ttggaagacc 
ctgaactgtc 
acatctctac 



aacccagttc 
gtgttgtggc 
cgattctctt 
ccttggggaa 
ggctcatttc 
tgtagtggct 
ccctcacccg 
ggacaagttc 
cttgggggct 
gtacgatgta 
tctttccacc 
taaggtatca 
gttcccttac 
catcaccaat 
ccatttaagg 
atccgataat 
actggaatgg 
cttttcagga 
tcctcccgtt 
taataacagc 
gataaggacg 
agcagtgacc 
gcgagctctg 
gggagaagaa 
aaaaaatgaa 
ttgttgggca 
aatatggggc 
tacatccttt 
ttgcttacta 
aatcctgaaa 
ttcatctgag 
acacgttcgc 
actatctaca 
t*ctcaacctt 
cttgacccga 
ccgagtggtg 
atccatatat 
tgattacatc 
catcgaacac 
ccttctggac 
tgccaagtac 
tgggaagagt 
atgtggatca 
ctttcttcac 
tgctgaaatg 
ccctgatgct 
agataatgct 
taacaactcg 
atcctttgca 
agataccatg 
aacaagtctt 
gggaatgtgt 
cgaagtactg 
tgaacacaat 
aatgaaacct 
aggcgaacga 
agctcaagga 
tgaagaagag 
atttctggaa 
tagacgaggc 
agtaaatatt 



gtatcgtccc 
agacaagcac 
aaagaaactg 
ttaaactacg 
tcctacgctg 
aatcatcaac 
ctgggagggg 
atcagcatgg 
attgccagtg 
gaatttgcag 
aaaatggacg 
aaacgacccg 
caattgattt 
gcgatcgtgc 
gcagttatta 
tattttgagg 
ttcatgttag 
gctgacaatg 
ctcttggaca 
aataataata 
agtagcggtg 
aaccgtatta 
gacgaagaag 
aaggataaga 
gaattggaaa 
cgtacttcct 
cttgaggatg 
gtcaccgata 
ttacatccct 
gatttttccc 
atttgtcaga 
agtgtaaaaa 
gaacactgta 
ctggagaatt 
gccttccgca 
gccgagttat 
agaactccga 
ctctctccga 
ttcccagtaa 
gaacgccatc 
aggggtattg 
ctactgatga 
tacaaattca 
tgctgtgcaa 
ttctttgcca 
gcagcagagg 
gatgcactcc 
tacatgaaat 
gctgacaagt 
tgtggagtgt 
tcctggtacc 
gctgctgctg 
gaaagatgcc 
cgtgtacttt 
cccagagttg 
gccatcagaa 
gatgtcaact 
ggagaagaat 
gatgaggaag 
gtttctgtca 
gtgcaaccag 



tgattgctaa 
aggatgcatt 
atgttaattt 
acaaacaaca 
ctttgggtat 
gtagaaataa 
tagaacctcc 
gggcgttgga 
aacgtgaact 
aaagagatgc 
aagatgaaga 
caaaggaagg 
ggggcgatga 
ctaatatttt 
ggagtatgag 
atggacgtaa 
tagttagatt 
ttaacgtgaa 
ctgaatgggt 
gtacactccc 
cccgacatcc 
ccttccagtc 
aagctggaac 
acaatacgtt 
taaacatagg 
cgacatcctt 
caagtgatgt 
agagcagtcc 
gttgtaaagc 
aggaaaatat 
aatcaaacga 
agttattaaa 
tttggttaac 
ttcccgtata 
agatgaacaa 
cctatccaga 
tatccctctt 
agacggactc 
tcttccactg 
atcgcctgct 
ttgtcatccc 
gcaaactggc 
tctcattaat 
gtaaatgggg 
accctactac 
acgacaaggg 
tgttcctgat 
ctgcccttga 
acgagatttt 
gtgcaacatc 
agatgaattg 
gcaatgtaca 
ctcctacaat 
ttgaagcaaa 
ttgttcctcc 
tcgcaagaga 
cctacttccc 
tcttcaattc 
aagccagaca 
atattattaa 
tttatggagt 
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119520 
119580 
119640 
119700 
119760 
119820 
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121440 
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121560 
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121800 
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121920 
121980 
122040 
122100 
122160 
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122340 
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122460 
122520 
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tgaaaagtca 
tatacgctca 
tgacaagagg 
aaactctcaa 
ctatatcgag 
cggggcccag 
agaaatggca 
atccccactg 
gaggcgtaga 
tctagcttca 
cattctcatt 
taatgcactc 
aaacttgcgt 
ccgcttgttc 
catgaagatc 
tattattgaa 
atgtttttca 
ggcacacttg 
ggctatccag 
ttatcttccc 
tcctacacta 
gaaactcttg 
agatgctgaa 
attgaaaaaa 
gtacattgat 
ttcatcaata 
cagaagaacc 
aattggagca 
tgcatacaca 
tgatgcatat 
attgagtact 
acctttccgt 
ctttaatgaa 
aaaacaccat 
agaacggtct 
tgtaattgta 
aataaataca 
tatggattgc 
tcgtataagg 
caacaagaag 
agaacctaag 
gaccctcgaa 
acttggacga 
tatttctaac 
taactacaaa 
aaagatgaag 
tgctataatt 
tggtggttgt 
tgttaaagtg 
cgaacgcaac 
acgagaactg 
taatgtccac 
aatacctaaa 
gttgaatttt 
gttagtatac 
aaaatggtgc 
aacgggggag 
tcacactaag 
ttctctcgat 
tgacggaaaa 
aacaaccaaa 



cctgctgctt 
agagatactc 
gcattattcc 
attgttggat 
acgtttggta 
gactattatg 
agtgagttgc 
ttcaagaaaa 
aattatgctc 
cgcgaacacc 
gaatatatag 
cattgtatga 
cctactgatg 
aatgaacaat 
ttcaaccatg 
atgtactgtt 
tccctcctcc 
tccagtagcc 
gaaacaatcc 
cataaaactg 
gaaatgagga 
gggcattttt 
acgtgcgtgg 
ctagccctga 
agaaattcaa 
cagaacactt 
gagtgtcttg 
gccagaaagg 
gaaacgtttg 
tttaatgcgg 
ttgcacaatt 
cttcctccct 
gatacgacac 
acgcgaaaaa 
atatttttac 
ggatctacta 
gtcaaatcta 
agatcttatg 
gctgcaattg 
aaaacgagca 
tggtacaaca 
caacacgtcc 
gaaaagaaat 
aatctcaagg 
tcttcttcct 
attatcgcct 
tcatcagaga 
gcgttttcaa 
aataaataca 
aagtgtaggt 
tatagggctt 
ttgacatggc 
tgtaagctac 
ctgagaacga 
ggagaggagt 
tcgaaaaaag 
gaagataaag 
gatataaaaa 
aaaaatgtgc 
acgtctatgg 
aaacgaaaac 



ccttcatcta 
gagttggagg 
acgcagcttc 
tggactttca 
gtcctcttca 
cagagagtat 
acatcaccga 
tcaggactgg 
gagatattgc 
cccctgttca 
tagatttcat 
gaaagtacat 
aaagggtgga 
acaacactag 
atttgggtgt 
ttactttaat 
tctgtggtgt 
gttgtcgagt 
agtctgaatt 
gcatctgcga 
acgaagttga 
gtggcaactt 
gctttttgag 
ttgcccgtga 
tgatgaaaca 
ctcacacaaa 
ccctggattg 
tttgtttcct 
ttcttgatac 
agaataaaga 
tatttgttag 
ttggacaaca 
ctattctagc 
atagtagatg 
gtggagtttt 
taatgcctag 
ataacttttc 
ggtttgagat 
attgctcgaa 
ttattaatgc 
aaaatgccat 
gtgttagtga 
ggacgtgtaa 
gtaaaattga 
tatactataa 
ctataaacaa 
aaaaggatga 
agtataagaa 
ttgaggcttt 
ttaaagaggc 
cggccagaag 
aacgccctta 
acacagtatt 
gggatggaaa 
acgctaaaga 
gtgtgattaa 
acgaaaggga 
aaatagaaaa 
aaacccacgc 
tgggctggga 
tggaaacgat 



caacatgctc 
agaaagagtg 
ttccatgctt 
agcagtgtat 
cgcctaccca 
cagatacgac 
agcattagat 
aaaatcttac 
tgaggaattt 
tgtacatccc 
ccaccactgc 
tgaacacgag 
agttttaagg 
aatgtccctc 
gtctaaattt 
ctaaacaata 
gtatatataa 
gaacatcatg 
ggaagaaatt 
agaagctgca 
ccatttctgg 
gtatgtggag 
gtcgcaagca 
ggataaatca 
agtttttagt 
atcttccccc 
gctggacgca 
ccagaacctg 
aggcaacgaa 
taaagtggat 
gaaaagtctt 
ccctatcatc 
gtcaatttct 
gagatgtccc 
aacagtcagt 
tgtacttttc 
tgccataacc 
tatagacaca 
cacagatttc 
ggtaaagaac 
gtgtggtgaa 
tgagtatatg 
ttatctagat 
caccatgtgt 
gcaaataact 
gaggaggtat 
ggtggaggaa 
aaaacaactc 
ttcgttaatt 
tgaaattaga 
ttatgtccac 
cgatgaaaac 
gtacgataaa 
ttataaccca 
tgtcagtact 
atatgaagac 
attctttaga 
tgtactaaat 
atcttctagt 
ttatattttt 
agatatatcg 



aagaatgacg 
cccgtcatga 
tgtgactttg 
gatccccatc 
ggagccgtca 
aatgatattg 
gtctttgagg 
tctaactgga 
gttagggtat 
tttagagatg 
atcacctggt 
aatacaaatg 
gtatctcaac 
agcaccaaaa 
ggtgtataca 
aaaataatgt 
ggcgggtagc 
gcagcagacc 
gccgatactg 
gctaatggac 
tcccaagata 
gcatttatag 
actggactag 
aatacaacta 
gctgaaattg 
gtgtacttga 
tccaagagga 
attgttgcca 
ctagaacagc 
gaaatgtgtg 
ccccatcatt 
aacattgaaa 
ataccatcaa 
aataacttga 
ggagattacg 
tatggcgacc 
tgttcttatt 
cctgaaaata 
cattcaccgg 
ccttttttta 
gtattggaaa 
gacagatttg 
agaataaagt 
aaaattctag 
gctacgtctg 
ttgtgtaata 
gatcacacta 
gaacctaaac 
aagatgctga 
gagtgcgcca 
gatctggtat 
gctaacacta 
gattcgcgcg 
ataagacact 
gttacctgtt 
tttttggatc 
ctaaaaaaat 
tctgatacac 
acagttgtaa 
tcaatcggta 
agtagtgacg 



tctttgagtc 
acctgtccaa 
ccacagaaac 
acatatccaa 
cttttctgga 
tctcattctc 
gtagtttatt 
acgaccattt 
gtgaaaactc 
gagcaatccc 
ctatgcaagt 
tgcacctgtt 
tcagatggag 
gattgagcct 
aactcctaga 
aaagaattcc 
ctcctccaca 
tcctagagt t 
aattcctcaa 
ggccatatct 
acaggaagct 
ctggttctat 
gataccctct 
attacaactt 
ataagcgtcc 
aactgatcga 
cggccaagga 
tattaatacc 
aagtattgga 
tagtggccat 
tgtacaatgc 
attcctcatt 
gtatggtaat 
tgactgcagc 
gatggttttc 
gaaaacatct 
ggaacaagta 
actgtggttt 
taacgcgagt 
ttagacacac 
atgttggcgt 
gtagtctatt 
ctctagaaac 
aaaccaagta 
atgatcctat 
ttctagaatt 
aaaccggcaa 
aacatttaat 
gaaatgattg 
acgaactggt 
tgaagcgaac 
tcatgtcttt 
atgttaagt t 
ctatgctgga 
ttgaatggtt 
gttacgagaa 
gtagtagaga 
tttattctta 
aaaatgacac 
aaggagaaaa 
acgacgatga 
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agaagaagag 
caagaacaag 
agaaccttct 
tttgttatga 
acacctgagc 
tttgtaaagg 
gaaagaagac 
tttcttgctg 
tgttgaggca 
gggatctact 
aacgctggag 
gagactaata 
gtttaaaaat 
gatatacaat 
agtgttgttt 
ggtccttttt 
cttttgttcc 
agcttccttt 
ctccatcatg 
atcaagttct 
attcataaag 
atcacttgga 
gttttggtgt 
gatgagtctg 
cctcaaacca 
gtagaattct 
cgcattggaa 
gaaaagttgt 
ttctagttcc 
ttttgtggtc 
caactcttcg 
tgtcgaaagg 
cgccaccata 
ggctatatat 
atccttgaaa 
cattgtactg 
tttttctact 
gtccttgtct 
ttcttgcgtt 
tccaggttca 
ttcttgtaat 
aacaaatgtg 
catgacttgc 
taactccttt 
ttccatattt 
ggttacaata 
gttcggccca 
actccttttg 
gtatattttt 
gttattttac 
catattttta 
ctcagaaata 
atcaatagca 
attacgggaa 
actcttgaaa 
ggccacaggc 
taatttttcc 
aacaaaacgt 
agttggactg 
tacttcctgt 
gtttttaagt 



gaagaagatg 
agcaagaaca 
ctacctaata 
tgttaaaaat 
ggtttgggcc 
tgctgaaatg 
ttcttgaaga 
gtgggggatg 
gtcaggaaga 
tttgtcctca 
attcaaccca 
tctttcttag 
ttgtcttggt 
atcccttttt 
ccatcttctt 
actccccttt 
tccctcttct 
gtggtcaaat 
gaatggagct 
tgaggcgaag 
attcggatat 
ggggtctgga 
ttgatactgt 
ggcctcatac 
ttttctgttg 
gtagaggata 
gatttcttgt 
tcagtttcga 
ttttcaactt 
ttttcgacct 
cgtatctggc 
tacatttcaa 
aacaagggaa 
tctcctttgt 
ccatgtttct 
aaacctaaac 
gcttcagaaa 
tttcgtttta 
gcaagccatt 
ggaggggcac 
ctttttaacc 
tcttcggaca 
tcatatttac 
tctgatacac 
atcttcttca 
ctgtggattg 
ttttgcacca 
aacaaattca 
aaaaattagg 
cttaattttt 
aaatgaaggt 
tcctcacata 
aacgctaacg 
aacatgtacc 
atggggttta 
acccatgtat 
atgacatcat 
tccttttccc 
aaataaaacc 
acgtctttgt 
ataacgtgtc 



aaggaaaaag 
agaatgggag 
ctttatcgtt 
aataaaaata 
tgttgacgtg 
tacttgtttt 
ggaccgataa 
atactagtag 
gtgatctaga 
atggaggcat 
tgattgtgcc 
tttcccgtca 
taaaaaaata 
acacagaaat 
cttcttctac 
gctcctgttc 
ttaccttctc 
gtctttcctt 
tcctttgttg 
agatgtctat 
cagaacctgt 
aaactatact 
cgttgcaagc 
ctgcagtatc 
cagtttcaaa 
tcgagaagac 
gtacatttct 
tatacgagtt 
ttgtggtctt 
ggttttctag 
gttgcttttc 
tgcttgcagc 
caatcaagca 
aatgggtacg 
gcatagttct 
cgattaagaa 
gactattgct 
atttttcctt 
cctcgtcagt 
aacttatccg 
ctcttgcgaa 
attcgctttg 
ttgttttctt 
tttcattact 
aaacaacaac 
ataattttac 
gatctgtttt 
gatttatatg 
atggggacag 
tttccctata 
tatcatatca 
atcgcatgca 
ccaacagata 
ctaga tacac 
caacaatatc 
atttcagacc 
cagcactaaa 
cgaaaggtat 
ctctaaacaa 
cttgagaagc 
ctttctcagg 



aatgaaaatg 
aatgtgttgc 
caattgtgta 
ttgtacattt 
cctaactttt 
ttatccaaat 
aaaaatggcc 
atatgaagaa 
tgaccgtgtt 
cgaagaattg 
aacaacagaa 
cggtgaaaat 
aaacgaaaac 
ggcatgtctc 
gtcgctatca 
actggtagac 
aaaccacttt 
gttgtaccta 
ttcaatttca 
cctctgccca 
cactttttgt 
tgatcggtta 
gataggtgcg 
ccttttcttg 
ggaaccactt 
gagacaagag 
gagagattct 
tttgtactcc 
tactacttgt 
ttcctctttt 
atcactgtat 
agaagggtgt 
atgtgaagaa 
gtgacaaatt 
ctccattgac 
atcggaactt 
catgtacctg 
gactattgta 
aacgtgccca 
ttcaattctt 
tttgagttgt 
tcttatagat 
cccgatcatc 
aacagggtta 
ggaaaataca 
tcacagcttt 
aaaggaactg 
tatttgtcga 
atggtgtaga 
atacaaataa 
actaaacact 
agacttgcgc 
cgtctccact 
cagtggaata 
acccttgtat 
acttttccgc 
accgagacca 
ttgaggaaga 
catccgtgtc 
aagccactcg 
cttaatagag 



aataactgca 
acagatattc 
aaaagtatgg 
tattacattt 
tgtcaccctc 
ttcctgtact 
accttccaga 
gtgatgaaga 
tacatggtgt 
cgtcttttga 
taaaataaag 
gttggttatt 
tgtcaatata 
tttacacgct 
aattcattgt 
tgttctccat 
tcggtcttgt 
gtgacaattt 
aattgggtca 
tcttgataga 
acaataccct 
cattttaata 
tcacagccag 
tttatggggg 
ttgtctgtcc 
ccgctgtcgt 
ttttgttcct 
tctttcttta 
gtttctagtt 
atttgtaaat 
aagataacag 
gcgctccttg 
tggctcttga 
tcaaccaatg 
tttaaagcgt 
attatcatac 
tataattgac 
ttgattttac 
gcatcatctt 
aaagagtcat 
ttatctttct 
tcgatagctt 
aggaatgacg 
ttgtttttct 
gttttctttt 
gttcaaaact 
ggtaaaatac 
tttttattaa 
ttgatacagt 
ttggtatcag 
tttatttacc 
atttcttccc 
tctagagatg 
actaggcaat 
tttttgagat 
atatttccct 
atcaaaaaat 
ctttttgtta 
ttttccttta 
tcatcagtag 
gataataatg 



gcagcagcat 
tcaatgttgt 
atgtgttgaa 
gtttgttttg 
atgaatacaa 
gaagaatatt 
ctgacgccga 
cttttgatac 
gcctaaagca 
ctggagattc 
acggtgacgg 
tcttccctat 
ttgttttatt 
gttgttcttg 
cgttttccca 
tttctcgctt 
ttctcgtacg 
ttgatgtgtt 
tattttcttc 
cggtgccatc 
tcaaatcttc 
ccttgaactt 
tcacggccat 
agaaaatgta 
tgcacaacaa 
atctgaaggt 
cagagacttt 
ctacttgtgt 
ccttttcaac 
ttttcctagc 
cgtaggctag 
caaaaacgcc 
aaataggatt 
ggaccatcat 
gcgtgacgtt 
aatcgtgttt 
tgaatttaaa 
gcacgtcttc 
tgaataaatg 
cttctaccaa 
tcacatgtac 
cataaacacc 
atcttagttt 
ctccaatgta 
aagaaatata 
taactacgtg 
acatgaaaac 
tattaataaa 
gaccgtccct 
tttaggaaaa 
cccttttcac 
tgccgtacaa 
ggggttgaac 
gggaatgata 
ggcacacttc 
ctatcgctat 
ctgtgctaac 
tatacctgta 
cgagcttatt 
ttgtagacgt 
attctatctt 
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gagggaattg 

tttattttcc 

ttcaactgct 

cattagaaaa 

tatcagtttt 

ctttgacaca 

gttaaaagtt 

atgaggtagg 

cacataataa 

tcaatatgtt 

cggatattaa 

gagccaaggc 

aggctattca 

tcgtggaaga 

acgcgtgggt 

attctttgta 

atgagataga 

ctggggatat 

aaatatacag 

ttaaacttgc 

aaaataacaa 

gtgacactac 

aatatgagga 

tattgttata 

actctgtttg 

tcttccttct 

cttttaggaa 

acggaggaac 

gaacaaatgg 

gtggtggaac 

gaggtggtgg 

atggaaatgg 

aacgtctact 

cagaggtaga 

atgacgacca 

tgttcaagct 

aagaccgtgc 

gagaaaaaca 

atttttccaa 

taggagaaac 

tagtagagga 

cttacaaaat 

attgtttggt 

ctaacagatg 

atgctaccat 

ccaataagaa 

tagaattaaa 

ccgttccaag 

ttggtgcctt 

tgattgtttt 

cttttaaggg 

attgttgaga 

gttgatgaaa 

atcaatgaag 

actagtagtg 

agagataaaa 

agatatgtac 

catgccaatt 

attaataacg 

tccgagtggg 

gatttacagg 



tcccctgtga 
tttaattgta 
tcaaatgtac 
gaagagcgta 
tgttccccta 
gtttttatgg 
tgagtgtatt 
tactgattcg 
aaagggaaat 
gtttgatttc 
tgggaatgat 
tgtcatggat 
ggaagccgta 
aactgacagc 
agacagagcc 
tgataaactc 
tacaccaata 
cgaagaagag 
agatgaaaga 
taaattggat 
tgaatataca 
taaatcaggt 
gcgagagatc 
ctttttggtg 
aagagaaaaa 
tttcgtctca 
aaggaggaag 
tggaacaaca 
aaatggaact 
aaatggaaat 
tggaacaaat 
aggggataca 
taattccata 
aactgcctta 
attagaattg 
cgcaggtgtt 
ccgtaatata 
catcaacgat 
tagtcttaat 
tattgaaaag 
ttttaatagt 
cagagtagta 
ttttgccgta 
ccaagacata 
ggtaataagg 
tgacactact 
catgctcatt 
acctttgact 
tacagacaaa 
attttcaaaa 
aattaaaggc 
gatttgtcac 
aatacgatga 
catccgaaac 
ttattgctcc 
ggaaagaata 
aagcaattac 
ccgatacata 
accgcgccag 
gaaatgataa 
aaaaggatct 



gctctcttaa 
caatgaaaac 
tcattacttc 
gctgtaactc 
tatactccat 
tactctggga 
cttttgtggt 
tgatggataa 
aaaaataaga 
tttctaaaaa 
acagatttat 
agaaataggg 
gcggccaaga 
ggacaaagtg 
atgccttcgc 
cctcctttta 
tcttaccttg 
tacaatcttg 
gaccagattt 
agtatatcag 
gagtttgtct 
ataggtattc 
tttacatttt 
gacgtcaggt 
atgttcgtca 
ataacaattt 
aggaagagat 
aacggaggag 
ggaacaacaa 
ggttctggaa 
ggaggaggaa 
gacacagacg 
tcttcaaaac 
caattatctc 
gacgcctcag 
tcatcggcct 
aatgaggaag 
gaatacgatg 
cttgttacta 
gctttccctc 
ggaactatta 
gaaggatcaa 
gtagtaaata 
tgctttgtaa 
aaaggtgatg 
catttttcaa 
ttctcagaaa 
gacgccaacg 
aatttattgt 
aaaataaaac 
agaggctaat 
aactaaatct 
attattagaa 
atatgataaa 
tgcaatagct 
tgaaaagagg 
agtgatggaa 
tttaacattc 
ccccgtgaac 
cggagttgga 
ttacctgtct 



ttctttcagt 
gccgtctcta 
ttcatacttt 
gtcttcttca 
ggtttattta 
ttactttact 
aaactaacaa 
cgatggcgtc 
actagtaaca 
tggctgctgc 
ccaagctaat 
ctaaaatgga 
aacaaaaagc 
tccctccaac 
atattgaact 
acgtacaaga 
ccatggtagt 
ctcctacctt 
ttacaaaggc 
gtaaaagtag 
gtagcgtctt 
gtgaatatga 
ttattccaat 
cccgtattat 
taagcatagc 
tagatggtgc 
atcgtaccag 
gaggaggtgg 
acggaggagg 
caacaaacgg 
atggaaatgg 
a ttttgagcc 
ctaaagaata 
gagatgattc 
acactctaca 
tt ttagaagg 
aaattgcaca 
gaaaatatgc 
aatacacaaa 
atgagattga 
cttcaaacac 
caacaga tec 
aggaacaaca 
t tattcctcg 
aaattaaaca 
tcatcacaga 
gaattctccc 
tactgtcagc 
ccagccaata 
accatgggag 
gatgattggc 
gatgagacag 
gataaggttg 
ettgetgetg 
ggcacagtgg 
ctatggacat 
tttcgtttat 
ccttttttaa 
tgctccttgt 
cgtaaggttg 
gttatttgta 



cccccttcaa 
aattcactct 
tcatattgtt 
ggtgtctcat 
gtcagttgat 
caaaaattta 
tgaagtacat 
ccatgtctag 
tattttttat 
aaaaatggac 
cacagaegtg 
catgaataga 
attagtggta 
attateggga 
tgtagagagt 
catagacgac 
ggtaaaagtc 
tggtgtgaca 
tgataaatct 
acaactgacg 
tgcagagttt 
caaacctaaa 
acaacccgca 
ataggactat 
tacgtcattg 
aaaaacaatc 
tgagagtggt 
aggagaagga 
aggtggagga 
aggaggaggt 
aggaggaaat 
tacgccagcc 
ctatgaagca 
tacccaaact 
aggaaaacca 
taccactatc 
aacaatatta 
cacacccgag 
ccatgaagtg 
gtttgagaga 
tatgeagtae 
aggggaagtt 
ttctctagaa 
tttatccgea 
agaaacctat 
caaggatgaa 
cactttgagt 
etaeggaaag 
aaaaaaatat 
ataagcaaaa 
tateggctaa 
cccaagttgt 
aagaaatgag 
atatgataag 
caagaactat 
tagectacaa 
catataaaga 
gaataaaaaa 
cagtgtctta 
acatacacat 
tgttagatac 



aagtaatgtg 
ttcggatagt 
tttctccaat 
ccaggagatt 
ttatttttct 
ctcacatgtt 
ttcttagtaa 
tgatgtgagt 
tttatccttc 
gctatccttg 
attcaaaaga 
agagtagatg 
tttgataaac 
tccgattacg 
gttgagggag 
caaateggtg 
gactgtgaaa 
caaaataata 
gtgcgtattt 
tatgeggtaa 
gaatctgaca 
aatgaattcg 
gggacgaaat 
aaggcacaac 
gtgttattct 
gactctcaac 
gaeggcatag 
ggtggtggtg 
gaaggaggtg 
ggaggagaag 
ggaaatggaa 
cttctaaaag 
ttcgtatctg 
ataataattg 
agggattatt 
egcaaagegg 
agtcaactaa 
gaaagagctg 
ggtcttttag 
tgeataatte 
aggtccaacg 
gtccctgatg 
atatctgeaa 
ataggaaaaa 
ctgtttgtgg 
tctgttggaa 
gaccctgcaa 
cgcctaggtg 
aaaaaaagat 
ggtggaacag 
tgttgatcca 
taaacaggee 
aceggatata 
agaggtagac 
caataattta 
accatggaga 
cctgactgtc 
gatcgectat 
tcccaacaag 
cagaagaaat 
tgatttttca 
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ggatatgata 
actatgtttt 
atatgcacta 
aatgtctatt 
tataaaaagc 
agtcaacgag 
aaagacacca 
gttataaaaa 
tcaatcaggg 
cctgctgcaa 
gataccagtc 
gaaaagggga 
gcctccagtt 
gataaattag 
gcagttaatt 
aaattgggga 
ttaagatatt 
cgaaaagagt 
acagaccctg 
gattttttct 
gaattaatta 
gatttaaaac 
tggatagaag 
acgaacaaaa 
gacaagcgct 
ccccctcccc 
gctgctagtc 
agttttactg 
aaaattgaga 
agttctatta 
aataaagtag 
agagtactag 
aaggatggaa 
cagaattttg 
tgccctgata 
gaaatttcga 
tatttcatct 
cgcctttcac 
cacattttaa 
gtaggtgatg 
aggacagctc 
aaggtaaaca 
acatcaaaaa 
ggtcccacta 
gacctgcttt 
tttttaaagt 
agcacatttc 
agaccctggt 
ttactcttcc 
ggtggaccgc 
tgtcagattg 
ctggaacagg 
tacactagag 
cactcgagtt 
gctggctcgg 
aaatttctac 
ctgtaaccta 
aaaatttttc 
ttcggagggt 
actcccatgt 
gaagccttag 



aagctgtgga 
tacccaaaac 
tagtgttccc 
acaggatcac 
gaggttggca 
tacagaggag 
agttaagagt 
gtgctctcca 
aggcttatct 
aatcattaaa 
ctagaagtat 
attttaggca 
ctagagtgtg 
ggggtattta 
tttctcccat 
agaaaagtgt 
atggggataa 
ttgagcagtt 
aattacgttg 
gctcaaaagg 
tagccatgtg 
gtttcaataa 
gacatgttaa 
acacacatgg 
ggtccccagg 
cagaagtggg 
attgcgtaac 
ctaatattaa 
ttaaggatga 
cttctactcc 
actctataca 
aagtacacaa 
ataaagttgt 
ttcaagtact 
gcggaccata 
gtacagctgc 
ctgggtgtgt 
atatgggacc 
tttttagatt 
gaggaaactt 
tcgaaaatag 
aggtaacaaa 
gtgataaaat 
ctttggatat 
atgacaacaa 
gtataaaatt 
atcccttttt 
taggtccctg 
tccatctcaa 
tgggtcggcc 
caccagaaac 
cgtttctgga 
aatttttact 
tagagggtgg 
gccaatgtca 
tgtttctgga 
ccatatacac 
tgggtcactc 
ggacctctgg 
ccagaaatat 
gcgagtcatg 



agttgatgcc 
ctctaatcta 
tcccgcttct 
atgcacatgc 
cagactatca 
gatcggagtc 
atcatatctg 
gaaaggatgc 
ctattaccat 
tacaaatatg 
ggtggcttct 
acccagttac 
ctcccacatg 
ttatgctaat 
agaaagaatt 
acagttatta 
agtgaaagat 
ttgggggtta 
ttattttaat 
gttctttaga 
cataggtgac 
aggagaagaa 
acggatgcct 
agtatctttt 
ggtgtggctt 
ccaatttctg 
gaggaatatt 
ttctgaaccg 
taatactact 
acctcctaca 
attgaataat 
ccaaaactct 
gtttaaggag 
aaaggatgaa 
cagaggatta 
taggatagaa 
gactagacaa 
aaataaacaa 
gttgactggt 
gttctctgtg 
aaagataaag 
ggaagatata 
aatgaatggt 
tgtaaagaat 
ataaaggaat 
tttctaagac 
ccccttcttc 
ctaagggtag 
aaacttttaa 
tgatgtcaag 
gtctgttgct 
cacaccagca 
cttcctccat 
accgctggct 
gattgcacca 
acaggcgttt 
tagagaattt 
gagtttagag 
gtcgggccaa 
ctagatcctt 
tttcttgcac 



cataaatttc 
tttaatcgat 
gccagttctg 
tcctagaggg 
aacattcagc 
ctcccagaag 
ggcgtgggac 
aggaggaacg 
cttggattga 
gtaaacagaa 
aatgagtgtg 
ttgatggatg 
agagctttgt 
tttaatgaac 
aaacacgtct 
aatttaagaa 
acaaataaga 
tgctttaaat 
gagttgacat 
gaagaaagtc 
cgtaaacagt 
gggagaaaag 
caaatgcctg 
gccctagaga 
cattcttata 
gatcaggctt 
tgtactacta 
atggaaatta 
gctactgtta 
aagaaacaaa 
ctaccaactt 
aaaaagggtg 
atgagaaaaa 
gatgtgtgta 
tacagatgct 
aaagtgaaat 
gaaggaattg 
tatgtgtatg 
gtgagtgata 
gacgagaatt 
gaactacaat 
gatagctgtc 
gtatgtaata 
aattgtactt 
aatactttat 
aaaagtgagt 
ttcttcttct 
gctgtaacct 
aaaatttttc 
ttcggagggt 
ccagaaacgt 
cccagaaagg 
ctcaaaaact 
aggcctgatg 
gaaacgtctg 
ctggacacac 
ttactcttcc 
ggtggaccgc 
tgtcagattg 
tctggttctc 
catgtatttc 



attttgaagc 
cgcatatagt 
cttctacaac 
ggggggtggc 
cgtaaaccat 
aggctgcctc 
attggggata 
atgaggatat 
attacattga 
ttaaaatcat 
tgagaactct 
ctgccatgag 
gttgcaagga 
ttgaaactca 
tcagggagat 
gtgtagcagc 
aacatagtgg 
ttgttactca 
atgccataaa 
tattccttac 
ttgccaagat 
aagaggctgc 
tatgggttct 
gtagtatggt 
caaagatgcg 
ttaatacatt 
caggatttat 
aggaagaaat 
ctgttagtgc 
aaacaactcc 
taaatatgga 
tagctgctac 
gctttggttg 
aattggacta 
atttcaaaat 
ggggagaaaa 
ggaaaattat 
ataactatag 
ccaacacatc 
atgtgggcgc 
tgctactgaa 
ttccatcttg 
ttggtaaaaa 
gtattttagg 
acaatttttc 
ttgagggttg 
tcttcttctt 
accatataca 
tgggtcactc 
ggaccgctgg 
acattaaatt 
gtaggctgta 
tttaaaaaat 
tcaagttcga 
ttgctccaga 
cagcacccag 
tccatctcaa 
tggctaggcc 
caccagaaac 
gcccaccacc 
taggaatttc 



agggaacagg 
aaactctaaa 
agaactggat 
tgtgtccttg 
ggaggaagaa 
ccagatcctg 
ctctgtttca 
aactgcgtgg 
aaacgtaaaa 
agctgtggag 
agaaaaatat 
gctagttcac 
agaggacagt 
gtgtgttagt 
tgaaagtgta 
ttaccatgtg 
accattcaag 
gcacgtaaaa 
ttggagaagg 
atctattgtg 
tcagaaaagg 
tacatttgat 
ggacaaacat 
ttctggagga 
tctagattct 
gaagcgagaa 
aaaggcttct 
aaagaaaaga 
tactactagt 
aagtggaagc 
ggatttagat 
agttttaatg 
gggttctcat 
tttgttgcca 
agtaaaggat 
tgccatgtgt 
aacagatgtg 
gcaattaata 
aaacattctc 
aaaggatccg 
gacttccttt 
gctatttgat 
tatgggaatt 
ggtcgtgaat 
atttccttcc 
gtcataatct 
cttatttgaa 
ctagagaatt 
gagtttagag 
gtcgggccaa 
tctactgttt 
acctaccata 
ttttctgggt 
agggtggacc 
aacgtacatt 
aaagggtagg 
aaacttttaa 
tgatgtcaag 
attgcgtact 
agatatctgg 
gctgacgtca 
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ataaactggt 
catctcgccc 
tgtgccgatc 
atctgtacag 
tcgcagccat 
tgttggaaca 
aacctaggaa 
acaaccacaa 
ctctactcaa 
acagggctaa 
cttcggttgc 
tcatgagtga 
ctcttcttgg 
atgcccgctc 
tgctaatggc 
aatatacttt 
caata tcagg 
atgaaaaaac 
cttcaaagtg 
caccaaaagg 
atgtgttgac 
atgtaaaatg 
taaaaagatt 
ctatgccggc 
ttgatgattt 
gacagcgaaa 
aagatccaaa 
aaaagcagaa 
cagccatcag 
acatggctgt 
gtgcagtcga 
ggatcaagcg 
aagacgcgag 
aggaggaaga 
aaaataacaa 
ccaccaacga 
ctgttgatat 
acaagaaaga 
gggcaatgga 
ggtgtaataa 
cctattctaa 
ctagcaccaa 
tttccttcaa 
atttctcgtt 
cttgtgtctg 
cttttcttca 
ctacagagag 
gacgtccaac 
ggataagtat 
ttctccttct 
agaaggagaa 
taattctgca 
actaaaccac 
ttcaccagag 
tctaggtgtg 
aaatgtagtt 
ttcttcgtca 
ttcgtctctg 
tatcaactct 
gtcgcccgaa 
tgacaacgca 



ttgtcatcct 
acacagaaac 
tgggtgtata 
ctagcagcag 
ggcagcagca 
actcaccccg 
gatctttggt 
ccaaggtgag 
ttgcatatgg 
aaagtataga 
cgtgatccct 
gattattgac 
aatgatgaac 
aactaaaccg 
agctgaagct 
tcctggtttg 
agatgatgac 
tcgcggtcgg 
tctcagtgtt 
agaatttgaa 
taaagggtct 
tttgaggaat 
tgatgagagt 
aggtgctgga 
cagaaaattg 
tcttgatatg 
gaaaaacgaa 
caaggaggaa 
agaaaagcag 
cgcagccatc 
catggctgtc 
tgcagtcgac 
gatcaagcgt 
agacgcgagg 
ggaggaagaa 
aaagaacaag 
gaggattcaa 
agaagagaaa 
tatggtgaac 
ggatgaagcc 
agaaattgta 
ctattctgcc 
cgatgca tct 
caagacataa 
gataaaatac 
actcacacat 
atggtatata 
agaccaacaa 
gatacgtatg 
tctgctcgta 
aaattccatc 
acaattgaac 
gagggagata 
catgatattt 
aatctacgca 
gatgtgtgcc 
t tctcaaaaa 
ggggaatgca 
tattttttct 
atggaaattg 
agtagaaact 



gttttatcca 
aggatatgat 
aaagagcgct 
cagtagcagc 
gcagtctcag 
gacggagacg 
gacaaaagtg 
agttaccaaa 
aatggagagt 
cttctcaact 
atctacgggt 
aaggaaactc 
gccaagcatg 
tttgaaaatt 
atttatgatg 
gaatctgctg 
aatatgttgt 
gtaaatagat 
ttaaagtttg 
gaaaaggccg 
gaacaagaat 
atttgtatag 
attttgagaa 
aatgaagaag 
ttgteccctg 
ttgtgcccct 
gatggaaaca 
gaagacgcga 
aacaaggagg 
aacgaaaaga 
gcagccatca 
atggcggttg 
gcagtcgaca 
atcaagcgtg 
gacgcgagga 
aaggaagaag 
cgtatagtcg 
aggacaaaaa 
gaagtacaga 
aagaatacta 
ccttgtttag 
aacaatacta 
agattctcca 
aataaaatgt 
cacacatctg 
ctattttttg 
aaggctttgt 
catcccctct 
aggatatctt 
aaagtgaaat 
ttgcaaaggg 
ctattatttc 
tatcctcaag 
atgaattgga 
acgtttctaa 
acagaaaatt 
atgtgtgtat 
gggatttatt 
accctgaaaa 
cagatgagga 
ggtctggtgt 



gtttgctgac 
atcatagatt 
gcggagaggc 
agccaagaga 
gagaggggag 
tgataagata 
tgatagagac 
tcgcatcttc 
cgggaggaat 
cacttttggt 
cagtgtgtga 
ttcagaccat 
gcacattggg 
tccaatatga 
gattcagaga 
atgtgtacgg 
tgaacctcat 
ccttgttaga 
ttgaaaaaca 
aaacatgtgt 
cttacaagct 
tatcacaaca 
agtgtacacc 
aaatatgttt 
tctcaattcc 
acagtgacca 
gagtgagggt 
ggataaagcg 
aagaagacgc 
acaaggagga 
acgaaaataa 
cagccatcaa 
tggctgtcgc 
cagtcgacat 
tcaagcgtgc 
acgcgaggat 
acatggcaat 
gggaacaaga 
agaaacttga 
gtaatgttgt 
gaaataataa 
agaataatgt 
atattgtaga 
gtaacatgca 
tgaacatttc 
cacatgcaag 
atctccatca 
ttcgttctct 
acttgaacaa 
agaggataaa 
catagaagag 
tccaacgaca 
tcccctatat 
ccttattgtt 
actgatgaag 
tttcaacaat 
tattcctcta 
caacggtatt 
tactactact 
agaacaatcc 
ctgtttgatt 



gtcaatagac 
tctgggaaga 
agaaacatca 
agatcggacg 
aatctctgca 
cgactctgag 
tattggacat 
tgtgttggaa 
ggctctatgg 
ccacaagata 
cagagaagaa 
atgtaagagt 
aggtaatttt 
agcaatggga 
ccatggctta 
aaacaatcca 
ctgcaactat 
ttttttaaaa 
ctttaaaatt 
taattgtctt 
ttcttgtgga 
cctaagatgt 
taacttgaat 
catgagaaat 
tcatttcttc 
cacgataata 
caaccacaca 
tgtagccgtc 
gaggatcaag 
agaagacgcg 
caaggaggaa 
cgaaaagaac 
agccatcaac 
ggcggttgca 
agtcgacatg 
caagcgtata 
tgcagctgcc 
gttaagggct 
agacatggaa 
tagtagcagt 
taatgctgtc 
atttggttca 
aactcccaaa 
atactgtttt 
ttgcacacgt 
taaaacattt 
tttatcacta 
gaaggcttct 
ttcaactgtt 
actttaatct 
ctccgtgaga 
ttcaatgaca 
actcagataa 
ggcactgatt 
aaaatatcgt 
aggattatag 
ttttctgcag 
tgtgatgacg 
actactacta 
ccaaaaacta 
tttgaagtat 



catatttggg 
gggtgttttt 
gacagacttg 
caaaccattc 
gatctactcc 
cagtacacca 
tttctcatcc 
aaattccccg 
aaggcattgt 
aaaaattggc 
aggcccatca 
gatatacgtt 
ttacacttct 
gctaatgcag 
aacccatcag 
gtggaaattg 
ggtgtatctt 
atgaacacag 
gagtcaaata 
gatagaaata 
cactttcttc 
gaaaaatgtc 
tggtggttga 
aagaaactgg 
aaaaatagta 
ccaaataaag 
gccatcagtg 
aggacattta 
cgtgcagtcg 
aggatcaagc 
gaagacgcga 
aaggaggaag 
gaaaataaca 
gccatcaacg 
gcggttgcag 
attgacttga 
actaaaaagg 
gatctgagaa 
ctagaaaagg 
agtgttgttg 
attggtatga 
cctcataaat 
atgtctttca 
ttgtaaaacc 
aacactacga 
cttcaaacct 
ttcgtacgat 
cgctttcagg 
tcaagacatc 
ttcaacttaa 
ttctagacga 
gaaacgaatt 
tgaagcatat 
tgctttttgg 
atggtacttt 
ttaatcccat 
ctgaagaatt 
tagagagata 
ctgctccttc 
taaagagaaa 
ttaaaaacac 
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gtactacatt 
aatttcttct 
attggttatg 
gattggagaa 
tgaacaagca 
tgttaacggt 
atatgaaaat 
aaggagagct 
cagcgcagaa 
tcctcctcct 
atgtacactc 
ttgcaacacc 
ttgtgaatat 
ggcgagctaa 
ttgaagctgc 
atgagctcat 
agtgggtaaa 
agcacgccga 
tcgctgcttc 
cagattgtga 
ctcaagaaat 
ctttccatat 
tgtatgggtt 
ttgaatcaga 
aaattgatga 
caaaattgat 
tgcaaccctt 
tagctgtata 
atgaggaaga 
tcaaggaaaa 
taaaaactgc 
tagatgtgga 
caacaagcga 
taacagaaga 
ataaagacgg 
tggggttgta 
tccctcatga 
ttgtgactga 
gggatgaacc 
taaaccacgt 
ttccacggtt 
tcaaacctaa 
caatggataa 
acttgaggaa 
catatgggaa 
taaaatttat 
tgtcttcttc 
aagacattgt 
ttatcaataa 
ttgatgaacg 
■ttacaagtgc 
gagccaacgg 
ataaaatgag 
aacgattttc 
tgatgcgtgg 
tggacggttc 
acaaagtagg 
gagtagacga 
ttgataattt 
aaagagtcat 
ctgatatgtt 



attaatagag 
atcaaggaaa 
gtgataactg 
aacaggcgca 
gatctaatcg 
ttggggatgt 
gttgtatgtg 
attgcagact 
gaagaaaatg 
tcttctcctc 
tgttaatttt 
ctttttatga 
caacaacaag 
aactgtattt 
attatatgaa 
acaagaaaga 
ccgtacaata 
ttttttactg 
tacttttaag 
aacgggggat 
taaaatatat 
ttataaaatt 
aaggaataaa 
tgggattatg 
aaacgaagaa 
catctacttc 
ccttgggttt 
cactgcacct 
acccgtaaag 
acctgcagag 
cgaagaaaca 
ggaagtactt 
aacgctaaca 
agatatcact 
ggaagcaact 
cttgaacgaa 
gatggttttt 
tcaagccatt 
tgaagta ttc 
cttgaaaatt 
ctctcctgta 
aacattctta 
gcaacattct 
tttacccgat 
gcgtctagga 
aaccacttta 
gtcttctgaa 
aaatgctcta 
tcacatctca 
gtacgataga 
tcagactatg 
gactggacca 
gggaaatatg 
cagttctatg 
ttctgacggg 
aagaaagaaa 
ggatgagtgg 
ctatgaaaga 
cgaacaagca 
gttcttaccc 
ttctataata 



gagatagagg 
agagatgtaa 
ggtgttatac 
aatttttgaa 
aattcgcgat 
tccagaaccg 
gtgagcatga 
ataatgactt 
aagatgaaga 
ctgcacataa 
ttcaaacaat 
taccattaag 
gatgaagatc 
gacactaaaa 
gcaatatcca 
ggtgatgaaa 
actccctcat 
gaccgaatga 
gaggtatcag 
atcgatgaag 
aggtcaaaca 
tctgcaacag 
aaagcaggtt 
gccaatacaa 
ggtaaatatg 
tttttaaatt 
ctggtctttt 
caaataaaga 
actttggaag 
tactttgagt 
gcccttcgag 
gaagagaaac 
aacacaatca 
attaaaatgt 
cctgaagaaa 
gtagtagaaa 
gaaagatgtg 
cagataccct 
cctggtgact 
tctgcaaaga 
ggaagtgttt 
tttttagcca 
gtagggtgtg 
tcaaaaccta 
actggtgttt 
tctgtagact 
actcctaaga 
gataataatg 
cgaaaaacgg 
aaaa tagccg 
tatgaccaat 
gtcatggggc 
ctcgaatacg 
acaacgcttg 
tattttgcat 
gaagacccaa 
gaagagggta 
gatatccatg 
ttaggagaaa 
aagaaatcta 
ttcccacctg 



aggttctttt 
aatcacagat 
agaattgtac 
aatgaatggg 
gagtgtttct 
taaaatgata 
tatacaaaaa 
tgtaagtaag 
tgcagacgct 
aaaatcacgt 
aaactaacca 
acacgatggc 
ttacaaaact 
atcaagcagg 
aaaagaaaga 
ttacaccttt 
tgacgactga 
cacccgtaag 
attcaaaaac 
tgtataatct 
attcctcgga 
atagcgacag 
atacgtgttt 
atatcggtgt 
gttttttaat 
gttggacata 
ctgtgcttat 
aatcgaagaa 
attttgtaaa 
tgctaatatc 
attttgttat 
caagagaata 
tcgcagaggt 
taaaacaatt 
aggaagattt 
aaacaacaaa 
ctattttaat 
ccaacaaata 
gcttggatct 
acggatgtga 
cttccatgat 
acaaaaatga 
agttggacat 
gacctctaag 
tcacaacaga 
tttatgtcct 
cttccaccga 
gcgagtggtt 
cagaaactgt 
acaaaatcaa 
atgcaataga 
cagtgaacac 
ctgaagatat 
aatttgatct 
tccctttccg 
ttaattgtat 
aagaacgtga 
tttcaatttt 
atgcaaactc 
aactgttcaa 
catctgacca 



gaaaaggctg 
attaatggta 
ttcaaagatg 
aattactttt 
ggtgccgggg 
cctgtaattg 
gaagatgcta 
aacaagagag 
agcagcagca 
cttccggatg 
ccttgtatat 
ggtaaacttg 
cgtatccgag 
gtttgacatg 
aaaggccata 
gactacaatg 
aaatttatta 
cgaggaagat 
tgcaacagtc 
tgcaccatca 
attggataat 
tggcaatact 
gtgtagaatt 
cgctgaaaac 
acccaaacaa 
aaaaatgata 
tgtaattgta 
gagaaaaatt 
gggtcggctc 
tgcagacact 
tgagaacgac 
tgtcttcaaa 
acaaaaaaag 
cagggctgcg 
taccaataat 
tattgtcatt 
tgaagatttt 
caaaatcaga 
tgcagtttca 
aaacaactgc 
attgggcagc 
cagtacacat 
gttaattttt 
tgatgcagac 
aaatttggta 
ttgtgcaaca 
tactggggaa 
gtcttcctat 
ccaaaaaatc 
cgaaataaaa 
cacatttcaa 
ggttatcgat 
gtgggacgga 
aagttactct 
tggaacaaaa 
catttcagta 
agtgaatttt 
gtgcatgtta 
tttttatttt 
tagacctact 
agattttgca 



tgaagagtgc 
ataaacctcg 
cacttaaaca 
ctctgattga 
agaggatttt 
atcctctcac 
ttctttctgt 
ggaagaaacg 
gcagcagcag 
aaggcgaaaa 
ttttgtttgt 
gataatgttc 
gcaataaagc 
agacgtcaag 
aaggcattcg 
cagtatgaag 
ggtgatgttg 
attgaaggtt 
atagttaagg 
ttcggcgtca 
gtcgcagatt 
aaaaaattgt 
tttgcagaaa 
aacagagatg 
ccagctggtg 
ttttatacaa 
atgactgtat 
gaagatgaaa 
cttaacgctg 
gaagcagcat 
tctgtcgaaa 
ttggcaggcg 
gcagcattaa 
aacaaagata 
tcagatcttg 
aacaaaatat 
gatactggtg 
ttagtcgaag 
gttgataaaa 
ttcgttatta 
actgaccaag 
tttcaattca 
tcagaaagga 
atattggcct 
gacgattaaa 
aaaataaaca 
gaaaggatta 
attgatccga 
aaccaagaag 
tcatccatct 
gaaggaaaag 
acaactttaa 
gatgactgga 
gatttaacta 
aagataaaga 
acatatccaa 
aacctagaaa 
catgcacaac 
aaaaaggggc 
gtagaagatt 
gatgatattt 



141060 
141120 
141180 
141240 
141300 
141360 
141420 
141480 
141540 
141600 
141660 
141720 
141780 
141840 
141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143940 
144000 
144060 
144120 
144180 
144240 
144300 
144360 
144420 
144480 
144540 
144600 
144660 
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attatcgaat 
attcaaaatc 
caggtgatac 
ttgtattcat 
ctgtaatctt 
gtgctgtagt 
acttgtaatc 
ggatgaggtc 
ttgaattcgc 
tgccactatt 
gtgtcttgtt 
cagatctcca 
ttctgaaggt 
attcacagaa 
cttcacatag 
tgtcgaaaga 
tagtggaacc 
ggctgttttg 
tagactcgat 
acttgaggat 
gagatagaca 
cagattgagg 
agagagggtt 
ctgaacactg 
tgctgataac 
tatcctcctc 
acgaagagga 
cttcttcctc 
tttttctaat 
aatcgtaatc 
catatgctga 
tactctccct 
attcagtgac 
tggtagacga 
tatagtttcc 
tcttcttctc 
gagggcctcc 
actggagatt 
cagttttctt 
caaatgccga 
tgcgtccaga 
ccaattcttc 
cactcacgtc 
gctgcttctt 
ctcctacaac 
tagagaccat 
aagtagtagt 
ttggctgctg 
attcagtagt 
gtgatgtaga 
ggttgccaaa 
taataatata 
ttagaagtac 
gcaggggaca 
taggccctgg 
acgagtttat 
tgtaatgaga 
aatatgttcc 
ttctcatcaa 
cgttcacgtt 
cagtttgtac 



aattgtaaca 
ttttattatt 
ctgaacatat 
atccctggtg 
tgccctgtgg 
ttcatccccc 
aatgaggtat 
acatttcttg 
ttcaatgctt 
ttctcccacg 
ctcaacaact 
attgttgttg 
cttgtagctt 
tggaatagcc 
agcattaata 
ggaccagaaa 
catagaaaag 
cagatcaagt 
gtgcacacaa 
agtggacatg 
gacaggagtt 
atatttggtc 
cccatcgctc 
ggaagttgca 
actttccttg 
atcctcttcc 
agaagaacac 
tgactgctca 
aagattcttg 
aacttcctcc 
cttttcgtca 
cttggccaac 
actttcttcc 
agaactggga 
tccaagttcc 
caaatccatc 
cttatcaagg 
catacaacca 
cacagcttct 
cttgaccttt 
gttcatcaca 
gtcctcatct 
aagatgagct 
ctcggcgggt 
ttcttccaca 
tcttcttcca 
agtatcctca 
ctgctgctgt 
tctgttattc 
gcccttgccc 
caacaacggt 
cagtataagc 
aagaacaggt 
acactcttct 
ggcgttgctg 
agtttccaga 
gaaccacaaa 
tttctggact 
gggctgatct 
atagttgcat 
aagataactg 



tgttcataat 
tttcctttct 
ttataaaaat 
atcatctcgt 
agtgcttgtt 
tcacaaggct 
tgcttgaaag 
gcaacattct 
ccaaaaatat 
cctccgtcaa 
tccctcttta 
atgtcgttat 
ccataaggag 
agtgtatgga 
agagcactct 
ctcttatcat 
ataataagat 
ccctgatcct 
tcctcagcag 
ttgaatgagt 
gatctgttct 
actgaagcag 
tcgtcgaatt 
ccatcatcat 
tcatcttcat 
tcttcttctt 
actgatgagg 
ccctcttcct 
atgacatttc 
tcgttttcat 
tcctcttctc 
cacttgtcgt 
acaatttggt 
gttgacatga 
ttaaacagtg 
tcgtccacat 
gccttcttgt 
aggaatagat 
ccctcaacct 
gagacacacc 
ctattgaaca 
tcgtcattct 
ggcaccttgc 
gtgatttctg 
gtttccgtgt 
gcagcaaact 
aattcataaa 
tccttgattg 
ctagacaaca 
gtgcctcaat 
tggaaatctt 
ttttggggaa 
.gattctgcat 
ttcttgaaaa 
gaggaggacg 
agacaacata 
attcttcttc 
accacattat 
ccatttcttg 
atagaacagt 
gacgcatttg 



aaaatttgtt 
aatccaacat 
caggaatata 
gccagatgcc 
tgtatgcgtc 
ccatgataat 
gcccagtcaa 
taaaggcggc 
cctcattatt 
caataaggtt 
gaagattttc 
gttgcccaga 
tagtgttatt 
cagtagcgta 
tcttgctctc 
aatctacctc 
tgtccacatc 
cctcatcttc 
cctcagttcc 
tggtgatgca 
ttgataggag 
atggtctgaa 
cgcactctgt 
cctcatcttc 
cctctactac 
cagcagcagc 
cgtcgtcatc 
tttcctcctc 
caactgaact 
cttcgtcttc 
caaatacagt 
tcttgaagca 
tagtggtctt 
ccaatccagt 
acttatctcc 
a tacagcatc 
tgagtgtttc 
tggccagttt 
tgacaatgtt 
catcacaaaa 
gtaccgattt 
ccagaaattc 
ccataagagc 
gaggagtctg 
gtggactcct 
tggccttgga 
cttgatctgg 
actccatttt 
cgtcttgtct 
a tatacccct 
agtagaaaga 
caacattcat 
ccagctcatt 
a tatggatga 
cagaagaaga 
atttagctgt 
ccgtcttttc 
cttcgtttaa 
ttgtgcttag 
gggatgaaag 
ctagttctgg 



gaaataaaat 
atataatggt 
cataaacatg 
cctcagcgag 
ttgagccctc 
attctttctg 
gatgtcgttg 
agcagcagtc 
ctcgtgcacc 
gcgaacgtat 
aaccatctta 
aaagttccct 
aatgtggtta 
taccttggca 
acaataaggc 
caccattgaa 
gacaagtact 
atcctcatcc 
ggcagggaac 
tccagaggag 
agaaacatcc 
gctgttgtct 
tacatcccca 
ttctgaatct 
tgttgctgta 
tacaacagtg 
atcatcatcg 
tacatcactg 
ttcaatagtt 
atcctcactg 
attgttcagt 
acaacatcca 
gaaatgtgtt 
tgacttgatt 
attacttcca 
agagagcttc 
aatagtcacg 
tgtagtatca 
gctaaccaga 
ggtggaaggg 
ctgcgccttc 
agcaacagat 
gtccagttct 
ctgctgctgt 
gatagtgttc 
aggagttggt 
ttcctcagag 
tagaatattg 
gtacgattag 
cagtagacct 
accaatggaa 
ttattgtcat 
ttgttcaacc 
agacacttgg 
agaagaacag 
aattggacga 
tggtataaaa 
gggaacatat 
agccgatttg 
gaacttgtcg 
agggacgtcc 



acagtaatac 
gatacattta 
ctattgtaag 
atggagcttg 
ttcatctcat 
gccgaattat 
gttgtagtct 
ttggcagaca 
ttcaagtagt 
ctgcggctgt 
gtgtacatgt 
ggcaaactaa 
atgggcatag 
gtcacattat 
ttcacagact 
gattcgtatg 
gagtagtact 
ttcttctttg 
aacttaagag 
tgtctgcata 
atctcttcct 
gaagccaggt 
aacacaactt 
tcactgctac 
tccttatcct 
acagatgaag 
gagtcttctt 
agaccgatag 
tcattgttgt 
tcgctgtatg 
gacttgaggt 
tttctcatag 
cctgccttct 
gcactatcaa 
aatatctgcc 
aatgtgaaga 
tgctcatgga 
gtggcgattg 
tcctttcctt 
gacagttctg 
ttattcttct 
ttggccagaa 
tgaagagatt 
tcttcttcct 
ctcctctgct 
ttcttggcag 
aaagtgacag 
ggttattagg 
tcgagatgat 
tgataatcac 
caagtcagga 
agcgctaaca 
cacttcattt 
aattcttcaa 
ctgccgttga 
ggagacttct 
gtagtcccat 
ctgaccgact 
tattccccgt 
gttacgtgtt 
tggtttttgt 



144720 
144780 
144840 
144900 
144960 
145020 
145080 
145140 
145200 
145260 
145320 
145380 
145440 
145500 
145560 
145620 
145680 
145740 
145800 
145860 
145920 
145980 
146040 
146100 
146160 
146220 
146280 
146340 
146400 
146460 
146520 
146580 
146640 
146700 
146760 
146820 
146880 
146940 
147000 
147060 
147120 
147180 
147240 
147300 
147360 
147420 
147480 
147540 
147600 
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147720 
147780 
147840 
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147960 
148020 
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ttattgttga agtaggtttt atctttttcc tatttaagca cacacccttt gaattaatgt 148380 
gtatgcgatt cgcattgatg tatatgcaga ttgtgagtgc tggagaaaat ggtatctcaa 148440 
aatcggtagc tggttcaata atttctggtt caacttcctg ttgttcttct tcctcatctt 148500 
tctcttcctt atttttgaca cctagaactc taaggttggc agtactaatt attctggctg 148560 
cagccgtgct ggtagcagaa ggggaagatt gttgggagat aacaggagac atgttgaccg 148620 
tgccgtgtcg tgaggataaa tggctaaagg tatatacggc atgccctttt ttccgttact 148680 
gtgtgcgagg tcgttttcta ggaccgtaga tgaccatctg ccttcctctc tttttctgga 148740 
caacctttaa ttgccatggt gttatgagag gtaaactcgc ctcatagaat ccgcaaactt 148800 
cctctaaagg gaacaagtgt ttccccatag gggacaattc aaaaatataa ttccaacata 148860 
ttaagggtgt tgttttattt tccttttcat cgtactccgt tattgtttta ctaaaatttg 148920 
gaaggtgatt ttttgataga ctatcggaca ttatatccgc tgcaagttga cgtttatgtt 148980 
caatgacata atcattcgaa tcaaagttgt cgtgtgttga caeca tacca ttccatttcc 14 9040 
atttggctag tttggagagg gatttattgt tgcgcaattt tagaegtgea gaaacgtttt 149100 
gecatattat tttagaaagt acaggcaaaa caacatccct caegtcaata ctgtccatta 149160 
atatatcagt agaatcattg aaccttataa catctctctg tatatcttcg tttgatgggg 149220 
ccaatcgaac tagaggaact aaatccccag ttatgagatc ggtgaaaata aagctctcgt 14 9280 
catcaacaat ttttttggag cctatgtaca tctcacactt actataaaag tcatttttat 14 934 0 
accctggttt cttttcccac ctagaacata agagtattaa aactctcgac aaattataca 14 9400 
tggcacctac tcttccttcc atattactga tagctctata catattaacc acgtattgta 149460 
taggtgette tacgtacaaa tctgaaaatc cactagcggc agttccttca ctaaaggcta 14 9520 
gaacttgttc catagtcttc tcttctgaaa acatttgetg aattatttta gtggatagtt 149580 
tgtgccgctt attgaatttt gtgccagggt actgggaccc gaattcgttg aaatgttgtt 149640 
gttgtttttt ccctacttct ttagtttcat gecaggeggg gtgttcttga agagcageca 149700 
cctccaggaa caaatcacag atggctcgta ctccatcaca tgtgcatttt acagtttcag 14 9760 
ggaaatttaa tcctgttatt ttataaaaca actctctgtc tattattttt cccattcgta 149820 
ggegattatt gacattaaac ccacaacact cgcctctgat tcccattata aaggctaaag 149880 
aatatactat aatatttccc gcacttgaac ccacgagtct aagagttgta tattctccat 149940 
cgcagttata gagacatctt gecaattegt ccattgattt cctgcccgtg aagegcacat 150000 
ccatgagtct tgctcctcca tatccaattt cagegtttat tgaagtcctc caacctttat 150060 
ccttgtttag gcgctgtatc tttttcgect tctctgccaa tacattcata aaagtgtaaa 150120 
atttaaegtc tcttacagat tccataattt tttcctcaag tgttagcggc ggcggtggcg 150180 
gcgatccttc tttgtagcct tcaaagataa aagagtgtta ttcaacttac agtgtttatt 150240 
ggcccagcaa ttcttttctt tcatcaactg catagtagaa ggtcgategg aaagaatgac 150300 
caatttggaa tccaaatccc taggtttatc tagtccagct tctatgtcca tactctttcc 150360 
tcctattatc atatagtctt caaacagtgt tttattagca gacatgttaa tcatttcaat 150420 
tacagtctgg aggcatgttt catgtctatt gaccagatct tcatcatctt catcccatgt 150480 
cacacccacc agaatatttt ctcctttctt gagtttggat acattttctg taatcttttt 150540 
cattatccgt tgtttcttgg acgccacact ttegtagcat agegteggta ggggagagtc 150600 
gcaatccaca accaagtatt ttttcatgat taatttgttg aggattgtgt tctttgaatt 150660 
gaagaaaaaa ateattegtt ctgtaccatg gagacaacta tggataatgt cgttcagaat 150720 
aacgaegtaa caaaaccaac accagatgtt gctactgtta caacagcaac tgagaaacgt 150780 
cagtcatgea aagagaaaaa ggatcaactt aaggccgaat gtcctcaagt actgagagca 150840 
ctaaaattgt ccaatacttt aaaggcaaat tttggaaaat ccatgtcggc tatttttget 150900 
caacatttag tggacatgac aaacgctaaa cactttaagg acccaaagac aaagaagatt 150960 
ttagaactgg atggaagtag tagcagtgac agtgaagaag aggaagaaac tagttcttca 151020 
tecaaaegga aaagaggtag tggtgctaga agtgcttctt caaagaaaga aaaatgcccc 151080 
aatactatca aaaattggct caatgatget caaggtgtat tccgccagtt tgeagatate 151140 
atcattaatc ttccctcttt tgatgatctt agagacgaag taaaggatga acaaactgag 151200 
ctaaagacca tatatgactt gtatagacag gacatggaaa aggtggtgga agaagtttta 151260 
gggcgccaag acctgtttga tcacaagtca gaaatageca aaggtttggc ccgtttcgat 151320 
acccacgtct cgttgctccc tteggatagg tcggctgttc tagactegtc catatccaaa 151380 
gagttggaaa aaaatagcaa gggcccgaac agtaatattt ttgacacact aaacacactc 151440 
aaggaagaaa tcaaagaact tttatgtcat catgtcaaat atttattgea aaatcttaca 151500 
ccggaggatg caaattttgt gttcaatagt tctgtaaagt atgtaaagaa atcatatcaa 151560 
tactacatac aaacatcaga gatggaaagt gatgaattta agtcccttct cactggagtc 151620 
aacattaaaa tattggagaa gataatctcg tccgataata atgttgctac tccttacaaa 151680 
cacatcacta atcccaggaa cattatttcg tctttacaaa aagtacgtga aactaaacct 151740 
gtttcaaagg attatcegtt cagagtggat aeggecagag atattgtact acttccagag 151800 
acgggtggca tttctgatct ccctatcaag cccgttacat tattgeagtt ggtgtcctac 151860 
atcaacgctc tcttttccct cgagcgccga aatgttttca ccgacggctt ttttaatgea 151920 
gcgtgcgtcc taatttccca gtgcctaacg aatgegaace tattatctaa cgactttcct 151980 
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aaacccattg 
caagaaggtt 
aagggcggtg 
tcttcttctt 
gcagtagaaa 
gatttagata 
gaactcggca 
tatgcagtaa 
cttttagatt 
acttcctcca 
gtggttacaa 
gtggtggtgg 
ttaccctcga 
acaattagta 
gacgaacaac 
gtgggagaag 
agtgtacaat 
aatttggcgg 
ctattttcta 
aacctattga 
acgctcgtaa 
cctccaacat 
caggatatga 
atcggggtcg 
gtggactcaa 
atgtctgaag 
tcttttgtag 
acccctctag 
aatgataaaa 
gttacaaata 
gttataggcg 
gcagcaaatt 
ggaatgatac 
atgtcaaagc 
agcagcagac 
gcatttctgg 
acaaaggatc 
gacaaaaata 
aatataactg 
gagatgtcta 
gtcgttttta 
tcccct tgta 
ccacccgtct 
ctcttgt tga 
tggagggagc 
cttcttcagc 
caaagaagaa 
catctccaga 
taatcgataa 
cgctactcac 
cctttaatca 
ataacaacta 
aagctgt ttt 
ctgaagggga 
atattgtcca 
tccacaagat 
agaaagtaag 
gggattctct 
agaggccatt 
cttccactgg 
tttctaaggg 



aactggcagc 
catccagtga 
gtggtggtga 
cttcatcctc 
aaggcaagaa 
caattagtaa 
ataaaattag 
caa tcactga 
tgataatgag 
ccgccaaatc 
tgatggtgga 
ataatactgt 
atgaaaatag 
acaaagtctc 
aacaacaaca 
tatttctaga 
taacggctga 
ccaagataaa 
acctcccaga 
aaaacatatc 
acaatacgat 
caaaagaaac 
ctacaatgtc 
tacgctcgag 
atactatgga 
ccttaacaaa 
acggtgctct 
caaaatatac 
atacacaaga 
agcctacaag 
cagcatcaat 
atcaagcaac 
gtaatactac 
tcgccaacaa 
gaggagacag 
aacagacaag 
atgtggaacg 
ctactactac 
gcataaacac 
gcttcagaac 
aactttaagc 
cccaaattat 
atcattcaat 
gaataatacc 
agaagttgcc 
ttcggatagt 
acgtaatatc 
aatgctcagt 
tagttttggc 
tgaaactatt 
actcgaaagg 
cagatactgg 
ggacgcaatc 
gggaatcggg 
ggctgccgat 
ggagaaggcc 
gaggaacaag 
tcaagagcgt 
ctccaattct 
tgttgttaat 
atgtttcact 



taatgttact 
aaagaagagt 
cgattctgat 
ctcctcctca 
aactaaacgc 
actgattcta 
aaatttaata 
gatgcaatct 
actaaaagaa 
taataatgct 
ttctggagct 
agttaatcgc 
aggacttaaa 
tacctcgaat 
atgccctcaa 
aatgggaaaa 
cgcgttcaag 
aacggctata 
atctgtaggg 
ccaaagtgcc 
ggaccaacaa 
ttctatcttt 
aaatctggca 
ggaatttgat 
catggcagtt 
taatgcgaat 
ttcttacatg 
aatgaaagat 
tttatataaa 
tgttgtgcat 
caagctcatg 
atcaaaatct 
tcacctgtgt 
ccattttatg 
atcgtccctg 
aggaagagga 
tatgaagaga 
tacacccagt 
gataaaacaa 
ttaaagtgcc 
gtttgtggtt 
aacaaagtgt 
atcccaactg 
caagaagaaa 
cctagaggtg 
gaagatgatg 
aatagcggaa 
gctgtaaatg 
gttcagttta 
gccgtggaat 
accggagagc 
caaattcgaa 
gttgaaggaa 
cttcaattta 
attgaatttg 
gatgaatctt 
tcccagccta 
attgataacg 
gctgctgctg 
ccacgaggga 
gtgaggaata 



cgccataatc 
aaaaagaagg 
tcagaaacag 
tccgaagacg 
aaaacaaaaa 
aaaacaggag 
gacaaggatg 
acgccaatga 
caagtaaaat 
ttagatagtg 
gaattggcaa 
catgaagcat 
actgttgtag 
gaagaaatga 
catgaacaac 
tcaatagtga 
caaaactact 
tctatcggat 
aataatactg 
caagctaata 
aactcagcag 
cccggaaacg 
cgaggatttt 
gaaggaggag 
aattttgccg 
atgaaccctt 
tttgaaaaaa 
gtatcaaatc 
aatagagcag 
tcccagctag 
gaagcagaag 
acaaatgcta 
acgaccatcg 
agtgtattaa 
atgttgcagc 
ggaggagtat 
gattggatat 
aatgccggcc 
gacgacaaga 
aattttcata 
ttgggtcgtt 
cagtatggtc 
aaccaaccaa 
tgattctgga 
tcaagagacc 
aaggaggaga 
agtactggaa 
atattgataa 
aaaagagtgt 
atggaacaat 
ctcttaagaa 
ttgaagccgc 
atgacaccgt 
acaagagtgt 
gacaagttgc 
ctaattcttc 
caaattctta 
caatcaaagt 
ctgaagaaga 
ttaaggatat 
ttgttgcagc 



ttctgagtat 
agaaaaagaa. 
attcttcctc 
aagaagaaga 
agaaaccatc 
gttacttcca 
attttgcggg 
atcaaagatt 
atagtgttga 
ctaaattgac 
ggctagctgc 
tcattctaac 
agtcattctt 
tgtcggtgat 
aaccggattt 
actcattccc 
cgcctatggg 
ctaatatctc 
taactggtct 
atataatcaa 
ccatgtccat 
atccttcatc 
attccatcgc 
ttaaagcgta 
ctcagtctct 
ctaatgtatt 
atggcagtga 
gatacttgaa 
aaagagcact 
caaacgccat 
cagcggaatc 
ttaatataac 
ccgtgagtgc 
acactgcaaa 
aacaacaacc 
taggatcagg 
taaacatgat 
ggacattagg 
gtatgatgga 
taaaaatgta 
cacggtccaa 
aatggaaagt 
ccccagaacc 
aactactgtt 
ccttccttct 
gcaaccacaa 
aatcgaaaca 
cgtgtccaaa 
atctgaagaa 
cacaaatgtt 
gaagcgctca 
tgcagcagag 
gatcaaggca 
tagctctcaa 
gcacatgaag 
tggtgaatca 
ctacactttc 
tatcgaaatg 
caccactact 
tcactttttc 
aaatggtgaa 



gaaaatgctt 
gga taaaaag 
a tcatcatca 
aaaaggagaa 
aaaggacgac 
cgacacgagt 
cgtagcccaa 
agtatctagt 
tactgaaagt 
atctcaacaa 
cttctttttc 
atcaaaactt 
taaaaattta 
gccgttcgaa 
gaaaagagtg 
ttccaataag 
aagacgcata 
gcccaatata 
aaggctaacc 
aaatgccaac 
actactcttt 
tataaaatta 
agaaggatgt 
cactttacta 
ggaaaaatca 
agaaggagga 
ctgtgaaccc 
aaaattcaac 
cgtcgaacag 
gggggtagct 
tgaaatgaga 
aaatacaatt 
agccgccgac 
taacagtcat 
aacacattca 
aactgaacag 
atctccagaa 
atatggatct 
taaactttct 
gtattatgtg 
gatgtaagag 
atctaaaaat 
aacaacatgt 
gaaggtgttg 
tcctcatctt 
acgaaacccc 
atcgaaccag 
actattcctc 
cagatcaaaa 
aaatattcaa 
aacaatggaa 
aatgttacac 
atcctcctcc 
caagctaaaa 
tgtaatttgt 
ccaaaggtta 
actatgatcg 
tctccagtta 
acgaccacta 
gattcatcca 
gttccacaag 
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aagaattcgt 
ctactttcaa 
gcatctgccc 
agggaattgc 
cacatacatc 
tgctcttcca 
gtttggagcc 
gcagggcaac 
atgttttgtc 
ctatggaata 
agaagggagg 
caatcacttc 
cacacaagaa 
caacagcaaa 
caaagcttac 
aaatggtgca 
aagagaagat 
agatcaagac 
ctgctgataa 
atgttgtaag 
aaaataaggc 
agaagctgtt 
gaaaagaata 
taataaacta 
tgtaattgta 
tttacgatgg 
cagactgcaa 
aaaccttaaa 
aaatatccga 
acattcagtc 
aagtttttga 
cctttctggg 
atgtttctgg 
accctccgaa 
aaaaattttt 
atctttagta 
aaacagtcaa 
tggccctacc 
tctaaactcg 
attctctagt 
aacgcctgtt 
ggtgtaatct 
cccagcggtc 
agggaaaaca 
agagtgaagg 
tttctggaac 
ggtaccactt 
cgaacttgac 
tttttgaaaa 
cacataccct 
ttaatgtacg 
gcggtccacc 
tgagctgaaa 
aaacagaagg 
aaacacacaa 
ccttcacttg 
ataaaaattt 
aagttcggag 
gctccagaaa 
ctgatcagat 
ggtgtaaaac 



ttctgagttg 
gaagcttatt 
ctactacacc 
tacctacaga 
tactacccca 
gagtcctgga 
tatttcatcg 
acgctgcttc 
tattgttcat 
tggtgttgca 
agatactttc 
ggaggaactt 
acgaaagagc 
tactccaaag 
tttagattat 
gagaattttg 
ggaaattgta 
gaatgcaaag 
ggtgtgtcta 
ggaggataat 
aagcggagga 
gcacgaattg 
aaataaaagg 
tatagcaaat 
catagtaaca 
gtgaccgtcg 
atactcacct 
agtttggtgt 
gctgcaatct 
gacccagcgg 
gatggagaat 
tacgttcata 
agcaacagac 
cttgacattc 
tgaaaagttt 
gaggtggtgg 
aagttaatgc 
cagcggtcca 
agtgacccag 
gaatacagta 
ccagaaacag 
gacatcgagt 
caccctctaa 
acactaagtt 
aggctatcca 
agccatttct 
ctggcacatt 
atcaggccga 
gtttttgaga 
ttctggttac 
tttctggagc 
cttcgaactt 
aaatttttga 
ttacacctac 
aggttgatat 
tacacatttc 
ttctgggtcg 
ggtggaccgc 
tggtggggga 
aaaaattggg 
gtactgaaac 



tacactaacc 
catgaccgta 
acaggaggcg 
atatacgagg 
gcacaggcag 
actgatattc 
ggcgaaattg 
aaattttatt 
actgatggac 
aagagcaaga 
catagcgagg 
aatagggaat 
aactcttcta 
aagacgaaga 
gttgatagta 
gctcaggaga 
gagggagaag 
gcatacaatc 
aagcacacat 
gttaataaga 
gatgatgaag 
ttgaatgaat 
ttaatataat 
attaatgcta 
tgccttttct 
tgataacgcc 
aacaggcctt 
ttcatttttt 
gacattggct 
tccaccctct 
gagtaaaatt 
tccagaaacg 
gtttctggtg 
ggtcgaccca 
ttgaggagga 
tccttctagg 
acatttctgg 
ccctccgaac 
aaaaaatttt 
ggt tacacac 
tcaaaagtta 
cgacccagag 
actcgagtga 
gcttgtcaat 
gatatctgga 
ggaaagggtt 
cgtgcaatct 
cccagcggtc 
tggagaacga 
ggtcatgtcc 
aacagacgtt 
gacatcaggc 
aaagtttttg 
cctttctggg 
acgtttctgg 
acttagtata 
ctcgagttta 
tgggtcgact 
gtattcagag 
tgtatacatt 
attacatcac 



ttctaaaggt 
caatgaatcg 
tcccccctgc 
acaggactgg 
ccgaggcaac 
aaaagttcct 
tgtaccgttc 
cctcttcaga 
ttttcagtag 
gtaaaattat 
aggatattga 
gtaacaccaa 
ctactactac 
agagcgcttc 
catcatttgt 
gagtaaagac 
aagcacaaga 
ttgccaacaa 
tggaagattt 
cggtcgcttc 
aaactccaat 
aatgttgttg 
gttgtctttt 
ttgacacatg 
taggcatttg 
tacaccaggt 
ggtatagagg 
tacatcgggg 
ccctacccag 
aaactcgatc 
ctctagcgga 
cctgttccag 
taatctgaca 
gcggtccacc 
gagaggaggt 
agctggtgtc 
agctatccca 
ttgacattcg 
ataaaatttt 
taccctttcg 
atgtgcgttt 
gtccaccatc 
cccagaaaaa 
tcatacacta 
tcaaccagaa 
acattttatt 
gacattggcc 
caccctctaa 
gtaaaattct 
agaaacgtct 
tctggtgcaa 
cgacccagcg 
agatggagga 
cgcagctata 
agcaacagcg 
atttgtcatc 
gagggtggac 
cgatgtcaga 
tgtgtttctg 
ttctgatccg 
ccacagtcgt 



tgaagagaag 
acacattaag 
agctgataag 
agtgttccaa 
tggtgctatt 
cgatgctaag 
aaagtggagc 
cgagaagatg 
tgtacacttt 
ccccaaaact 
agtgcctgta 
gggaatgaac 
ttccaccact 
tgctgccagt 
attttacaat 
gctaaaggca 
aacttataga 
aacatctgga 
gggggatgta 
tactactact 
ggaatttgaa 
tgttgtattt 
acattttcct 
ttactagtcc 
aggcaaatag 
gacagtatag 
aatcaataaa 
acagaaatgg 
aggtccaccc 
gagctgaaaa 
aacagtaggt 
aaacagtaaa 
tcgagtcgac 
ctctaaactc 
aaattgatac 
tccagaaacg 
tttctggtgc 
gtcgacccag 
gagtcggaga 
gggcgcagca 
ctggagcaac 
cgaacttgac 
tttttataaa 
gtgcaggttg 
tccagaaacg 
atatttggta 
ctacccagcg 
actcgagtga 
ctagcgaaaa 
gttccagaaa 
tctgacattg 
gtccaccctc 
cgagtaaaat 
tccagaaacg 
tgacagtagt 
acatggtcga 
cgctcgctcg 
ttgcaccaga 
gaacaggtac 
agaaagtaaa 
cctcctccct 



gtggatcatc 
gcatggtact 
gtttccgcta 
ttcgatgggg 
cacaagtcta 
aaggcagaag 
cccaatgata 
aacattgctg 
agaaaggata 
atcaagatta 
aaattcactg 
agcctccgag 
tccacttcaa 
gacccatttg 
atcagtaagg 
gtcaagaacg 
ggaatcgtaa 
gtacttttcc 
cttgattttg 
acatcatctg 
actgacggag 
tgtatataga 
aaacttgata 
ctagtcaacg 
gaaaagagta 
agggtttcag 
aaatcaacaa 
ggttaaaaac 
tccgaacttg 
attttttgaa 
tacacactac 
gagttaatgt 
ccagaggtcc 
gatcgagctg 
agatagaggt 
cctgttccag 
aatctgacat 
cggtccaccc 
tagaagggaa 
agtctccaga 
aaccatttct 
attcggtcga 
attttcgtca 
gttgagatag 
tttctatcca 
tattatttct 
gtccaccctt 
gctgaaaaaa 
cagaaggtta 
cagtcaaaag 
gcccgaccca 
taaactcgag 
tctctagcga 
tctgttccag 
ttctactaag 
caaaaatttt 
gtctgatgtc 
agttgttgtt 
acatacccaa 
ccaactgaac 
tctaagagag 
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aacatatccc 
attatggacg 
gaggagaatg 
aaaaagatcc 
tcatcatcat 
tacgtggata 
caaactttag 
gactcatcat 
ccagataatg 
gaaaaggaaa 
ataaacacat 
cttttgaaga 
aagcaagtaa 
aggttggtgg 
acggatacta 
atcaagaagc 
agttctaatg 
ggaggttcta 
ttaaacaagg 
gtctgtgtgc 
atggttacta 
aaattcatat 
actggcccca 
attatagcca 
gcgacaaatt 
cataaaatgg 
cataaaatgt 
ctcagtgaaa 
ttgcggaaca 
gtaaaattgg 
ggatgtagca 
ggagccgatg 
agaaaaaaca 
gctttcattg 
acgtcatcaa 
acttcttcta 
ataccacttt 
gaccctgaac 
aaccggcaac 
tcattcgggc 
ataatcaact 
cttcctcatc 
tgtctactgc 
atctgttgcc 
ttgtgccgtt 
accaggctga 
agtttaacct 
tcattgcgga 
catcacgcaa 
cagtaacaat 
gtgaagtggg 
gaatggcact 
gagtagaaaa 
gggtagagat 
atttgaagga 
cctattttgt 
ctgaacataa 
catcatctgg 
ttgataacga 
ctgctggagt 
attgtagtgg 



gtacccaact 
atctctacat 
aggtaggagt 
tccgtaaacg 
catcagacga 
ttgcagatac 
agaacgacct 
cgtcttctgg 
tccatgttct 
aggaggaaga 
ctgtagatga 
aaaatagttt 
acgaggccat 
aggactcttt 
gtaattcatc 
gtaacattgc 
aggagaataa 
atgcatccaa 
ttatattttt 
agcgagcaat 
aaatcttgaa 
ccgttggaag 
aacataacaa 
tgttggttaa 
tagtacaatt 
cgtctgtttt 
tattaagggc 
gaaaacctag 
aattgaggtc 
tgtcacagca 
tgattagcgg 
ttaaggacgt 
ttagaagaaa 
caatgggttt 
ataaaaacaa 
cttctggtcc 
ctccacgatc 
tcttttctga 
tcttctccct 
tcattcaact 
tttcaagatg 
atcatcagct 
tgatttcaag 
agattctccc 
tctgggttca 
aaattcttct 
attgaacaag 
tactcctccc 
cccatgggtc 
tgaagtctcc 
ggagaatcca 
taatgtcgtg 
cagggccaat 
gctcctcaag 
acattatgag 
tgttgaattt 
tggatcatca 
gccaagtgga 
aatggggaat 
ttctccatca 
tgatgacgtg 



acctcaaaca 
gagctgttct 
agaaggagga 
aaagagatct 
ttctgactct 
aaggaaaccg 
ctacatgtct 
tgaagaagaa 
tggttgtaag 
gtatgaagaa 
ttgtgttata 
ccagttctct 
gaattcagca 
gggtgacaca 
ttcatcatcg 
aatgggaaga 
tttccgccct 
caagaaactc 
tagagaaata 
gaataatgac 
tattattggt 
agatgcacta 
gagactccgt 
tgatgttccc 
tgcttctgct 
caatagtagt 
taatttaatc 
aacattaact 
tagtggtttg 
attaatgtat 
aaactttgag 
gaagactgtt 
cgtccctttc 
gcttttcgtg 
gacagaggaa 
aacatttaag 
ttctggtcca 
agccagggtc 
cctttcgagt 
gtgctgacaa 
tttggaagtt 
gccgcatcat 
aaagttgctg 
aactttaaga 
tcgaaggcag 
aaaggaggaa 
gtagaggctg 
tctaaagata 
aaattgactc 
aatgatagga 
ttcctaatga 
tggttcttca 
tccaccta tg 
aactttattt 
catgtttata 
gaaagagtgg 
tacatggaat 
ggcggaaaga 
aacaatagta 
ctttctccat 
tgggggaaga 



acaacaacaa 
tcttcttcat 
ggaggcagaa 
tctgttaaaa 
gatagagaag 
cccaaagtaa 
agttcatctt 
agtgacgatg 
aaggaaaaat 
gaattcaaga 
cctgatcgga 
cagccagttt 
ttctcttcca 
tctaaaattt 
acgttcgtca 
gtagcagaat 
gtagtttcct 
aatagtaaca 
ca cagtgtta 
aacacaaatt 
aaaattcctt 
tatttgtacc 
atccctcaac 
attacctcag 
atggtggatc 
tattccgtat 
ctatctattt 
cagagcgtgt 
accagtgaag 
gagggtgtga 
gatgaagacg 
ggactatctg 
tactgaaaaa 
gtgttgatgg 
agaaagggta 
agagaagcgt 
gaactgaatc 
agtcccgacc 
ataagtagag 
caactaagaa 
ctgctaataa 
ctgacgatca 
ccattcttgc 
atgtgattaa 
ctgaaagtgg 
gtgatggaaa 
aagaaatggc 
atccattgag 
agaagaatct 
gtatccgtag 
ccatcagtgt 
acagattctt 
tagcaactag 
ctggagaaaa 
acaagatcag 
acaacacgat 
acaggtgctt 
gtggtgtcct 
gtgcagctgc 
ttagcagtga 
agatgatatt 



caacaacaac 
cgtgctcaag 
tcggaccaac 
gtacatcatc 
aaaaagaagg 
gaaaactgga 
cttcttcctc 
atgatgatga 
ctccccaaga 
gaatggcatt 
ttttaaccct 
ctttcctccg 
tgttatccag 
cctcctttat 
ataattgtac 
tgctgtcaaa 
taatgcgagg 
ggcaaactat 
tcgcgctata 
ctagcggata 
ataatgaaat 
agaatgtgat 
aacaagctga 
atttactttt 
ctgcatatcg 
ataaagtcct 
tatcagctag 
atttgttcct 
agagttctct 
ctcgtcaaac 
gtgtaacact 
cattgctatc 
atatggaaac 
tgtacatctt 
gtaaattatc 
ctcctgaatt 
taactgacgc 
gttaacggaa 
gcgttgcacc 
tatacattga 
ctttaatggt 
gcagttaggt 
caacagaaca 
caaccctaac 
tagtgcaaac 
gaaatcatcg 
ctttaaacgt 
ggatgaccct 
ggaatatctt 
tggaagatat 
tgacattaga 
ccgcatggtt 
cgatgctatt 
tgtgccccag 
taaatgtgga 
tggatttgtc 
tgacacaatc 
gtcttctgga 
agcttctgcc 
tggagatgat 
caacacatca 



catgacatcc 
ttcatctgaa 
agaggccaag 
ttcttcttcc 
aagaaaacta 
tactccttca 
ctcatcttcc 
cgattatgac 
catagaagct 
accttcacgg 
cttttctaca 
gttggtgatg 
ttctggtatg 
aacacctcaa 
agatgaagat 
tattgcagcc 
accaacttgt 
tccccaagta 
tttatcgtcc 
cgctgaaggg 
gagtagagaa 
cacggatatg 
tttttgttac 
aactggaaag 
cctggctgtc 
agatcttgac 
aaataagtgt 
aaatcatctt 
aggaacagcc 
catcgaggac 
gaaatgtttg 
tgataggctt 
tactacgctc 
tatacagtac 
atcatcatca 
aagagaagac 
tgaatttata 
aaaaggtgca 
gagccaagca 
aataacccaa 
gacaagaaat 
ccccttggac 
gagtccctat 
cagatatcta 
aagaatgaaa 
cagcagaaca 
gtggctgaac 
gatgctattc 
ttctgggaag 
cttcaagcca 
atcctgcaaa 
tctggacttg 
gcccagatct 
gcactgaagt 
cgtcagccat 
aattctgata 
aggaagaatg 
acattcttca 
cctgctgttt 
gatgacgacg 
ggagacggat 
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caggagaatc 
gagaaaatac 
gaaatgaaga 
ctgttgaaaa 
attatacgtc 
acagagtaat 
tgagtgacat 
agatggttga 
agattcgtct 
gctttgacaa 
aggaaccttt 
atgacaagta 
agattgacac 
tacttagaaa 
tggtgcatag 
atgaacgtaa 
ttgaggatat 
gtaaaggact 
atgagatcac 
atttccacgt 
atatctccat 
acgtggctgg 
agtactttaa 
taacctctgc 
attgtaagct 
gcatctctta 
actcttcttc 
agttcgataa 
taaaaatgtt 
gtgatagtag 
cagcaaagaa 
ctgccgttaa 
cagcagaaga 
acgacgatac 
attctaaagc 
aaacacaata 
atggttatat 
acagatgtct 
gctatattaa 
atttttatat 
tcaatgttaa 
cgtatagcag 
taagctgggg 
tcctccaatt 
gaatctcttg 
atatggcgcg 
ggaatcttgt 
tttgaagcaa 
gtctttatac 
agagaagagt 
gtattgttat 
cttccaatgg 
gttccaagag 
ataaaaagtc 
atagtaatcc 
ggatctagtg 
ttaataattt 
ataacattcc 
cgggcaggca 
tgttggatcg 
gctgcaatag 



ttctgggcag 
tgcttctctt 
taaacaacta 
ccatgtaatg 
aaacaaggaa 
ggttgaactg 
tgaaactgta 
acatggatgt 
tagtgccaag 
gtctagagag 
tccccgcaac 
cagggcaagg 
ctttacaaca 
gtgcttcctt 
aggagacagc 
cggaattgaa 
tacaaaaatt 
ttctttgagg 
aaaccaactc 
ctttacttct 
ttttgcagaa 
agaggacaag 
gctctttaat 
cattaacagg 
gtctgataac 
tgcagactat 
agaaaagagc 
gagtgaagac 
ccaaatggat 
tagcagtgag 
gaggaaagtg 
gaaggtgaag 
agaaaagaaa 
agaagaggac 
tattttctaa 
taaagttaaa 
ttcttgtagc 
gtgaagcagc 
cagcataata 
aaaaatcaac 
caatcttccc 
caatacggct 
atttttcatc 
gagttgtagt 
ttgcagtttc 
gattaactgg 
tgaggttgat 
taaagtatga 
ggtcctttct 
ctctaatttt 
tattagcctg 
tatttaattt 
cacctccacc 
tccccgaaga 
cattttgttc 
caacatataa 
tcatctcttc 
taaatttgtt 
cattaactgg 
ctcttcttgt 
caatgatgtt 



aacggtggtg 
tctcagaagg 
ctcaagaaca 
tgtacagatg 
ctatacaagg 
tttgtcaacc 
caaaatcttc 
acagatatgc 
agagcttgtg 
gcaaatctca 
cacaattctc 
atgaacaagc 
acagacgatt 
tctgcctctc 
ttcaatatta 
gaagttactt 
aaaatgaaga 
gaattctcta 
tgcaacaact 
gtgtgtgtct 
ttgggtgaag 
acatatgatg 
gcactcgtta 
agagggtaca 
gctatccagt 
tatacgtctc 
aatgcagatg 
agcaagaaaa 
agggttttgg 
aatgaagagg 
gaagatgttg 
caggaagaag 
gaggaacagg 
gagaaagcag 
atatcatgtg 
cattgttttt 
ttgctcggca 
catgataccc 
aatcaggttc 
gtctactatt 
ctgaacaccc 
agcttcatta 
agttacttga 
cttgtccgct 
gatgatggaa 
aacaaacaga 
attgtccttg 
tgggattcta 
tccatcatcc 
ttgcatttgt 
cctctgttga 
gtccaatatt 
gccgatggcc 
agcatagcgt 
agctgttatc 
tccatcgacc 
gttgctgaca 
ctgaagaaat 
aacaaccgct 
tccaccacct 
ttgtacacct 



gtgcatcaac 
aaaatgttcg 
ttatcaactt 
aaaatatctt 
ctattgtttg 
ttattcttcc 
cttcaaacaa 
gctatgacat 
agtgcagaaa 
cccctagcca 
acaggagtaa 
tcaagaagga 
ttctcctgca 
ttcatcacat 
actttgcaaa 
cctcacaaac 
gaggggatga 
agaaggttag 
gcaacgttaa 
acatccacaa 
aattgaccaa 
atattatccg 
aattctgtca 
tgtgcatggt 
atcacgaatc 
gcaataacaa 
tagccaagac 
ataagaacaa 
atggcatgga 
aggaggaaga 
atagcaataa 
atgtggagat 
aggcgaagga 
tagcatctga 
tataaaatgt 
tatttgacac 
gcaataagag 
ctcgatatat 
ttccaatccc 
gacctggcaa 
attacaattt 
tttaaccatt 
gatacaatat 
tgctcttgga 
atacagttga 
ttctgtgcag 
gtgtgtttcc 
ctgtatttct 
ctaatgtaa t 
tcattgtgaa 
cgctgctgct 
gaacgtaatt 
gctgcagctt 
aaagatccat 
ctttccagta 
ataatatgag 
agcatcatca 
agttcgatct 
tgtgtttctt 
actatattat 
ttaccagatt 



ttacaagaga 
tctcatggcg 
tctgaatagt 
cgatgaggat 
ctccaaccca 
ccgtcttagg 
tggatcggtg 
ccctccatat 
actgtgcaag 
gaaggcagga 
cgctcacgac 
ttcaaagaag 
agataggaac 
tttctgtcct 
caacaaactc 
ggtcaacgcc 
tattatagat 
taagattgtg 
ctcttctaat 
cattattcct 
gcttgttaag 
caattacgaa 
caggaattat 
gagcaacctt 
actatgctct 
ttctgaagat 
tatggcctct 
aacttcaaat 
tgatgatgat 
ggaggaaatt 
gaagacactg 
ggaggaagtg 
ggaagacgct 
tgaagatgaa 
agtattttaa 
atgtacatca 
caagagaaga 
tattaataag 
tagaagccca 
cagccactat 
ttgatttttc 
ccaattttac 
tattccattg 
aactttgcat 
gccattgcat 
gagaaaagtt 
atgggatagg 
ttagtgtttc 
tcaaaagttt 
cctttatatc 
gttgttgttg 
ctacaggcac 
cttcactagc 
acccttcatt 
tttcttcaat 
acacgtgctc 
ttctagatat 
ctgtgaacgt 
gttgttgctg 
tactgcctct 
ctgcaattac 



tttaggtgtg 
atgccaaagg 
gcacttaact 
caagccgagc 
gccaatgtgt 
aacccaattg 
agaacaaaga 
gcaaagggaa 
gacgtgaggt 
agggaggtgg 
tttactttct 
aaggtaaaga 
gcttttgatc 
gatgtgctta 
gagtgctata 
aaggaagcac 
gttgtgaaga 
agaaggttta 
ggagatgtgg 
gtgctcgaag 
gagtgtagag 
attactgtaa 
aatgtggcag 
gtcgggtatt 
ttgcactcta 
ggaggaggaa 
ttctatgacc 
gagatcctta 
gatgaagata 
gtaaagaaac 
ccaaaggaac 
aaggaagcag 
actgagtatg 
gatgatgaag 
aggtataaat 
tcaatttcac 
gtcatataaa 
gggagttgct 
cacattgggc 
acctagctca 
tgaattctct 
acttgaaaca 
agataaaagt 
gacgatagcc 
gtcagtaagg 
caagtcttct 
tttctgtgcc 
tgaaatgttt 
agatgtgaat 
ctcgggaaca 
ttgagtagta 
ccttctacca 
ttcatcattt 
tccagtatca 
ggcctttcta 
gcacacttct 
aatagtagca 
aggctcgaaa 
ctgttgttgt 
gagttggtcg 
ttccattttt 



163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400 
164460 
164520 
164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
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gagtcttcgc ctgaaattgg tacagaagga atcttgggtg acaacctttc tagagttgtg 166680 
ggtaatccag tatcatactg aagaactttc caaataaatt cgaatccagt tcctgccact 166740 
gtgcccactt tactggggaa aatatcgatc ctccctgtag atagaggact aacacattcc 166800 
ttatagatcc tcattgggct taaatgttgt gtcgatttag tttttgaggc ttcataaagc 166860 
acagcagacg agagaagggg ttcaaagaat acagtttcca tgcccaattc ctttaccgaa 166920 
tatttgactt gtttagatgc atgaacagga agatttgtgg gctgagaagt cgtaccaggg 166980 
aggaatgcgc taacatgttc aattttccca tcaaccgaaa aacttgtccg tacaccatca 167040 
acgaatgaag aagaagaaga agaaagggat ccagtgtcgt cttgatggtt attaatatca 167100 
ccaatttcca tatcctttat accatcatcg cgttcagtat tctttacaac tttaaagtac 167160 
gacaccaatt cacctttatt tttcaatcca aggaatccat cttctgcgat attctgacca 167220 
tatgtcacat ttcttgttag agatttaggt ggtacataat ttaagaacga ttttgaaatg 167280 
gtttcagaag aaaatggagc agtggcagtt gtttgctgct gttgttgttt ttgttcctta 167340 
gaggatatta gaggtacatt aatgaaaagt tccagatcct tattcatctt taaaaaccgt 1674 00 
cctgttggta caggaagatt aactacccta ataggggttt ctgatgaata tctggaggag 1674 60 
gatgatggag aagaagcgac tgaagtggca gactttaagt tttcggctac attatctcca 167520 
ttcaaaatac tctgaggtgt atcggtcagt gtatagtatc tgagagacgt atcattttct 167580 
acaataaagg gaggatctct agggaaagca aggcacacgc ctccattatc ggctgccgaa 167640 
tgaataactg cataatgtgt aacaatcttt gcaattagat ccatcacggg cttcttgaat 167700 
aataaaaatt tgtgttgatc aggatcggct gtatatttta gtagcacttc acgtactata 167760 
taatctgtta gttgtctagc ctcctttaca gcccatacgc ccgatttcaa aacgcaatca 167820 
aaaaatggaa gaagaacaac ttgaagtacc tgctggtgca tgtacatcca agcggtctca 167880 
tccactttca ctggctgtcc tctcataata tttcttgtca ttgtttctaa acacgaccga 167940 
taaagaccca ccacgcagtt aatgggatcg taccatctgt actgccgtcc ataacttctt 168000 
cttccggcca gtataggctc ccctgggaat agtatggaat cgcagttaga ttctcttcca 168060 
atattgtata tggatgcaat ctctcgagca atatctccta ccgtaatagc agatccgccc 168120 
gagaaaagtc ccacaacagc atcagagaca tttgagaatt ccctcccttc tagatttaca 168180 
ttattataat atatcagaga tgctattata gcagcagaag attccccaga atcgtctctc 168240 
agaattaatt ttaacgtagg aatatcaaca tctggtataa aggctgtagt attattttga 168300 
ccgaatgtaa gaaaccttcc agaatcgtgt atttccgaca cgttactttt tctgtcgaca 168360 
acgagtgttt tattatgttc agcgtcgtta agagaggtta aaaaaaggtg catttctatc 168420 
cagtctgttt tctccgatat acagtccatg attgaagtgt attgaggccc aaagaagcaa 168480 
gtaactgggt ttttcatgga aatggaagta ggtgcccctg aagggacaat gttggtatcg 168540 
gtagaagttt tttgcctctt tgtacgtttt ctattttcat cgtattcctt agaaagggtg 168600 
gaagatacat ctcctaaaga ttttccttta gacgataaaa tcgattgtgc gattttttcc 168660 
ttttgttcgg tcgtcaatga ggcggttgag gaggaggagg aggctgctgt tgctgttgct 168720 
gttggttcat aaagtttggg ttgaggttta tttttaaatg atagttgata ataccgtgta 168780 
accaaatcct ttaaagaatt ggtatttttt ccaaaaaagc actgggccgt aaagtagtgc 168840 
ataccaaatt cagtcgagtt aaattttgac aactctggta tgataatgga agaaactgac 168 900 
ttggagtttg aagccatgtc atttaaatca cgtagccttg taagagaggg atgttgatag 168960 
agaattttct ttgcatttgg gttcttactg gtgtcaggaa atgctatctg gaggggggtg 169020 
tttatccaca ccttcacatc cattagacga aatatcacag ggtgttgtaa taatttccct 169080 
gtgaatctag aggtcgtaag aacaatatca agtatgccgt gatgggcata aaacacacca 169140 
taggcgtctt tcaatactat tgaggaagaa agtctttcca tttcccattc ttcttgattc 169200 
agatgggata tattgacaag ttcttccatg ccgtcgtttc ctttaagatc tagccgtatt 169260 
tctttctcat catgcattgt ccagtcctta atattaatga caacaggcaa aagccctgcc 169320 
aatctctcta aagacgaagc agtttcttcc atgttcagct tgatgttgtt aatttcgcct 169380 
tcgataaacc tctcattttt acgagaataa catcctcctc ctcctcgagg aaatttttgt 169440 
tctgtcagag atgaaaatcg tcgtgcttct cgtgctcttt tttccactaa tgcatttaga 169500 
gattttgtag atatttcttc aatagtcatc tctttattcg tctctataat aaacacagac 169560 
gggtgtattt ttacccttat ttttgcattt tctgtttcac ttcctaaacg cccctcggtg 169620 
aggtttgcaa gggtagaatc ataacacgca aaagttcccg cattaacaag cggtagagaa 169680 
atttcgctat cgccatcttg ttgttgttgt tgttgctgta gcccttcagc caataggcaa 169740 
gctaaacggc tattgtcttc gttgttctta ccaggcaggg gccgttctat tgtttgcgtt 169800 
tgcgttgttg acataatttc ctcctcgacg tttctcgtca gcctgtaatt cgagtagagt 169860 
ttgtttcctt atacgggtgt gaaccaatag tcggtaaaac accaatccac cggttacgat 169920 
gatggatatc aacaatatat agaatagggc cttgtcacta gccttgcagc attgtctctt 169980 
tctgcagctg cagtcaaatc tttgtagtct gggcctaatt cgctctgtac gtggttccgt 170040 
attctctcag acacgtccat gttgttgggt ggataaagta taaatactca ggtgtacata 170100 
aaaacctctc acatgatggc aggaatcggt agaagagata atagacccgt attacatctg 170160 
gacattgatc caaataagga aataccctat aacgttccac caacccctat tatctgtgaa 170220 
aaaaacccct tcgtgtttaa catgcagaag tgctcagact gtgctccttt ccctccctac 170280 
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cccggcactg 
caaaaggaaa 
aagctgagag 
ttcccttgcg 
ccaccccgta 
gagcttaacc 
ttggatgttg 
cgagctggcc 
aaccaacttt 
caataggaac 
ccgtgatgat 
aaccaacacc 
aaatgagaag 
attgtgacca 
tttcagctgg 
ccaagaaaca 
atgtggctaa 
ctggaggaga 
agggagaaaa 
aaaatcttgt 
ataaaagaat 
atgatacttt 
tttccataca 
acttgggtta 
ttcttgtgtt 
ggtacttctg 
ttcttgatat 
acatatgaca 
ggaagactat 
tcttcctctt 
gtaggcggac 
acattggcca 
tctggagtgt 
ttcttcttat 
atgcacttca 
gccaatccac 
tcgagagatg 
tccttttata 
tctggtcttg 
cgagattctt 
ttatgcaagg 
tcaccttgtt 
gcacggtcct 
tttacagaat 
cgacactgga 
ttcttgtttc 
atactttctc 
cattaattgt 
gtggaccaga 
atgttatcct 
tagacatcga 
tgggtaagaa 
gtgacgaaga 
aaaaaggtgc 
cggtagaaaa 
gaatggtgct 
aatctttctt 
atccttcgct 
agaaaaacat 
tccacttgaa 
ataacccaga 



agaagccttt 
ttgaggagct 
atatccccag 
ttgacactac 
gtgacgttag 
atatcgttgg 
atgacgttgt 
tgtgcaagtt 
ttatataagc 
catggcctca 
ggaaagagat 
tgttagcgcc 
acttgaagat 
tagcgatgac 
cctgtgctct 
ggtgttcctt 
aactgaagaa 
tggagatagt 
tggatctctt 
agatggtgga 
acataattta 
taggtgaagg 
catgattctc 
tacgtggagc 
tcatctcctt 
gcacgtattc 
tgaagatggg 
cgtaatcacc 
tgtcagaaga 
tatttgtact 
tagcataggt 
gatcgtaagt 
tgaaccggac 
tcttattctt 
cgagcatgtt 
tcaggaaagt 
tttcttcctc 
ccgtcacttg 
ggattggggt 
cccttgatgc 
acaccagaat 
cgactgttat 
cttcaggtaa 
gacagtaatg 
gaaaggttga 
tggagtttga 
ccattccttc 
gtttcaggga 
cgtcgagatg 
aggaaatgtc 
tttcaaggac 
aacagcaaag 
catatacgcc 
caccgaggaa 
ggctatgcta 
ttctaagcga 
cagtgttttg 
tacaaaattg 
cagtgtggct 
ctttattcga 
aacgttcccc 



ccctccatac 
tctggttgac 
aacctaccct 
cggtcacagc 
gcacggttac 
taagctcact 
gattaatctg 
ctcaaaaaac 
ctaatattat 
tcatcatcat 
gaagaaaata 
gcctgggtgc 
ttttcttcag 
gatgacgatg 
tttgtgaggg 
cttacaagcg 
ggcgcagcaa 
ggtattgctg 
ttggaatcta 
gaagaaacca 
tataatattt 
aataaaggta 
ccacatgtta 
attgttgttg 
caattcctgg 
gtagacatgt 
agaaggtggg 
gccattgctt 
aaatgacgac 
tgttggttgt 
tggaccagaa 
ggtagatttg 
gtggctgatg 
ctcccacggt 
catggccagt 
gacgaggata 
gactaccaga 
gttagaaaaa 
tctgatactc 
agtttctatt 
attgaactcg 
acatactcct 
gaaaactgca 
tttatctata 
aggaaaatct 
ctttttagga 
agcgggattt 
ggctataaaa 
acgagacatg 
gactacactt 
aatatcacag 
aaaataaaga 
accaaggaat 
aacgccatcg 
tcagaaagtg 
gacacgtccc 
atatttgaaa 
tgtgtgatta 
aacgtgtcta 
ctcgggaaat 
ggtatgtttt 



cctggtactg 
caatctttcc 
ctcgaatttc 
gatatcccct 
cactacttaa 
gaaaagcaag 
tcaagcactt 
tagacggtaa 
aaatcaacaa 
cccctgttgc 
cactgtccct 
ctgttgatga 
atgaagaaga 
acgaggagga 
gtttctttgg 
cagccgttgc 
ccatggaaga 
ctgatgttgt 
ttgcaacaac 
cagaattgta 
tattgtttcc 
ggagaaggtg 
ttattgttag 
ttgttactac 
atacacttct 
tcttccacat 
tttgaaggaa 
ctaggtggag 
gagcagctcg 
tgttgaggtg 
aatgttacca 
agcatgggtt 
aaactggatt 
gagcaagaaa 
gcgagcccaa 
atgttgtcgt 
cgagatatga 
acgatagtgt 
tgcccagcct 
ccatccaaga 
tgaacgcttg 
tcaatagaat 
aatattgtat 
atgtcaccag 
ttccgcatat 
ttatttgttg 
tccatcttac 
caaaactctt 
gtgtgcttgt 
tctgtactac 
accaaaacat 
aggaagatgc 
tcgaacagac 
cggccgcagc 
aaggaaaatc 
agaaacagtt 
ctggatcagt 
aagccacgac 
tagtgaatac 
tcttcgaaag 
tcaagctgcg 



cagtagaaga 
ctcccccatt 
ccgagaagaa 
tcatcgatct 
tcaaccccaa 
aaaacctgaa 
tgaaggaact 
aggtatgacc 
ctaacaaacg 
cctctcctct 
caggaacagg 
agaggatgaa 
cgatgataat 
tccttcatgc 
cttcctcagg 
tgctattttt 
aaattcaaca 
ctctctcgct 
actcatcaag 
aaatgttttt 
tcttaaatga 
tggaagacac 
gtgggaattt 
tgctcctggt 
ggatagatgg 
tatcatcctc 
gcttcagatg 
atggcggcgg 
acattggcat 
tattggcaac 
gttttgtgac 
cagagatgtc 
cggtgttgct 
tgcaagtagt 
tagtcatgga 
tggtgataag 
tgctctgggg 
agttcacagt 
ccagatatag 
tatcttcacc 
acttgttgat 
cgtatagccc 
taggcacaac 
aaaagtgaaa 
tcttgagttc 
ccagacttac 
ttttgtagtg 
acgcatctgt 
tccaaaaggc 
tgacaataat 
tcagttattg 
acctgaaaca 
aataaaaggt 
tgccgctgcc 
aatggtcatc 
cactgcattg 
gatagttgtc 
tgatattgct 
agtgtccact 
aaattgcatc 
agtgcccgca 



ggaggagaag 
ccctggaaat 
ggagaaggat 
ggagaaaacc 
caaggttggg 
caaattggtg 
tgagaagctg 
aaaccggtat 
cgaaaactaa 
gtcgca teat 
aatgtgaaca 
gatagggagg 
aaatcatgtc 
tttaagggtt 
aagtcactta 
aagactagag 
gatgtgatta 
agtgagggag 
actacaattg 
tggtatacaa 
caatttcctt 
aattgtagtg 
ettaaaegta 
tagggtgttt 
agattgttga 
tccctcctca 
tcgtccgttt 
tgt tggcaga 
atagacactt 
tggcctgggt 
ttgaggggta 
caggtcct ta 
cccgttcctg 
agttctgtcg 
gcagccgact 
attgtccatc 
ccctgcttcg 
atttgtccct 
gtggatagac 
aaataagtcc 
gatgeataat 
ggggttattg 
atctggcacg 
atctatcatc 
ccgcaataac 
agcatcgaaa 
gttttcagtt 
tcattcagta 
cgttctaggc 
tgtgtcagtc 
aacaagaaat 
aaggaaaata 
ctacagacaa 
actgetgetg 
aacagagcta 
aagaacaggg 
gggcttcaag 
gatattctac 
tttaatagat 
tcttacagtt 
aagcctctct 



70340 
70400 
70460 
70520 
70580 
70640 
70700 
70760 
70820 
70880 
70940 
71000 
71060 
71120 
71180 
71240 
71300 
71360 
71420 
71480 
71540 
71600 
71660 
71720 
71780 
71840 
71900 
71960 
72020 
72080 
72140 
72200 
72260 
72320 
72380 
72440 
72500 
72560 
72620 
72680 
72740 
72800 
72860 
72920 
72980 
73040 
73100 
73160 
73220 
73280 
73340 
73400 
73460 
73520 
73580 
73640 
73700 
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tgcctggaga 
atcccaattt 
ggaaaattac 
aattactatt 
aagcacaaag 
acatgttgct 
tggtggacca 
tggacgctgc 
aaaaaataag 
gacgtgctat 
atgaccaata 
tttctgaaaa 
tagatgaaaa 
tagtggagga 
aggaaaaaat 
acaaaatgag 
atgcaaaact 
cgtaccgatt 
ttggtaatga 
atgctctaac 
gaaatgcata 
ataaaggtaa 
tgtgcaacgc 
gtacaagttg 
gagacaagaa 
tggtagattc 
agaacaatag 
caacaggtgc 
caagcaggag 
ctactcaaaa 
ccactgaaac 
ttctctgtca 
aagaatgcct 
tgtgagacgg 
gaggttacaa 
gcgtcttctc 
aatccaaaat 
ggagatttgt 
aaactggata 
agagagcacc 
gaaaaggcaa 
aaagaagaag 
gttttaacgg 
atatatcctt 
gaaactacat 
tgcaatgaaa 
gagtacgtgg 
cacaaaacat 
tttctaactg 
ccaaacttga 
gaacttgtaa 
agagatataa 
gtgaatctcc 
aaaatgtctg 
gatgtagctg 
gctctttctg 
atcatcactg 
aaaatgtctg 
gatgaaatta 
tatagcccgc 
agcattgttt 



gactataggg 
taaaatgtct 
tgtgctcgga 
tggtttattc 
agtcagagaa 
ccactcctac 
acaacatatt 
catgtctgca 
atcacaacaa 
agacacactt 
ttacggtaaa 
tacaaacagt 
cgaatacaag 
atgtgaatca 
accaacgcct 
tattaacaat 
acaggaaaaa 
tttgaataat 
tgtatatagt 
atacaacgaa 
ctttagtact 
caatagaaaa 
gtgtggtgtc 
cgatcaaaat 
gattaaacgt 
tttatcctcc 
atgcacccct 
tccaaaggtg 
agggatggaa 
taaaaccaat 
tgaagcacag 
acttgtggaa 
tccatgactg 
acagagattg 
aattctcttg 
tagaaagagc 
tggaggaaga 
ttgtgaataa 
atgtggacac 
tactgaggaa 
agaaatggat 
aagatgcgaa 
gcacacatga 
tgtcaagaaa 
ctcctcaggt 
gtatgaatgc 
ctacttgtgc 
acaggtatgt 
atcctgtact 
gggcaattta 
agacggcatt 
atcaagctga 
taaatcgcat 
catctttaat 
actcatttga 
ttgctctcta 
gagaaagaac 
ttgctcgcgt 
gaaccatcat 
tcgaggaaga 
gtggatatta 



gaatactaca 
gactggttga 
gaaggagaga 
cattacttta 
aaatacggca 
ccgtacgtca 
tctgaagtgg 
aatttagtgc 
ctttttggac 
tctttcgatc 
gagagatgtg 
atgatgaata 
gataaacttg 
aagggtttta 
tatgttttat 
aatagtaact 
caccgtgtaa 
tacatctcaa 
ttattacatt 
agggcccttt 
ctatttgaaa 
cgtaaatttt 
aatttgaaca 
agtaccagtt 
tttaaagcag 
tcttcatctc 
agtgatttta 
ggcttagtgt 
gatcgtccca 
ttgaataaac 
atattcaaca 
aggaggaaca 
ggtggtatca 
tggcgctgct 
taaccaacag 
agccaatgat 
actccctgaa 
gcgagccgcg 
tcttgcccag 
gttggactct 
attgga tata 
aaaagaggat 
ccctatgccc 
cgaattgaga 
tttgagtagg 
agggagggca 
atgtctttac 
tatagactac 
tcattttaaa 
tagtccagat 
aacttt tagg 
aaatctacca 
tcaagaccag 
attggacgaa 
gaaaatcaag 
tggagcacct 
atctttaaac 
tggacaagaa 
ggaaga tatt 
aaataaggtg 
tacaatggac 



caaaggttgc 
ggataaaaac 
gtgggtgcgg 
tggacaacaa 
tcccgcatct 
aaccatcggc 
ataggacata 
cttcaaaaga 
atttgtgtaa 
ctataaacca 
acccgttttc 
gtcgcatttc 
atcatattgt 
ttgcctcccc 
tagcgagagc 
atttgtcggg 
ctttagccag 
cagacattgc 
taatgacaaa 
caagtaatga 
aatccattat 
ctcgaatcgg 
agggtagtga 
acatagagaa 
ctgcaaccca 
cttcttcttc 
tagattatgt 
ttaaaatgtg 
cagccaacta 
tattagtttc 
agattattgg 
aagacaataa 
aagacagatt 
tgcgagaaca 
tcgggaagat 
cttggccaca 
tcatttttgt 
tactacgaca 
ggtttagata 
atacttttac 
acacaggagg 
caatctctta 
ctgagggcta 
gaattagccc 
caacatgatg 
gaatccattt 
tcggaatatt 
gacaaatgga 
aaggcagaag 
aataaaggtt 
ggttcacacg 
tccaattcat 
tacacgtaat 
tacctcaaga 
ggagaaatcc 
ccaaaacctt 
gacaaatatc 
aataatcgca 
acagggagtt 
catataggca 
acatctattt 



aatgatgcgc 
tgcattaaca 
tgatgtttct 
cattaaaatg 
agaatggtac 
cgagcaagtg 
ctatggaaca 
agagagtatc 
accttcaaaa 
agacaggtgg 
tgttgcacgt 
ttgtcaaggg 
ggatttgtgt 
atttttgagg 
ctgtaatcaa 
ttcaagtagg 
gttgaacacg 
acctgatttt 
cttgcctaaa 
aagtacatat 
aaataaccaa 
acaagaaaag 
tgaaatcata 
tgcattatct 
tccgccagtg 
ttcttctcag 
gtacaaattc 
tgatattctt 
tagaacctcc 
tgctatcaag 
tagtgaaaag 
tgtcttcgac 
gtgaggtgtt 
cgtactctgt 
gtgccaggag 
ttataggtat 
ggtttatcaa 
cgatgcatct 
aacggatggc 
aaattgataa 
ctggcaccga 
gcgttagtga 

gagggtttat 

ttaaggaact 
tatctacgcg 
ttagcgaccc 
taacagggcc 
aaaggactgg 
ctgtgtgtaa 
tcttgtgtgc 
aaccgtctct 
ttggtgtaaa 
ttgataaaaa 
agactgcttc 
aatcacctga 
cagcttcggc 
tatcggataa 
agagagccga 
tgtccggtgc 
tcatgaataa 
cttccgaacc 



gatagtaagg 
tttaaagttg 
gtcgtatcca 
tcccccaaag 
ttgtacattg 
aaaagggcga 
aagaacagta 
tccttcatta 
gaaactactc 
tggaataaaa 
ttagtgtctg 
aaatggtggc 
acagaagaaa 
aagcaccaga 
aaaaatggta 
gcgaagagga 
atgatggcgt 
gccaagttat 
tcccgtggac 
aaaacacctg 
gaaactgcta 
agctcttttc 
aagggtattt 
gacattaaca 
aagcaagaat 
acgtctaaca 
actgacgaaa 
gcatccttag 
ttacattcag 
gaaacaggag 
ggactatcaa 
tgatttgtct 
tgatgtacac 
tgacggaaag 
tgtttatagt 
catcaagaaa 
tcacaatgga 
aagcataggg 
ctcatcattg 
agttaaatat 
agaggacaat 
aattgtggat 
ccagaaaaaa 
tttccctgaa 
tgaagattta 
tgattctgga 
tgcgtgtaag 
aagacctgaa 
atcgacaaat 
gcctgtagct 
cattgtcgag 
ctggccctat 
aagatcgggg 
agccgttctg 
ggaggctgcg 
tgtggcctct 
tgtcctattg 
ccaggcagct 
gtacaggcaa 
caaaacgcct 
tctttctcta 
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acagattttc 
tcaaaggtcg 
aatatggata 
ccagattcta 
gttaaaacag 
acacagctaa 
gaggcactca 
tttaaggaag 
atgtctggaa 
ttcgggctct 
ttccctacca 
ctacttagaa 
agcgaaagag 
gatgcagtag 
ctaagtgcgc 
ttgtggaata 
agaggtgttt 
aatcggaacc 
aatccaataa 
agcggagcga 
ctgaagctat 
gccttcttcc 
atcagcctgt 
ttgtggagaa 
aatcccccaa 
tggttggtgt 
acagcatgat 
agattaatat 
acacattctt 
aggattagaa 
tatatttgta 
cttggaggtg 
gcacacgtca 
agtgttcgac 
gtcgttgatc 
ctcggcccca 
agcagaaaca 
aggagtattg 
gactcgcatc 
aatcattata 
tgcaacgtcc 
cttcctttac 
aaagggtttg 
atctaatttc 
ccctactcct 
cgtcttctga 
ttgctagaga 
aaggtgtgga 
tgtcatttga 
cgtccattgc 
ggtacattaa 
tgtatttctc 
tagctttctt 
aattgactag 
agcctgcact 
tattcttgct 
atgtacggtc 
tcttgtcctt 
taccatcttc 
tgggagggac 
aatcttttga 



aaaaccccac 

actctgctag 

taaagtcatc 

cacctaaccc 

cagttaatga 

cctttatgag 

tgtccgatac 

atgcatcaaa 

accccagaca 

gtttcatgtt 

ttccccattc 

aatacggttc 

ccaatgcagt 

caaatctcac 

agagatcgtg 

ggtttgtgtc 

tgaggaggaa 

acccagtttt 

acacaccaag 

ccccactgtt 

tgctgaaatt 

cattgaacgc 

gcgcaagatg 

atataaggac 

agaatttgac 

aattaagagg 

catgaacagt 

ttgtgtgttt 

tagggatgat 

aaataaaaga 

caattcccac 

aatccaatgt 

atgagggtaa 

atagtatttc 

aagagatcga 

gtgccagcgg 

tccttcatgt 

taggacctct 

atctgatcat 

accatgataa 

aggtttgtta 

gtgttggttg 

tttttaaaat 

agtaacgata 

cccttcacct 

attggatgaa 

attgacgcta 

agattcttgg 

aaaagaaaac 

tgttgctgct 

gtttcgcata 

agatgcttga 

taaaagtttg 

tgtatttaat 

gtatgctaca 

gttattgtta 

attactgtcc 

ctctgaaaac 

atcctcttca 

agccatttta 

caaagtgaac 



tgtcattgcc 
gtctacaaga 
aatagattgg 
ttgttcagtt 
tatatacgct 
cgagattgaa 
tagggagagg 
taccaattca 
gcccataaaa 
taaggtaatg 
aacacaatac 
ttcattcgaa 
gaaaaaattt 
cggacctgtg 
ttcaactaaa 
ataataaaaa 
gtggaggaag 
gaagacaaga 
accaaggaag 
cctgatgaga 
ttttcaatgt 
ataaagagca 
ttggttgacc 
gatagcgacg 
ctctcctttt 
gtggccggag 
gttctgggta 
ttgactaact 
gagaaatttg 
tggtataaaa 
tttacttctt 
tcattgtagc 
tttttgaaga 
ccttaatgtt 
cgccagtgtt 
gatcagtcct 
ccttatcgga 
ctccacgaat 
aattagcgac 
cgattagagc 
ggttgccaaa 
gatccagcta 
ttaaccctac 
cctcccacta 
cctcctctcc 
gaattgtaat 
gatttgttat 
tcaatataca 
gttcttttcc 
gctggggtgg 
ggcacaacag 
tttagcttat 
atcactactg 
gcattatcaa 
ccaaaccttc 
ctcactccag 
ccgtatctta 
atgttgccgt 
tcgactgcct 
gaagaacaac 
tcttttggcc 



aatgtgacta 
ttcgacgctt 
gcaaatatgg 
gacactattg 
tctgttggaa 
aaactgcgaa 
gcattcgtag 
aaattgtttg 
ttggtacgcc 
cctccagaat 
catggtctat 
aagtcctggg 
ggtgtaaacg 
tatgttctca 
tttctccgtg 
tggctcagac 
aaaggcaaca 
gttcatccac 
aacaacttct 
aggtcaagca 
ttgggatcgc 
tcactactaa 
acctctatca 
aaaaactgag 
ccgattacgt 
ctattgaaag 
ctgttgtgtc 
ttatttgtac 
taatgagtca 
ttacttgtat 
cttgatttcg 
ttcgaaggct 
tgaagattta 
tcctgcaata 
gttgaggatc 
tggggctgtg 
gagaccattc 
tgacattacc 
gacgcttctt 
aatgattgca 
ttccattttt 
gttagttctt 
atcgaaccgg 
cccctacttt 
ccctttactc 
attttgtgtg 
tcaaaaggtc 
cactgctagc 
taccaacatg 
agggagaaac 
atgtagattt 
attttatgca 
cgcttagcaa 
aaatagggga 
ccccgccttg 
aatacgcgtc 
tatta tcgta 
cttcacttct 
tatcaaattg 
tagaaatact 
aatttaatag 



agcggatgga 
ttgttaatgg 
tagaaaatgt 
tgtccagaga 
aatcttattg 
aggctgcagt 
agttcctatt 
ttcagaataa 
gttctgctga 
tcatgaactg 
atggtacatg 
ctcattttga 
atacgaggat 
ttttagatct 
aaattaagga 
atcaaagatg 
acctttcaca 
atcttctaag 
agaattcgtg 
agaagttgaa 
tcaagacagc 
aattgttatc 
ttttaaggag 
cgtcattctt 
tgatcgcctt 
taaggaattg 
caacattcct 
atttgctaat 
ggtaacaaga 
ttttattcaa 
tccttgatat 
gcgccgtctt 
atgtccttgc 
acagttggcc 
ttcaaagtca 
acggtagaga 
ttcatggcca 
tttgcccttc 
ccaacacgtg 
atggacaaga 
ctttgtttta 
ctgtagcacc 
atgaagtgta 
catctccatc 
gtattggtcg 
cttcattctt 
cataaaagtc 
gctactcatg 
agaagataca 
atcttgattt 
tccgtgctta 
gtacacttca 
gtgtttcctc 
tctgtgttcg 
tgtctggaga 
accgaaaaag 
tactcgacga 
tttctctgcc 
tatactgttt 
atccaaaaga 
tacaattggt 



gagcattttt 
tgttgcgaat 
gatcaaatta 
cgcaagtgta 
tcgtcctgca 
tgtatgtttt 
ttacgttagc 
gctatcttcc 
ggaaacacta 
tatatttaac 
tttaacccct 
ggaaatttta 
agattgtcta 
tgtacgtact 
aaactatctt 
ggaactaaca 
aagaaatcta 
aagaagagca 
aaggatctgg 
gaaaagtccc 
aagttcaaga 
gatgcaatta 
atgcagaatg 
aagagtaaga 
aacaggattc 
ttgcagagta 
tacaacatga 
gatgatttgt 
tacatttcaa 
acaaaacctt 
cgatcacatt 
caaatttgct 
tcgagaagta 
gaacagtaat 
tgttagagtt 
tgacaagatc 
cctttccaag 
tttgaattgg 
tgttgaatac 
ttgcaataat 
gatggaagtt 
atatacccag 
ccttgccgtg 
tctaccccct 
tcttcatcgt 
ttcacttgct 
ctccgtctag 
ttggaattga 
ctgctaggca 
ttagaaggac 
ccctttttat 
atggacaatt 
ttgtccatga 
atgattgatg 
tcaaacatac 
tggtttaatg 
attgcttcct 
tcacttccat 
aagacgtaaa 
agaactttat 
atgttcaaga 
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aaattgttaa ccttcccacg tccatacata 
aaattgcccc gaaacaaatt tctgccccta 
ttgcagattc tttactcatc tgttttagtt 
ttgtttcagt ttgatgtttt tcaaaatcct 
ccactgatgc cttggtcact atgtccgaaa 
cacttgatgt tttaaaggat atgaggttgt 
acgtttgaca tggttgatta tcgtcgtctt 
aatttgcttg agtttgatta ctataaaatg 
ctcctcctcc tcctcctcct ccagcaatag 
cagatgaaga catggaagtt tcatcctcgt 
gagaaggagc aggagtcgat ggtcttattt 
acgcgtatgc tgctgaaaaa agagggttgg 
atttctcatt tgttttcctg aacagctcgg 
ctaaaaatgc cttcatggtt aattctgcag 
tctcggatgg ttttactggg ttttctggat 
attgaaggtt ctttgttgtt tctagaagag 
ccatagtctt gtgtaatctt tcttcggcct 
caaccactgt actcaaaaag gatggattgt 
taggtggagg tggaggggga ggaagaagaa 
gaggtggtgg tggaggtgga ggaggaggag 
tgtctctact ttcttctccc atttcattgt 
gcggtggtgc tatatcgttt agttggtcat 
aaaacattct aggtgctttt ggtgaaggtt 
tcatgattgg agggaacaca actgcatcct 
ttcgaaccct gggcgcattg ttaaattgat 
tgtgccgaga ttcatcttca tattcttcat 
gagattgaac agactcttta gtttctacat 
cctcctcttc ctctccctct tcttcatcct 
cacctgctat tgaagagaag aaataatcca 
ggctggattc aactctatga gagtgcacgc 
cgggttcatt tttttcttgt tctataggtt 
cttcctcttc ctcttcctcc tcctgttctt 
gttctatttg ttggttatca tcttctttgt 
tggcgttcac attttcaagt aaaaagttca 
ctggcgaggg ggtgttcact atacttttcc 
cttgttgtgc cgctagcgcc aataacgcag 
cattgtcatc ttcatattca tctccttctt 
cacccccttc ttctccctct tcatcttcct 
catcatcgta catgtcatta gtgacatctc 
gaggatcttt atttcttcca tttctaactc 
cctcctcctc ctcctcctca tcctcgttca 
agttactata atcaacatta ttattgatat 
caccttcagt tatacgcaga tttgatgccg 
tgtgttaacc gtggaaaatc aaacactaca 
ttacctctat agggtaattc tcagggtgta 
tggtttttgt tataactttc tctaaaatct 
ggaaggggtg gttcagagta agaccaaaaa 
accgtttata ttactatttt agggtactag 
atcccgatgt tgataattgc gcttatcgct 
aattcaaatt acctggatca caatatcaat 
cttcggggaa cattcacacc agaagatata 
ctgagtgttt tagaaagagg gtcaatcatt 
caacctacag taaataattc tcaagagagg 
cagaccaccc cctcatccca atcatcttcc 
gctgatttat gttcctgtat gtgacgagtc 
cgatgaggaa gaaaatatat taaaggagga 
tatcgtagta gggagtatta agatgatgaa 
gctctactcc ctccttgcaa gtgcgatagg 
acgtctatac aattcctttt cttccatgac 
gatttattgt ccatcctaca acaataaaag 
gctgatccat gcctccaggg aattgttatc 



gtttgttgat gtgtttctct atgattgtag 181320 
tagaacctaa ccccatttct tccattatgt 181380 
tagatgaggt agacaaatct gatacatcat 181440 
cttgggcttt gttggtgtct ccgaacagtg 181500 
gcatattgac taatctagac caagaaacag 181560 
ttaggaacgc cagtctttga ccaagcttat 181620 
ctttcagtat atttatcagt tccatgtgca 181680 
aagaatgctc caccacctct cctcctcctc 181740 
actcgtcgcc ttcagcacat actccaccag 181800 
ttaattcaga tatactttca acctttttaa 181860 
tgctagctct ttcccccatt atataagttg 181920 
acatgatttg cttcctctga aacaactcca 181980 
cagatttaat gtctgtcacc gcagcagaat 182040 
cgatgatatt tttaaggcgc tccattactt 182100 
cttggaacac aactttcacc agttcagagc 182160 
tttttaccct tgtttttaag attatagact 182220 
ctttcccacc taatgtgttg ttaacctttc 182280 
tgttgttgtt tgaaaataag gactgatttg 182340 
tattggtagg tttgagttga ggaggttgat 182400 
gaggcatcag catcgaatct tggtccatga 182460 
ctggaggaaa agtcacgata gatggaacag 182520 
attgttcctc acctccacta ccaagaatag 182580 
cctttatctg ggggagaata tctgtctctt 182640 
cgcactcttg tatatctgga gggggagatc 182700 
gcatagtact tatttcttca tcctcgagtg 182760 
attcttcctc ttcttcaata tattccgtgt 182820 
gctcttcaga tttttgtgca ggttcctcct 182880 
cttcttcatc ctcttcatta tcattgccgc 182940 
ttccatcatc attacctcct tcactcatag 183000 
tttcattttc tattgcatct tcatccttct 183060 
ctttttcttc ctcttcctcc tcctgttctt 183120 
cctcttcttc atcctcttcc tcttctacgg 183180 
tcttctgttc ttcacctgct gatccgccca 183240 
acatgggaag aacttgccta tttggcccta 183300 
actggttttc aggtataatt gtcgcatctt 183360 
ggtcagcctc atcttcttcc ccttcacctt 183420 
cttcctcgtc ttctagattt ccgtcatatt 183480 
cttcctcctc ttcctcttct tcttcatcat 18354 0 
tcacaaattt actttgtttt ttagaagatg 183600 
ccgtgtcctc atcttcatct tcgtaatcct 183660 
tttcctcctc atcctcttcg tcttcataca 183720 
cgtcctcgtt gtcgatatcc aacactcccg 183780 
cttccatgat gaaattttag tatacaatat 183840 
tttactaagt tcctttcctg aaaaaacgag 183900 
tattctatag tttggtcttg aacgtgcttt 183960 
actaacggca agagcattga cagttgagtg 184020 
taaaatattt tttgattgtg tgaatttcga 184080 
aaaacgttgt cagaaccaat catgaggttt 184140 
gcatttgtta ttgcagctct tcttactgcc 184200 
aaggaactaa atttgacccg ttcccttcaa 184260 
gctcataaca acagaattct cccttccaag 184320 
ttagcagaaa tggacaagta caaaaatgca 184380 
aggaacattt ctactcctca acaacaacaa 184440 
caagttgaac tttaaattct ataagaaaat 184500 
tcgcaccgac gagaaagaag attttaacaa 184560 
agaagttagt ggaggagaaa aggttgtaaa 184620 
acctggttca aaaatcaaaa caaagtacag 184680 
gagtagaaag aaagctgaag aatatattga 184740 
gttgaatgaa ggaggacgtc tggcttcagc 184800 
gataaaaaac aatcgttcta ggcctgtgaa 184860 
tgaaactacg gtgagggagg aaatactacg 184920 
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gaaatctccc 

aatttttttg 

catacagtat 

aaactatgct 

taagacaata 

tttttgtatc 

cttttaaaag 

tgatatttgt 

cattactgtt 

cctctgtagg 

caatattatc 

ctaccaatgg 

atctcgttac 

aaatgatgga 

cttcagaaaa 

ttttaactgc 

tgattagtag 

cgctcgtcta 

caaaaaaatt 

ggtatggtac 

ataatgagga 

ataatgatag 

acggctttgg 

tgttatgtaa 

ctacaaatcc 

ctggttccga 

ctactggcga 

cagttggatc 

aaacaggatc 

aagtcaatca 

ctgtggaagc 

aaatatcttc 

agttttatta 

tcttcctcca 

gaccgctggg 

agattgcacc 

acatgtat tt 

tagggaattt 

agtttagagg 

tcgagccaat 

tactgtttct 

cacccttaat 

ttctgggtca 

ggtggaccgc 

gtacattaac 

gggaagggta 

ttttcaaaaa 

gtcaagttcg 

gagctacaga 

acgcacccag 

ccatctcaaa 

cactcggtct 

accagaaacg 

tttctggagc 

attttattgg 

atccttactg 

agggttcctt 

catctttaaa 

aggggtcttt 

ttgcttggcc 

ctctgattcc 



gcttcttctt 

tttgtacctt 

gaagaagata 

tcaagatttt 

aagattctat 

catttcgtag 

gttattggac 

catgacgttg 

gttcttaacc 

ggaagcaacg 

tttaataata 

atattctggt 

cactgaaaaa 

aaaatataga 

agaaacagac 

cctgagtaat 

agatgatgtg 

ctactcttct 

catgacatca 

tgcttttgta 

tctacacact 

attaacagta 

agggtctgat 

ttgtgacaac 

aactgaaatt 

gcctgtaaaa 

gcacgatatt 

acccattgta 

tcctctggca 

aaagggaggg 

tgcatcactt 

tgttacatgt 

ccactaaatt 

tctcaaaaac 

tcggtctgat 

agaaatgaca 

ctggagactg 

tactctgcct 

gtggaccgct 

gtcagattgc 

ggaacatgta 

ccactagaga 

ctcgagttta 

tgggtcgagc 

ttttacgtgt 

taacctacta 

tttttctggg 

gagggtggac 

aacgtacatt 

aaagggaagg 

aacttttcaa 

aatgtcaagt 

tttgttgctc 

acgttaaaat 

ttctatccat 

aaat ttttca 

gg tgtagctg 

tcttcgcttc 

cctacaggtt 

ttct tcacat 

gtttgcatct 



cttcttcttc 

ctaattgtac 

ttaaccgttt 

ctaaaattaa 

aaaaatcaac 

tcgacgggta 

atggctatca 

ggagtggcat 

acctgagcaa 

ctcgtgggtt 

tatgccgtga 

tgtacaacca 

gataaacctg 

tcggacttgt 

aaattgaaaa 

ggtcatgaca 

aaggacaaaa 

gcaacctcta 

acacagttgg 

ctacccacgg 

tggaacccac 

gaatgtgcta 

gataatagac 

cacaaggact 

caagtagatg 

tcttcccaag 

tcccatgagc 

ttcaagccag 

ttgaactgcc 

atattatata 

tcttcctcgg 

gaaggagaaa 

tattggtctt 

ttttcaaaaa 

gtcaagtttg 

gagctccaga 

acacgtccag 

ccaccttaaa 

gggtcggtct 

accagaaatg 

tttctgggca 

attttactct 

gagggtggac 

caatgtcaga 

ttctggaaca 

tttacactag 

tcgctcgagt 

ctctgggtcg 

aacttttacg 

gtgtaaccta 

aaatttttct 

tcggagggtg 

cagaaacgat 

aagtagggag 

ctaaaggtgt 

aatcatccaa 

ttttcgtgtc 

tactcctctt 

cgacaataag 

tcctgtcgga 

ctctaaatgc 



aacgtcatca 
ttctaaaact 
aaggtataac 
tagattaact 
atttatttgt 
gagatggtct 
catacatttt 
tttttgttgc 
acatggctgt 
ttatgatttt 
cagacgttta 
atgtggtttc 
ttcaatttgt 
ctcctaagaa 
gtgattataa 
gtaacaagag 
gttatgtctt 
aatacgttag 
ggagtaagct 
gatacgtgat 
cctcttcttc 
aaacggaccc 
gtgcaaaaga 
tgcttaaggc 
ttgctgcaaa 
tgacatctgc 
ccgtgaagct 
tttatggtac 
cctgcaccga 
gagatatggt 
actcttcttc 
aaataaagaa 
cttggtactg 
tttttctggg 
gagggtggac 
aacgtacatt 
aaagggtagg 
aacttttcaa 
aatgtcaagt 
acagagctcc 
cggacgcacc 
tcctccatct 
cgctgggtcg 
ttacaccaga 
tgtgtttctg 
ggaattttac 
ttagagggtg 
acccgatgtc 
tgtttctgga 
ctgttttcgc 
gggtcgctcg 
gaccgctggg 
aacacacatt 
aataatacat 
tagaggaaat 
agagaataca 
cctggcgaga 
catagacctg 
tcttcctata 
tgcggcttta 
tttccagatc 



tccacttctt 
gtatgcgatt 
ataatacata 
acatgtatac 
atcatttata 
gtagaaagaa 
cttattgttt 
tggtgtatga 
aggggattat 
aaattttata 
tagaagatgc 
ttcgacttta 
tagaggttta 
cgtgggggaa 
aaagggtaag 
gattataggg 
taagagattg 
aaaattttcc 
cgtgtatcct 
caacaaagca 
tgtgctcgtc 
aacacataga 
agaaggttat 
tcctctaatt 
acgtgtttta 
tgctcatcaa 
atcagatacg 
atctttagta 
caaggctgat 
ggggtatctt 
gtggttgaca 
aattgtgtaa 
tttacactag 
tcgctcgagt 
cgctgggtcg 
aactttatac 
gcatagtcta 
aaatttttct 
tcggagggtg 
agaaacgtac 
cagaaaggtg 
caaaaacttt 
gccgaatgtc 
aacaccagag 
gagactgacg 
tcttcctcca 
gaccgctggg 
agattacacc 
atagacgttt 
tagagaattt 
agtttagggg 
tcgagccaat 
tctggagcag 
aacattatat 
actttcattg 
atctttccag 
cggtacttat 
aataaggcta 
gatgcgttcc 
ctcccactag 
aatttttctc 



cgttttcttc 
ttgtaaaaca 
tatctgaaga 
aaggtataag 
aaattccata 
gatgctgctc 
tccctattga 
caaacggcaa 
ctttctatgt 
aactttgtca 
aagcggccgt 
caagaagcta 
gtccccagaa 
tatattttac 
aaggttggtc 
ccaagggatc 
agcaaagatc 
cctttcagag 
caccctatac 
tacggaatgg 
ccagactcta 
atcggcatct 
gtagaaatgt 
acagagtatt 
ttccctgccc 
ctagacggag 
ggtgactatg 
aatcttccag 
ggaatatatc 
aacgccaacc 
actggtaaca 
tcatatgtgt 
ggaattttac 
ttagagggtg 
agccaatgtc 
tgtttctgga 
ctgttttcac 
gggtcgctcg 
gacctctggg 
attaactcta 
aggctgttaa 
tcaaaaattt 
aagttcggag 
ctccagaaac 
cacccagaaa 
tctcaaaaac 
tcggtctgat 
agaaatagca 
ctggtcatgg 
tactcttcct 
gtggaccgct 
gtcagattac 
tagttagtta 
tttcaattcc 
tag tataaga 
tataacttag 
tattgcttcc 
tgga tgaaat 
gtatcaagtc 
ttttccatat 
tgttaacatt 



184980 
185040 
185100 
185160 
185220 
185280 
185340 
185400 
185460 
185520 
185580 
185640 
185700 
185760 
185820 
185880 
185940 
186000 
186060 
186120 
186180 
186240 
186300 
186360 
186420 
186480 
186540 
186600 
186660 
186720 
186780 
186840 
186900 
186960 
187020 
187080 
187140 
187200 
187260 
187320 
187380 
187440 
187500 
187560 
187620 
187680 
187740 
187800 
187860 
187920 
187980 
188040 
188100 
188160 
188220 
188280 
188340 
188400 
18846Q 
188520 
188580 
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ttccactttg 
agataagaga 
tcttcgtgct 
tgtattttca 
ccatcttgcc 
ggtggctgca 
gacatatttt 
ttcttcgtcc 
tccccctctt 
attgacattt 
tttccagtac 
gatggtaaaa 
taaagaatgc 
attagttttc 
tgaaaatcgg 
aacttgtcta 
cttgttacat 
ccctaaaatt 
agtcatatcc 
tgtgacactt 
tgtcaaagga 
gatggctgct 
taccttgggg 
tatagttgga 
ggacccttca 
taggtgagga 
agattgcttc 
ttttgctcgt 
tgaatcaatt 
agctgctgat 
tggtgcttgt 
ccaagagggg 
gacggcatct 
cgttttattg 
ttcaggtgta 
tcgtttttca 
catgattttt 
aatgtcttcg 
aatttcaaat 
ggtgtcgact 
aacaggctcg 
cttgtcaccc 
attgccagta 
aaacttaaaa 
atctttggct 
aagagtggag 
tacaaccctt 
gcctgcaccg 
tttttcaaat 
gcctctgaaa 
agtatcctcc 
gatactatcg 
tctagaatgc 
agccgacata 
cgtatgattt 
tgtgatggaa 
tctacgttta 
ccggttctag 
aagagggtgg 
gttaaaggta 
cgtcaagcag 



acataattat 
ggtaagggcc 
ctaacagcat 
tctgttggaa 
aatagtaaat 
gccgcgtaca 
gcagcttcaa 
tctctttttt 
aatagacccg 
tctgctcctt 
gtctctagtt 
gtgtctagag 
atatctttca 
tgtgttagcc 
gcagctgagt 
ttattgacaa 
ctcttgtatt 
ccgtctatcg 
gtagttttaa 
tcggatggca 
tttggtttag 
gataatagta 
gggtagaatc 
acaacataag 
ataaagtctt 
ggcatggcag 
catcttagaa 
cccattgcca 
acccatgaag 
ctaaacacac 
ttcaaaaaag 
tcaattatat 
ttagataggt 
aaccacgtgg 
ttttgtaaat 
tcggttttag 
gtggtgctgg 
gttaaaggga 
atacaatgga 
atattctgga 
gtgccatgga 
atattatccg 
ttgcgggtct 
cgtataatca 
ctctagatcc 
gggaaaattt 
gcaagattga 
gtctcaagga 
gtatcaacca 
catgcatgag 
tcggctgcct 
agtatgctga 
ctatggtaga 
ggcgagtatc 
actctgcgat 
agcacttttt 
acgtactaaa 
acattatttc 
cgtacgacat 
acaacaaggc 
aaaaatatat 



tcttcaacat 
tccctggtac 
tccttaaagc 
catttcccca 
atactgctct 
acaggtagaa 
acacataccc 
gtttcatttt 
aatctgcaga 
tattattacc 
cctgttctcc 
attttgttat 
aaaacacttg 
ctccagagca 
cgggatcaag 
gaaaatactc 
cacgttgaag 
cctttggaag 
ttcgttctct 
tctgcagatg 
cgtttctggt 
ccatgatttc 
cttcctgtat 
attgtccatc 
ttatcctttt 
caaagggatc 
tatcacaaag 
gatcggccac 
gttcattagt 
cgtccataat 
taaagtcaca 
gctctctaac 
gtcctttggc 
gtccgaatac 
ttgaaatggc 
tttctgctac 
taaagtagtt 
tatttatttg 
gtacatggaa 
cgaaaactct 
cgccgaggag 
tctggcccac 
tttcatcaac 
attcaaggca 
tctgagtatg 
ggacaaatct 
cgatat tgtc 
agccatccgc 
taataggttc 
atactctcag 
cgacagggat 
ttcagataac 
cagagccgtt 
tggagacagc 
tgaaattttg 
gagctatgtt 
agcctaccgt 
cggaggatgg 
ctctcaagtc 
tagcaacgtt 
cccgtttgga 



gttatccact 
aaaactagtg 
ccaaaagctt 
taagttatgt 
gtcgaaattg 
agaaagggca 
tttcttatcc 
tccttcaaaa 
gagtatctca 
accaccacca 
catagacggt 
gaaaccagat 
aagaattaac 
taaaaataat 
aacgcgggta 
caaagtttct 
agcttgaagg 
atatgggtgg 
ttccctctcg 
gggattgttc 
cctggacagc 
cctccctaaa 
tacagaaacc 
ttcttcactg 
ttcaatcatg 
taaacccctc 
agtatatcct 
cgttgaattg 
atcacctcta 
attctcaaag 
tagttcttcc 
accttctata 
taatcttagt 
gtcagaagaa 
agcaactagc 
taatgccaga 
tttgtacacg 
ttggtactgt 
gaaggagaca 
gcttgtgtag 
gcacagtacg 
tgctccgaat 
aacatctttg 
gagtcacaaa 
ctttctagct 
ttgggtgtgc 
cttctggaaa 
caggaacaac 
aattttggaa 
gacgaacgcg 
gatccagctc 
gcttgggttt 
attgctcact 
ttcaagcagt 
ttctgtgata 
aatgccatgg 
tcatgggtgg 
aagaagaact 
atcaatcatc 
acatctggcc 
atacttgaaa 



aacatcttgg 
tttgacgcat 
gcaaaggata 
gtaggaaact 
ttcgttattg 
gcaaaccctt 
actaggtgca 
cctatatcac 
accatcaact 
cctaaagcat 
ttactttgtt 
tcacaacgct 
ctgcttttca 
gcattccata 
acagagtagt 
ggtgaaacaa 
gcggtaggtg 
tccctaaaga 
acccattctt 
acatatttat 
actgccctgt 
agagcttcat 
atgttggtta 
actgtaactc 
tcagcagcgc 
tctttcattc 
agaaatagag 
tttctgttgg 
gtggaataaa 
gcattcaaaa 
gctgcagttt 
tctctgtaga 
cccgaattaa 
ataacaacct 
ccttccgata 
acatagtcct 
ctgcaatatc 
tttcgtcaag 
tcgctgaaag 
ttaatgtgaa 
cacaggacat 
ccaacaagat 
acaacaattc 
acaagtctag 
tcatgggttc 
tctttgaggt 
tgtgcccatc 
ccatggaagc 
gcgacataaa 
cagtcgttgt 
atactctttc 
cttccctgtt 
tttacgtgta 
ttgtctatac 
ctgaaaattc 
ttaacgtatc 
tggatcaagc 
acccctcacc 
ttgcatcacc 
tggatagtat 
ataaagcagg 



atgaaaatgc 
caatcacatt 
ataatgatga 
tcaaatctcc 
aaggattgga 
tagatacatg 
tgttacgcac 
cacctaaccc 
cccctaaccc 
cgagtactgc 
cctttaaact 
ggcagacgaa 
ctgcagagct 
gcacatacat 
tggagaacat 
tatcgttaaa 
tgcgtttagg 
gattctcagc 
tttccctttc 
ccagtttagt 
actctgcgtc 
ggaaagaacc 
aaacagtcga 
catcactaaa 
gtccgtactc 
tagaggcgat 
caatggctct 
ccaacatttc 
gaactagatt 
ggttcaataa 
cggtcaatct 
cactcctcat 
aaaattctgt 
tcctttgttt 
cataatttag 
ttgactctgt 
ttcgacctta 
gctgaaaaat 
gcgctcagaa 
gagtatccgt 
ttccgcccaa 
taaggatacc 
aacaaagaac 
cgttctcaat 
tgatccagca 
gcttcaaaat 
caagtgcgcc 
aatgttgttg 
gtcagcatac 
gcctctgagg 
ctctttcggg 
tgcagccgtc 
cacaatgctc 
cgtatttgtt 
gtctgtagaa 
cgtgctgggt 
atccgtcgca 
tgaccacatc 
ttctagaatg 
caggtctgtt 
gtatggtgta 



188640 
188700 
188760 
188820 
188880 
188940 
189000 
189060 
189120 
189180 
189240 
189300 
189360 
189420 
189480 
189540 
189600 
189660 
189720 
189780 
189840 
189900 
189960 
190020 
190080 
190140 
190200 
190260 
190320 
190380 
190440 
190500 
190560 
190620 
190680 
190740 
190800 
190860 
190920 
190980 
191040 
191100 
191160 
191220 
191280 
191340 
191400 
191460 
191520 
191580 
191640 
191700 
191760 
191820 
191880 
191940 
192000 
192060 
192120 
192180 
192240 
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ataaacattg ccaagcacaa catcagccgt ccagcaagag aacaatccaa cggaaggaac 192300 
tttaattgca acgctttaca cattctacct tcaattaagg gctgtgaggc acttggggca 192360 
caaaagggga gcgcagatca aactgtcaat gtttttgata attttgtcgc atctcatatg 192420 
gatattgcca tgaaaaagca ggggtcgggg aagattcttg gactgctcac tagcatgatt 1924 80 
gacaggcaag gtctgactac ttcattccct agtagtgaag cggaatacaa gaagagaatc 192540 
catgatttca caagatacgt catcttctct tcaacaccca tcaacgacga actagtcaat 192600 
tctcgctgta ttcttcccca ttctaatgtt ctgaactccc ctatcagctt gagaaatatt 192660 
gacccagaat cagtccccga cactcgattc cactttctgc tcatgatgtg gcagcggcca 192720 
aatatcgatg aacctaatct ttctgctcta actacaagtc agctagaatt gttgcttagc 192780 
aagaaccaaa aatgggacaa actcaccacc agagcgttct tcaatatcga caggatcaat 192840 
ttccagatgg cagacgctat cattaagaac gtttctggaa gcggcttcct agatgggagt 192900 
aaaactgcct cttcttcttc ctcagcgcct aactttttcc aaatcttcag tggtgctgaa 192960 
tgcactgcaa agcagctcca aagtattcgc aaattcattg gagaatctat gcagcatgta 193020 
caaaaggaat ggagtagtgc agtaaacaat gggaacagag gagtagaaaa ttatgacgga 193080 
ctcaatgctc agttctctga agaactgttc gagctgctct acaaattgat catcgaggag 19314 0 
gatatgcggc catccagcct gatcgcctca tctgaattct tgagcaacta cgtcaacgcc 193200 
atggatgaac ttcttatcag agctaatgct tcttagatag ggtttttatg aaataaaaca 193260 
aatataaatt atatatcatg caattttatt tatacctttt tttgtatgtg atacaatatt 193320 
tttagtgtta tgacagtgca accattatat tattatctgt aataggtctt ggtaaatttt 193380 
taatgaaagg aagattaggt gcacaatcta ttgtcataaa aatgtccagg tcaatttctt 193440 
caacaccttc ctcaaagtta tagaagaaac ttgtttcatc atttttatta gcaataaaca 193500 
agtacgtttt ttccttagct gtgcttgcag agcttatcca atagtctgat gcctttttaa 193560 
cattataaca gttggggagt aaactacaaa aacagtcctc tgttttgttt gtagcgtata 193620 
cccgtacacg gttaccttct actacaaatg aaaactctat ccgggctttt tcaatacgtc 193680 
ctcccacagc tgctctacct tgtttttgta gcacgggttc taatccaact tgtagagaat 193740 
ccagattcgt tgctgcacta ataacattaa atgtcctctg tttcatgttg acatgtacag 193800 
acattcgtcc ttgccaagtg cgcatatcac gatcactgat aaaacctcct cgtatattat 193860 
ttatcaccaa ttcctttgag taccgggcaa ggtaatcagg tgtaggttga ggaggttgtt 193920 
gttgtctagc tagttgtgaa gaaggtgttg ttgtcaacat caccatttct gcgtctgggt 193980 
ctgttgatct ctgtaggctt tcatccccat tttcatcaga aggatcaact gttactgtac 194040 
caaacaataa aatattggtc aagtctatta cgagttgttc gtaagtgata ttggtagatt 194100 
tttgtgcaac aacttcatcc atcgtttcct tgacttcgtc atggttcgtc cctatgagat 194160 
cgaacaagtt ttcatcagca attgcctgtc cgacatcggg gacgtttatg tcaccttcag 194220 
ggattgaaac ttttccatct gttattttat cctttattaa ccatacgttt tctgatttgt 194280 
atgtgtcaaa ttcctgatct gctaatttgg tgaagaattc atcctcagct ttttcaagag 19434 0 
ttcttgcaaa atgctctctc gtttgacgtt gaagcgctgt ctcattgata acatcaactg 194400 
ttcctccccc tcctcctcca ttattattat tatcgtcgtc agttttttca tcatcgtcag 194460 
tgtatttaga gtctaattta tatttcttac tcttttttgt tttggcattt gaatgtagag 194520 
ttcccttccc tacgcccgag atagcaaaga agagggcaat agaagctcct attaccaaaa 194580 
caataaccgc aaaaacaatg tatattcctc cattgtttga aagtgccatt tttaaaccct 194640 
tcaattctta tctaaaaaac aaacagattg aaatctatgt ataaatcatc actcgcgttt 194700 
tcgatttcag gcatgtgaat tgtaatgaga gaggccgaat cctcggatgg gttgattatg 194760 
ccatttgtat catgactaat atatatagga tcatttctct ttagataatc acctatgtta 194820 
tcagaagaat cactatttct tagatttgat aatatcacta cactttttgg tctgtttaca 194880 
ataccagcga acgtggattt gcggttaaat tctaacgtga gagttttggc tgaagaatct 19494 0 
attttgaacg ataaacttaa atcaattaaa tccccttgag cagctccagg gggagtgatt 195000 
gataacatgt gtttatcgtt tattggtaca ctaaatgaag ttgaagagtc ggtaaaggca 195060 
atagtattgt acttgaaaga aaatgatgta tgataaaaac caggtatata tcttctccac 195120 
cgctcctctt ttaaaatttc agttatgaaa tcttccctgg gttccatgat ccgtctattc 195180 
acatcgtata ctttttccat aaccactggt attcctttct caaaaattgc ttcacagata 19524 0 
acaggattga aagcgtcttc gtaaagatgt tcaaaagtca ttacattagt taaatggcta 195300 
ttgatttctt gccttacttg atcctctata tattttatag aggcattttt tgctaccgaa 195360 
taggcaattg cacctaagcg atactttaat tggtcactct ttcttgtgat ccaagaattg 195420 
tcatggtgag cagctcccaa ttgagcctcc actgaagcga tttcagcttc agtcacagca 1954 80 
ggctcatctc cgttagacga cattatttac tcctccttaa aagcagtgat tgcaaacaat 19554 0 
atatcgtccc ttgtatcgcc tctttttata cgtttaacag cgtatgttgc agattctccg 195600 
tttagtatat gggtgtgttt atatctagat ttacatacaa tatcaatatc tttactcttc 195660 
ccttcaacaa tcacattaaa gaatatgact acgcgttctc ctgttcgatt aaatgcttca 195720 
aagcttagca cgtcattaac tacgtcacta cgagacacaa gaatcactgc atctacatcc 195780 
tcatcattta tggaacgaaa caattttgcc gttattttaa aatctcccaa gggaatggag 195840 
aaatcgtacc ttgtaaattc tccatcgtca ccaccaattc tcaaattgta cgtagcaaca 195900 
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ctacaccgcc 
attgggggcc 
gttattattt 
tttatgtctt 
ccatgtctta 
aattcagata 
ctctcatcgc 
catcaacagg 
gtttagaagt 
ttaacgtaga 
tcctggaact 
attacctcct 
gtagagggag 
tacccaaggg 
aatttctcac 
caaagaacat 
tggttcatta 
agagctaaat 
aacatgggaa 
ctcttgtggg 
tacaccatac 
gtttttaatg 
atgtgtaaat 
cacaaatgtt 
acaacatcca 
ggcctattct 
gtggtcttaa 
gcccatgcgc 
ttaatctggc 
ggtttcttac 
gtgatagtag 
ctcattgaag 
tttacgtctc 
agaacggatg 
aagtcattcc 
tttccaaatt 
tccaaaagcc 
tcacaaggaa 
taactgttgg 
atgcctctaa 
aattggcccg 
gccttagcga 
gacaccaatt 
cctccaccac 
aaaacactta 
gtattttttg 
atacacatgc 
ttctttaagc 
ccatcagaag 
gatgcgtcat 
gcagcttctg 
atactgtcaa 
tcgacagcta 
gaaagatcat 
gcggatgtat 
tcccccaggt 
tcggaaaacc 
gatgtattcc 
ttgtcttcaa 
cattcatcta 
tcatcaataa 



ctacttgacc 
gataccaatg 
gtctaacatc 
ctaggcgaga 
cagtgttctt 
gattagaaac 
ttatagttgc 
tgtaaaaaac 
attttcaacc 
cctttcttcg 
ggaactgggg 
ctatttttgg 
ttttctgtgt 
tagccattgc 
gtggtcaata 
gttttacaca 
caatgtactt 
tggtagagtg 
cagatagtga 
atgaagagga 
aggctaacat 
aggaaagtct 
ggtggcaaga 
ttgttagtga 
tatcaggatt 
ttgtggcagg 
caaagggcat 
cttgtctaga 
acatttgaag 
ggcgcaattg 
cagaagaaga 
tgtacaaatg 
tcaattctag 
atacgtcact 
atgctttctt 
tgatacatct 
cgataatttt 
aaacaataag 
acatcataat 
tcatgacgct 
gaagcatatg 
tttcggtaat 
gccttctttt 
tagagttaga 
aaaagtttgc 
atgaattaga 
gaataaagga 
aagaaaggac 
taaatccagg 
aacgttcgta 
ggtcctttat 
tattaatatt 
caaagtgttg 
cagaaatttt 
taccatcaat 
attttttaat 
tttcagtggc 
atttcaaaag 
ttttcatatt 
tatcttcttt 
tggattcaaa 



agtttcttgt 
tacaagattt 
tattccttgt 
tctcacatcc 
caacaattcg 
accacgtctt 
gccattagac 
atacaaaaga 
ttttattgtt 
tcttcttctt 
ctcggtaact 
tatcctgagt 
tacccttata 
tcaagtgatg 
aaggaaacac 
gtttgggcac 
gacaatatca 
tctaagagca 
agaaacttga 
caaaaacaca 
tacagcaaca 
cctcttacat 
atacatcaaa 
agaaccaact 
tggtaaactc 
agagtgcatc 
ttctacaaat 
acgagcaata 
aggtgatggt 
ttgttcctca 
agaagattct 
gaaaaaggtg 
gaccttgttt 
aattctatcc 
gtgatgatta 
ttccatcatt 
cttaacatcc 
tcttttagac 
aggagacaca 
gttttggggt 
tttggttata 
aatggaagaa 
aggcgtatcg 
agcaaaggaa 
aatatcactg 
ggaagagtcg 
tccagaattc 
attttcgcag 
aaaactatat 
catgttcata 
tatggacgat 
cgacagagta 
tttcttggga 
tccagtgtaa 
aactgacttt 
ttcgagcatg 
agggttgcag 
gccattgttt 
ctgcttattt 
acaatcgttc 
tgattcgtta 



tgaatttttt 
tgcgttagtt 
ttttcaattg 
agttctgcat 
tttatgtgtt 
tctacaattt 
atgctgttgc 
aataataaaa 
ctggggcgaa 
actcctactt 
ggagaagttg 
acaatttttg 
gcctgtttca 
ctccaccccg 
aaaaaatatt 
agataatcag 
gaatggtcag 
tgcaacggtt 
ttctcatccg 
gggtatttca 
taatcaacac 
tcagtatact 
tccctaacat 
cctattaatt 
ataccaatac 
ttgatgttaa 
aaaactgaat 
ttcatagcca 
aatttattgc 
gcatcacaac 
ccgtaatttg 
ttagtattct 
ctgatcaagt 
ttctttttgt 
aattcgtgtt 
aaagatgaat 
tcattcttat 
atagcttcgt 
gtaatgctgc 
gaagtcatat 
gaactggaca 
gatccgtcac 
acattgaaac 
aattgtgaac 
ctgttatcgt 
tccttttcgt 
ttagccgaca 
tcagaagtgg 
tttgacggtt 
aagtccagat 
aacttgcgag 
gggcccagaa 
gggggaagta 
acgcaatggt 
ttaacgcccc 
tcctgggcag 
actaaaacag 
ttctgtatta 
ttcttgttct 
caggaatgaa 
tcgcatattt 



cgtctatatc 
ttcgtttagc 
cctttaaaac 
ccttcatgta 
tggttatggg 
tatcaaggat 
tacttgcttt 
atactttagg 
gattatatac 
ctttttctcc 
tgttggtggt 
gttttggttt 
ggaacaaggg 
tgcttttccc 
catcaattct 
ggt taagtct 
atttttgaga 
tattcacaac 
caatatcctc 
gattactaac 
cagttttatc 
ggagagccat 
cca ttgaacc 
ccccttcctt 
aattgtcaaa 
aatctctccc 
tgagcatgct 
ttgtattcaa 
acatggactc 
tattgtcttc 
ttgcagcctt 
cctctaaacg 
ttttcattaa 
tgtcactgga 
gacgctgttt 
ggatatactt 
tctcaaagac 
cgacaaagaa 
ttt ttgcgcc 
tct ttaataa 
catcattctt 
tgttatgttt 
aagcctcttt 
tttgttgcct 
ctgttttagg 
cagaaagcaa 
tcttatcctt 
aagttgacag 
tcttaaccac 
ctctatccac 
aaatagtctt 
gagccttctc 
aaactgggta 
acgcaatcaa 
cgttagtcga 
acaaagatga 
gcgaaccgtc 
tgttcataat 
tgctcataaa 
gattattaac 
tggatccaaa 



tgtgcaagat 
caagtaatca 
acaatccttt 
aactctgtct 
gtgtatcaac 
gaggatcaac 
tggtatacaa 
ggaagaaaat 
attcaatctt 
tcctttctgg 
agtgacagat 
taattgtact 
taggtgtcta 
ccttatatac 
tttatttgat 
caaaccccat 
agcattagga 
cattgccaga 
tgtatttgcc 
catcaattgg 
actctttagt 
ttcaaacaac 
ccaggcatca 
atcatatcct 
attgatggct 
aaagaatctt 
atttagttga 
agatttagga 
gtaatttgct 
ttctacttca 
agcacaaaga 
ggaaactgat 
aactgatgga 
atgcccacta 
tattccatgc 
cattcctcta 
aaatgttgta 
tgagaagatg 
ctgttggaaa 
actggaggga 
ttgggggatg 
ttggagtgca 
attactgtct 
attttctcca 
taaagaagac 
atttttatcc 
ctcaatacgc 
aaccactgaa 
atcgaataaa 
taatagtttt 
gatgttacga 
ggcagcatca 
gtttgttttt 
tacaccaggg 
caggcaatta 
cgtgatcatc 
gcgatcaaat 
cgcatcaact 
tttctcattc 
ctggcagttt 
tttacctaat 
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atgcacttca aggcagtttt tacttcggat ttagagaatt tgccttcata ttcgataggt 199620 
ttcttacagt gcctgtacac agcatcggtt tcgttaagcc aaggagaatc tgtgcagtct 199680 
ttaatagaga aaggcatggc gcctctatgg agcatattat acacaatatt tttctcttta 199740 
aaggtagttg aaacaacacc agaagaacaa ttccaccagt gagcttttct ggcaaagaaa 199800 
gaaatattga taggatcctt tatgcggaca ccattttcgt catttttacc gtcgtttctc 199860 
tcaatttcct cacattctac cccttcatct tcaatttcgc cacctctttt acgttttttt 199920 
gtatcattac aagcattttt cctattcttt tcgtgtttga gttcagaaat aaaatcgtac 199980 
tcttggtcgc atacttttag ttttgattcg gtcattaaac ggttaatatt actaaaaacg 200040 
atcttagagt ttcctccgtc accgtacgcg tttcttgctt cattgatgtc aacactattc 200100 
atttctcccc cactgccgcc aaaatcgaca ttaccacttc tggaagcaga ctcgtcctta 200160 
tcgtcttcgc tcgatgactt ctttgagcca aatgaacctt ggtggccaac ttcgtcaata 200220 
atcacgatac cctttttgcc agcaaaggaa gccaaatttg cttgagtaga tgaagaagaa 200280 
tccccagcat tattgagagc cgtaacgtta atatttgtaa acctttcgaa ccaagtcctc 200340 
ttacacaaat cgacaagagt ggtctttaaa ttgtttccct ttgttcccac aaatacaaca 200400 
gcagtttttg tggcaatatt caaggcaacg gaacaaagag tttgaaccat gtggtgggct 200460 
ttctctccat ctcctgaagc aatggctgtg cacatacctc caaagttcat taggccacgt 200520 
aaaagataca cgacatccat tctgggcatg ccagtttttt ggtcaacaca tgagatgtaa 200580 
cattttctta gcgcatgaag ggctgtagtt gtagaattaa catcatcctt ttgcttcctc 200640 
ttcttattac tactactact actactacta tgttttgaag aagaagagga ggaggaggag 200700 
gaggaggagg aggaagaagc cgtttcacaa tcgctagtgg ctttagcaag ttcgtataaa 200760 
attgataaaa atatataatc ttcatcagca ataaaaattg atggaggttt tagagaagaa 200820 
ctgaatgctg tttttgatac aggttcgtac tttacaacgc tagtattttc cgtgttttct 200880 
tcgttctcat cctcatcttc ttcattttcg tcatcatcat cctcattttc atcggtttct 200940 
tcctgatctt cagccataga ccaaaagttt ttgttcatat tactgacaga gtcggtattt 201000 
ttcatataag atacgtgttc gattaaacaa agtcctctac ccattgtagt gccttccatg 201060 
gacgaagcta gagcagcaga aataaattcg gttgagaata aagaattcaa aataacgtgt 201120 
ttttctgcct ttaatttaca ccttgtatac atgtccgtga ccaaatttat cccttcaagg 201180 
cgtcctccta gaagataagt accatcaatc aaaaagtacc tgttattatc aacgctaata 20124 0 
caggcagtgt tttcagctaa cctgaaggga attgctgtac cattactaat aaacgttcta 201300 
gcatttccca ttttttcaac ttcagccata attttagaaa tgagagcagt tcctgcattt 201360 
gtaatacaaa gacgtatatt tttggctctt ttgagcatat tcttggcagg acacactgga 201420 
ttagctaaaa atgagcattt gtatgcttgt tccgaaaaaa tattctgctt gttgtagtct 201480 
gacaaaaaat cgggacagta tctatcgtgg aacaactcgc cttgttctga ctcattgtca 201540 
aacattgaag gagttaatcg ctcgctgaaa catcccccgt ttctaatttg gagagtagac 201600 
tcgatcgcct tcttgttatt cttacccttt gatgatgaaa caatacaggc ccaatcagta 201660 
aagaatatgc gggcaaatac tcctcctgtg cctttgttca ataagaaatc gcaacacttc 201720 
ttaaagattt ttgttcctct aggagcgaca aatttaccct gcacagggga gttaaagaag 201780 
gaattatcca agtagaaacc cttatctgtt cctgctaaaa agttaaaatt atccttctta 201840 
atggccagac tagattcgtc cagacttttc ccattaatga cagaccgcat agaaattaac 201900 
atcctcttag catccattga atcttcttgc agaaggcaag ttttatctgt accaccttga 201960 
tcgtcatcat catcttcatc gtctgcgcag ccagtatcaa tgccagcata tagaccagaa 202020 
tggtccttat tgtaattgat gacacgaagc acgcgcttat ggccatcaga gttggtactt 202080 
aaaaacacgg cagttttaag gatactgaca atttcaattt ctgatgaaac aaagttggga 202140 
catttcttag tttcttcttt accaatgaaa cagtaatcat tgtcttgttg ccgataacaa 202200 
gatttgttga aaaaatctcc tcctcgtctt atatttgtcc ctggtgtaaa caacactgaa 202260 
caactattag tgatccaaga cgtacagaaa ttaccctcgg gatctaaaca tgaagacaaa 202320 
acttcacggg cgtaaacagc cttagccaaa ctagctgcag aaattttttc gggagaagga 202380 
tcagaggaat tattgtcttt agatttagca gagaaagtgc ggtgcagatt cataagaggg 202440 
aaagtgtcct tcagtagttg gccatagtct tccacactca ggaaaccaga atggtatact 202500 
gaaagtggag ggtttgaatc ggtaacgcca catggagcat tattctttgt catttcaaag 202560 
tggctattct ttgaactatt gggatgaata catccccagt tacacaaaac gctaatgtga 202620 
cgctcgtaca atttaatcca caaaatgttt agatttttaa acagcaattt tttaagcaaa 202680 
actatctggt tcttagggga cataccattg ttacagttcc acatcttacc caaactttta 202740 
taggtgttga tattggatat tttttggtgg gtaaagtatt gctgggagaa agtctggtca 202800 
aatgaagatt ggttagaaga gcccaagata acgtacgcgt ctttagaatc gttaacaatc 202860 
ttaatgtttt ctaatttttc agtggaagta gttagagaat catcataaag agacaataaa 202920 
tcttcatcat aacccatagg aacattcttt aaagagtctt ttttctccaa atcttgaatt 202980 
gaaacatatt tcaccgatga attactactg ctattagagt tattgttaaa gttcttccag 203040 
ggtctctttc caaacttctt ttttgacttg tcctcatctc ccagggccat attcttaatg 203100 
taacaaattg gcataacttg agaccagtac gagtcagaaa tatgattgga gaaatagcgt 203160 
gtcaggacct ttgtttcagc aaaacctttc cccttgtact ggataaaagt tctcaaggga 203220 
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cggccaacta 
ttttccgaag 
tcatgtttca 
gagagaaggt 
acagaagaca 
gtcaaatcgc 
ctttcaatgg 
tcttttttcg 
acaatgtcat 
ccatcgtacg 
tggctgaaac 
gaatcatcat 
cgtcccctat 
ttaatctctt 
tgaccggcag 
acagcagcga 
ctagtctctt 
acatcattaa 
tctgccgctt 
tcatacatat 
ttcttaacta 
gaaaatctat 
gacatgagaa 
ataacatatg 
ccttctttta 
cacttgctag 
aaaggagcag 
accattgcta 
atccactctg 
caaatttgat 
ttgttatttg 
tctttgagag 
ttcaaaagac 
gccctcgcgc 
tttgacacac 
ccccaatcaa 
agtctagaag 
tcagaatcat 
ttaaagcact 
tctaaacaca 
actttagttg 
tgattcgcac 
acaccatcac 
atgagtttgt 
agacattttc 
gacctaatat 
ctaaagtatc 
agatgtttag 
ttactactaa 
gggatattgt 
tctcgtacag 
atttctttca 
aaatttttct 
ctggatctaa 
ttattaatac 
cctagaaaac 
cccttggcag 
tcgttactgg 
cttagatatt 
aaaacagctt 
gcgcattcat 



ttctaccctc 
aattgacaac 
attggtcttc 
gacctgatga 
tgtaatcgcc 
ggcactcttt 
catccattcc 
agaaaggaac 
tatttatctg 
aaaacaagcc 
ccgtgcggga 
tgtaaaaacc 
tacttccacc 
ctagagaagg 
gaatgttgtg 
gttgggcagt 
gcagttcttg 
cgtccaaaat 
ttactaaaac 
cattgctttt 
gtgaaagggg 
aggtgataaa 
tctttcgggt 
cggcaaaacg 
tgctagcgtc 
tgattttaaa 
ttgcaatttc 
gcacacaaga 
cccgactgcg 
gcctaacatc 
ttagcaagtt 
caaactcata 
attcccacgt 
taacgcccag 
ctgcctttaa 
taggaaggct 
ctggactatt 
tacttgtttt 
cgtatatgtt 
tctgttgacg 
gttttagagg 
acaatgccca 
ctcccgatga 
tatcttcact 
ctgccaattt 
taccattaac 
tcctttcaca 
aactaggaa t 
ggggagaatc 
tgttgctgtt 
gcatagtaaa 
aaatatcacc 
ccatttcctc 
agttgaatgt 
tactagcatt 
cataacaatc 
ctccataatt 
ccacattgtt 
cttgtttttg 
ccttatcaga 
acattggtcc 



atggcctgta 
tttcttgtag 
agaaatttca 
ggatgatcca 
aggacatatg 
tcttatcttg 
agcacttaac 
acaaatctta 
attttgtgtg 
actttctctg 
tctcgagtcc 
acccgcgtta 
ctttcctgtc 
atcgataaat 
gaaagaaatg 
tgtacaggaa 
gcgagtcttg 
acaacaatcc 
aagccttgca 
ttcgctattt 
cctagaaaga 
ttgcaaccaa 
catttcttca 
gtaaaagtgg 
ggcctttttg 
gcatcgaatg 
tcgcacactt 
gagggacatt 
caaagtagac 
attgctaggg 
gcttgaattt 
ggccttggcg 
atcagcatcc 
aaattctgat 
atttagcatg 
gttattaaag 
ggttccagaa 
tccattcaaa 
aagttttgaa 
tttttcttta 
atcaacaact 
aataccagaa 
acatgccaat 
gatcacattt 
acactgtttt 
tttgtccttt 
tgccattgtg 
tgggagtagt 
gctcttgaag 
agcggttccg 
aggcactacc 
agattcagct 
cacaatgaca 
tttgatgaaa 
gtcagatggt 
atcttcatca 
ctcctcatcg 
caggatctgg 
ttgatagagt 
cttgacgctg 
aaaagacttg 



tttgccaaaa 
aagttaaacc 
ttggtctggt 
cccccacacc 
atgttcatat 
ttagtttctt 
gcagtttcag 
aaaaagttct 
atcatgttct 
tcttcttcgt 
tcatcaacta 
ttaaaacctc 
atttgttttg 
gcttcttcgt 
ttagtgttga 
ttactaacca 
gcgcggaact 
aacaaatcat 
atacgtttca 
ttgtcggcca 
agcctgcttc 
ttactagtat 
atgacaatat 
agaggatctt 
ggagaatctc 
agtggtacca 
tctccctgca 
tctgtcttga 
ttgtttcgtg 
gtgggtttta 
atgaaatttt 
acctgcgctt 
aaatttgtgt 
agctcaaata 
tcactaccgt 
cactccctaa 
ccgtatcttg 
gctgtacgca 
acgttatggt 
agttccttct 
gtagaaccca 
aagaaaccat 
ctggctaaac 
tcatagatac 
atggcatccc 
ttcacaaatc 
ttgaaacgga 
tttatgcact 
ttattaagag 
tcttcactat 
caattcctca 
gcagaattgt 
gttgaaacat 
ttaacatccc 
acattttcat 
tcaccaccat 
tcactctctt 
cgagcagaaa 
ttctccctct 
gcaaaaagtc 
tcagattcat 



cactgccctt 
aaaacgagca 
tagcttcatt 
caccatttcc 
cataaacgct 
cctcaactgc 
tgacaccgta 
caatgtcaga 
taattcggtt 
ttctggcgtc 
cggatgatga 
ctccgccaga 
caccactatt 
tagcgtccat 
tgattctaga 
gattgtagtt 
cgtccaaaga 
ttgaagaagc 
ttaccttccc 
aagaggaaac 
gccgtcccat 
caaatccatt 
cattacttgc 
cttcaatatc 
ccttgcctgt 
ttttatccac 
tcatgatacg 
gaacctcttc 
gataatattc 
tcttcaattt 
caacagatct 
tccagtcctt 
ccagatcttt 
cggcgccaga 
cccaatcctt 
cagtctctag 
ttatcaaatt 
tgatacattc 
ccttgaaata 
cagaaagcga 
gcacaacaga 
tattaacact 
ttctaggtct 
ccaaaggtaa 
acctcttatt 
ctaaagctgc 
gagctacagt 
tttcgttaga 
aagcacaaga 
tgacattagg 
ttctcaaaga 
ccaatgttag 
acttttggcg 
taaacacatc 
catcactaga 
aataatcacc 
catcactttc 
cgtcgtcgtt 
tctccgactt 
gttggaaagt 
cataatcaaa 



tccttgcttc 
atcacttcct 
attattatta 
tccctcgcca 
tccagtatca 
cttattccac 
caaagcatac 
aattactttc 
aagtacatag 
catgtgtatc 
tccccgacta 
tttctttgat 
atcctccttg 
tactttagct 
ggacttttct 
agtatcgatt 
tttaacactg 
acgcacagag 
aacacgtttt 
actaaccaaa 
taggacgtga 
aaaatcaaac 
aacttgtcca 
gtaaaatcta 
acagctgcac 
agatagacgc 
atacatagcg 
tgttgcagcc 
gtcttcaacc 
cttaattacg 
aatgtcgttg 
attgtcgtca 
accaaaaaat 
ttcgtccaca 
aactaaactt 
aattgtccac 
tgaaaagaac 
ttccatcatt 
atttgctgca 
ttgattaaac 
ctctaaatct 
cttatctgat 
aaacgtgtta 
tgctgaagat 
atctaaaata 
agtaacatca 
cagatctttc 
catggtattg 
acaaaaggga 
cttgacaatt 
tcttctatta 
tacatcacac 
atgacataaa 
atcttgtacg 
accaaattcg 
tgttgcacct 
ttcatcgctt 
gtcacatttt 
cttgtctgcc 
ctccatgggg 
cgttttaaat 
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gtatcgttgt 
ctgtttgtct 
cagacagatt 
gcgtcgcaca 
atttgctgat 
gcaagagtaa 
gttgacgtga 
tcagatgacg 
gatgatgttg 
ggaatactct 
cacatttctg 
tgtagaatgt 
tgaaggatgg 
ctgataatgt 
ttctggggaa 
aaaccgtgtt 
atgctattat 
ttgatgttgt 
ttatcttctt 
tcctttaata 
ttgtaacaca 
ggcatttctg 
atagaaccca 
cacatccact 
ttgctgatgg 
gcagtgtcgc 
atcttgaaac 
atacattcac 
tctgtgtctg 
ttgagcccca 
gctgttcttc 
ctagaagttt 
gttccaacgt 
ttcagcttgt 
cctttcttct 
tttccatcgt 
agattgttgg 
atttgtttga 
ccttcgttga 
agacagatgc 
tctcgcacat 
atattgacga 
tgaatgtagg 
atatcaatta 
atccagtgtc 
ctggtgccaa 
atactcctca 
cagctgccgc 
ctttatccgc 
tcaataacct 
ccacgaccac 
ctagagccct 
ttcgtgctgc 
tgttcaattc 
gtgatcgtgt 
aaaatctttt 
cagttcaaca 
caaacctcga 
aagcaggagg 
cctcagttga 
cgtcccttga 



tcaattcgta 
tgttcttgat 
gcttttcaag 
tgtgtctaaa 
agtactcttc 
cgactgtcca 
tgatggacaa 
aagtggatgt 
atgcagcagc 
ccaattcttc 
caaatttgcc 
ccgtatctct 
cgaaaggcat 
tctttgtttc 
gaagaaactc 
ttctcttgaa 
agacattggc 
tctcgtctcc 
cctcgtcgtc 
gggtagcatt 
tgccaattgt 
gcccaagttc 
tgaggttttc 
cttcggctct 
agagaatgtt 
tcactttaca 
aggcaacagt 
tcagtaattc 
aaacagctgc 
tcaaagaggc 
cacaacccag 
cttcctcata 
ggggtccttc 
ccagaaattg 
tggattttgc 
ttcctctgtt 
ggttgacact 
ccagagggtt 
gaaggagaag 
gtagagtcgt 
cactttcaag 
gtgcagtgtg 
ataaaataac 
cagctacaac 
ggactgtgag 
ctagtgattt 
gacagaacag 
tgctgctgcc 
catttctgaa 
taaccctgat 
atacgattca 
ccccaccggg 
gtcagaaatt 
gtggaatctt 
catcaagaga 
aaaggaattg 
gaaatcagca 
aaaattagta 
tctttttagt 
acatacttct 
tttagattca 



gaccattgtg 
cctctccatg 
cacacttttg 
cttctcaaaa 
tggatgagaa 
cgcgaatatg 
gagcttcttg 
tgcaccttct 
agcagcagca 
aattgtagga 
ctttgatagt 
gggcaatttc 
gattgtgtca 
tggtt tcacg 
cattgccaga 
gaaacacaca 
agcaagggga 
agaaccgccg 
tccatctcca 
tgtcttggtc 
acagaaatct 
agaaaaaagg 
gagatataga 
atctgccttc 
cttcttttct 
atgtccggga 
tttcttgatc 
aagttgtacc 
aagacgttcc 
attggtgatg 
aaactctctg 
aagcctcatc 
ttcatccatg 
atcaccagta 
aacatccttt 
cactgtgtca 
gccgtctccg 
gccattcttg 
aggtagacgt 
tggttcctct 
aaacttgcct 
ctggttataa 
gtaacttttc 
tggtaagggg 
gcagagagct 
tatagaacag 
gaacaggcgg 
gctgctccta 
gatggaaaat 
gaaaaaatgg 
gaaaatgtga 
ggcacgtgtt 
atcgattcta 
attcccattt 
tcaatgatcc 
cgcaggagaa 
gtaaatactg 
gatttttgca 
gcgtcagcaa 
aatgatttct 
gcaacaaaca 



ctttcatcat 
gaattttcgt 
ggtacatata 
cctccttcaa 
gtgatgttgt 
gtgtcgttct 
atattttcca 
tctagactag 
gcttcttctt 
ggttttagtg 
gcatcccatg 
aggtttctgt 
aacttgatgg 
cgttcaacac 
agaggagagt 
atctggtcaa 
gggatttgtt 
ccagatttcc 
gaatcgtcca 
aataacaagc 
tgatccataa 
atctttcttc 
cactttttct 
acgacaccca 
cccaccaact 
agattccagt 
atactttcaa 
tttaagattg 
tcttcaagga 
aaggcaacct 
aaggcaggag 
ataataataa 
cgcctatttt 
atttctggaa 
aaa'gttgcag 
tattcaacaa 
ccgactctat 
gtcaagttat 
ctgttgtcgt 
ttcatctggt 
cctcctaggt 
gttttgtttt 
ttgtgtcgtt 
gttgtccgac 
tgtgagggcg 
aaatcatgga 
cggcgcaaca 
cgcagtacag 
tggagaggtc 
ctcaacgtgt 
accctggtag 
tagcccccaa 
ttgcatcaac 
ttatgagtaa 
gtaattgttt 
aggttaacgc 
cccttgcagc 
aattgaagta 
cagctcagtc 
tactcgatat 
catttgatac 



acttaatttt 
atgctgcaac 
tgctaagttc 
attcaaaatc 
acagcgcaca 
tgtcagcacc 
aagaagctat 
aaaccatatc 
cactttgagt 
attctttgtg 
tgttattctt 
tctcatatct 
tgttaacgcc 
aattcttctt 
tgtcaatgtc 
gcttatcatc 
taggggctgg 
ctccagatcc 
atcctttaca 
cacctccttc 
tcaaactttt 
tccaatcgtc 
cgaaactttc 
gagaagtcac 
gcgcatggtg 
tatcactccc 
cacgttcgac 
actttaacaa 
tggttgggta 
taaacactgg 
tcacacatag 
gttccctgag 
cattctggca 
ccttttttac 
gttgactgta 
cattacatcc 
cttgtacagc 
cagtgactag 
actcgggcag 
tggcacattt 
tgagcacatt 
ttcttggtta 
gttcaccaag 
gaccgtctgt 
tatttataca 
ccagtaccca 
acaacaagca 
taacactgtt 
aatcgcagct 
ccaattccat 
ttctgttgtg 
ctacattgca 
tagtctatac 
tagcaaacat 
ttccaaacaa 
cgccaaggca 
atggaacgcg 
cagtcccgat 
acaatcaggt 
cctaaaacgt 
tgctctttct 



gcacaatttg 
tcccattctg 
tcggttaatt 
ttcacgaatt 
aaaccgctca 
cgtggcgaat 
ctggttttca 
tggtgttgtt 
aggtggagga 
ggtaaattgc 
gagtctttgt 
cacaacaatt 
agtcaaattc 
gctcatggct 
actcagaaag 
gtcattgatg 
aggaggcttg 
tccttcctca 
cgccatcttc 
tgccagaaac 
gtttctgtaa 
ggtattggcc 
cagtgttttg 
catgttttcg 
ataggggaac 
gccaatatca 
gattctgggc 
caccacattt 
gaatattacc 
cacgtctgtt 
tttctctgca 
aagctttcca 
agaacgctca 
tttgtacatg 
cattatcaat 
agcatacaga 
tagatgcccc 
aatttgttcc 
tttgttggag 
tattaagtga 
tgccttaaac 
gttatttttc 
gaatcttttg 
atcctacggt 
agaatagcag 
gaagtgaggg 
gcaacaacaa 
tctgcagaaa 
tcgtgctgga 
ccacttagtt 
tttttgaagc 
gtgcctactc 
caatgttcaa 
agtcaattcg 
aaaaatgtag 
ttttctcatg 
ggaagtgctg 
agaaaataca 
acttcttctt 
cacaagggaa 
agggttttta 
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ccgagtttaa ggaacaggct agagcagccg tagatgcagc cgcagattca gatcacctat 210600 
ccgcgtcgga ccctattttc tctattgttc gacacaatag caggcgtgag ggtattctag 210660 
attcagtgcc aaatatcggt atgttggctc cacgatcaaa atattctgtc gcagagtact 210720 
tgatggcaga. cagagacgag tctgcagata tcgctgctaa gatagggaca aagattgcaa 210780 
cagattttga agcattgaga ggagataata ataagagaag agcagatact tcagtcgatg 210840 
atctcaaaga atcactagcg gattctattg aaaaaacttc cattaaaaat accggcgata 210900 
ttaattcggt cacaaatatt cccacagata cagaggagta tgaattttct ctccacatca 210960 
cacagctttt cgcgcaggca tttttggaaa ctatgggaag tttattgagc tgtgctttcg 211020 
gtgtacagtt ccccttctca gatgaaggtt ttgcggctat cgaaaggatt atacgtaaaa 211080 
cagatcctga tactggaaag gtttcagaaa tggacccttc ctctctttca gatcaatatc 211140 
tcttgttggt aggaaatttc caggtttctc ccttccacgt ttcagatcct aaggatatcg 211200 
ttttcggtag acaagttacg cctaatacgc ctattttgag tatcattact agatctaaaa 211260 
atgataagaa tgaaacgtct actattatta atttcaggga taggcttctt gttaacgata 211320 
ccgtgttgag agacgccacg caaaacgtct caacttccac cccttctcaa agaagagtcc 211380 
ccaccgccgc aggtgaacca aaaaagccaa tgctctctgg ttgcctacct atcatcaggg 211440 
gaccccaggt agtgaccagg gaaagtgatg atatgattag tggtcttgtg ggtgattggt 211500 
atatctcttt gggggtctat tatgccatgg gttcatcggc tgcagctatt gccgctggcc 211560 
accagagagc tctcgcttca gccgagtcaa ttaattcacc catgatgaaa aaattctcaa 211620 
agaagggagg aaagtataca gaggaggaaa agcgaatcaa gaaggctatg cgtaggaatg 211680 
cagatcgttc tgctaggatt ttggctttgt tggggcaaac tgatgcccag tatgggtatg 211740 
tcgaacataa ttctacccta gattcgttct ggtcttcgaa tgccgctatt cgtgcaaagg 211800 
caaaggagga tgctcttagc cgtgcagaaa tcttggcagt taggaaacaa ttggacggaa 211860 
aatgttcctc ttcaagggat gaatattcca tggtagagag atatcttaga gactcatttt 211920 
tcaggtcagt taataggtca ggaggaggat atgaaatgtt tgatcaaggt tttgatatgg 211980 
gaagatttgc cgactttttg agtgacaact ctgctgccag gaacgcttgg caacagtacg 212040 
cagaagtaat gagaggactt tctaagcatg agaaacgcgt attcaacatt gaaggtcttt 212100 
tcagtgctct aaattctttc aagttccccc ttgttccaga acaggggcgt aaaaagactg 212160 
taggtggaag gcataggctt aacaatttga aggcggccaa taagatcatt aacggcatca 212220 
cagagatgac tctccagtca gccatcgatg gtactggaat ctctgatatt attggttcag 212280 
ttagcgatgg gtggggaaat actacagctc agccgtctcg cgtcaaagct cttaaaacat 212340 
tgtccaattt cagcggaaac ggaaatgttg tgtcaattcc agttagtcga gcagtaaagt 212400 
gtgccgcggg aagtcgaggt ggggaaactc tcaagtgtgt ggacattcct tcagtcatta 2124 60 
ttgcaaacct aatctctgat aagagaattc tagatcaact ttgcggagga gggatgaacc 212520 
tcgctcacga aatcacaaac tttatcgaga cgattgcagg taaagaacat acaggaaagg 212580 
aatccgtttt cttgtctcct agattgtctg tcattctttt gcgttatatt tggttcaacg 212640 
cagcggttgt ttcccttacg gatagcaaca ttaaaatgcc cc tcaacaca atgtccgagg 2127 00 
gcactggcga tgacatttat agggactatt tggccatcag aggcatggtt aataattaca 2127 60 
acagtagtct ctcctctatt tcagtcaagg ctatttccga taggtacaat tgcggtagtg 212820 
gaaatacttc aaccagtaat aagaatgtga ccattaagac tcaaggtgaa ttgttaactg 212880 
tccttcaaca gactgctaat gccttgtccg cctttaccaa caagggaggc gtgggtgcaa 21294 0 
cccccgatgc tgccaatatg gccaacgtta tttccccaat tgctaatgcg gatgtagtta 213000 
agaacaccaa cgtggttgtt tcagggctag ataggatcac tgagaccatc aacttctttt 213060 
catttttgtc tcagatcaaa acaatgaacg agaacattga agagtatctt aggagatata 213120 
ggctaggaga aggactagat aagaaagaat tggataattt tgtgtatcca aatattgcag 213180 
ctattgttaa gcgagaattg ggggtaagtg gttccgcatt gtccagtaat ctcgatactg 213240 
atcgtccaat tactatcgac ctgaacactg aacagccttt gatcgtaaag gctagcaagg 213300 
gttatgcctc taaccgctac gctaaattat tcaacaaaac aacaagaaca gcagcagaac 213360 
aagctcagat ggagcagtat aatgcacaaa tggctgccaa tactattcct caattagtaa 213420 
acaggttgac catccctgga tccatcacgg cagacactgc catcaatgtc gttaaagctt 2134 80 
tcacagaaaa tggagaattt agtaacgcag aaacacactt gggggttatg ggtaacgcga 213540 
ttaatgaaat gcaacctctt ttcacggacg gattcaacgt tgcgaacaag cgtttaacag 213600 
ttaacgtggg ttcagttagt aagctgattc agaatgggtt aaccgtatct ctcattcttg 213660 
ctcactcaaa ggctagcccc tatgtcttta agcctctcgt gcaagatttc gctaagcttt 213720 
tactggcagt cactgcagag acttctctgg ttgtctctag gtcccagaag agtttcttcc 213780 
ccattcctcc ttcagtattt tcttcaggtg gtcttttcaa aattgatagg gaaatgttcg 213840 
ataatatgaa gacagattat gtagtggaag taattagaca gctatctaag aatgctaccg 213900 
ccgccatcga aaggtgcaat gattccgatt cagctgctag gattgccaag tcaggtgaaa 213960 
tttataacaa ggatgttgca tcaaccactg cagctcccgg aacttcttcc tccgctttaa 214020 
ccttgttcgc caataatctc cagaaccctg caaaggtatg gtctatggga gctctccccc 214080. 
atttcgatat ggccgtggta ccaaaacttc atggcatttc tcacgatcaa atgttccgcc 214140 
tatctacata ttatcagggt attcataaga tggaacttaa cagcgattgc aaaccagaag 214200 
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aatgggataa 
gcgataacaa 
tttgcgatct 
gatcgaattc 
aaaaatacga 
ctctggctaa 
tcacctctaa 
ctacatccta 
ttagaaggaa 
tcatgagtaa 
aagaatcgcc 
ttgctacccc 
tcattagaaa 
ctagtggtaa 
catcaatgac 
agtgcctaga 
tccaaatcac 
ctggcgttga 
ttgcaaagag 
tggatcaatt 
ctatctatag 
agaaaatgac 
ctggcctggt 
ttatcaaggc 
gtcaattgat 
gtcagcgtct 
gcctgttcgt 
aaatgaacga 
gtacagtttc 
aagaggatiaa 
tgaggaagaa 
aaaactttat 
tttccggccc 
cggttcacaa 
agagggctgg 
tcttgaacga 
agggaggatt 
aagaagagtc 
atgagtatga 
cgtatgatgc 
gcaggggaga 
ggagacgtcg 
acgacga tga 
caaccagaag 
ttaaccgtcc 
cgtacgatag 
cttcttcttc 
gccgaacaag 
ccagagtaac 
cagtaggat t 
aagaatacaa 
gagcatctaa 
atgttgcccg 
caaacgccac 
aacaactaga 
ctccttcact 
ccaaacagat 
gaactcttgc 
ataagaacca 
atactacttt 
acagcgct tt 



ttctcttcct 

ccgttcattc 

ggtgacgagg 

caacacggac 

cgagtctact 

gagcaaatct 

ggagattgat 

ttctgtttct 

tgttgtggat 

agaggaggat 

cgttatgcaa 

caaacagcat 

agttgcatcg 

caccgattac 

cccctcgtcc 

gactggccct 

tgcacctaaa 

gcggctgaga 

catcgaagct 

gcgtaaggcc 

tgatttcgga 

cgctagacaa 

acacaaacaa 

aaagtatgtc 

ggccgtggcc 

agttgcttcc 

ttcattcctt 

attgaacgat 

agatgcgcta 

gaaattagac 

gactgaacta 

gtcttacgaa 

taggccaaga 

atcgttcatg 

actcagggct 

tatgaggtct 

catgatgatg 

tgaatcatct 

ttttgatgat 

tctagaccgt 

aggtgaagac 

ctcatcaaag 

ctactttgat 

agctggccgt 

ggctaaagca 

attctttatg 

ctcttccgat 

cgaggaccaa 

aatgatcaat 

ctatgaaaat 

aattgttaag 

acaagtgtct 

cacccaagaa 

tctagttaat 

ggcaattacc 

tttcacatca 

cgtttcagga 

ttcatgtatt 

tctcgaatgg 

ccatgcactc 

caccaacctc 



ggaaataggg 

aatttggcat 

gaaatggtaa 

gcgcttcaaa 

ctttctacta 

cctctatctt 

aggacatgga 

gaagacgctc 

gctactaaat 

gtgcgttctt 

gacattggca 

cgcagattct 

aatccccttc 

aaccttaaag 

atgtctgtct 

tcaatgtttg 

ctttacggtt 

gactctattt 

ctggaaacgt 

cagaacatgt 

aacattgact 

caatcagata 

caacctcaat 

actaatgacc 

ggtgttgccg 

gacgtaacaa 

cttaacaacg 

gaaacaaagt 

ggatctaggc 

aggagttttg 

tacagggaaa 

tcttccatgc 

aggtacagca 

gtttcttacc 

ctggctgata 

tcggcggtct 

attagtgatg 

actgatttct 

gataataatg 

ctgaattctg 

gatgaggaaa 

atggggaaga 

gacgaatttg 

attcaatatg 

cgcgctttct 

gaggatgacg 

tctccattct 

tgtgcctttg 

ggtcgagtta 

taccagaagg 

ggtgcttcag 

cccagggaac 

agtaacttga 

gacccatcta 

gagaggagga 

ttcggtgatg 

ggaaatttca 

aagggcacta 

atgaccacag 

gaagatacac 

gtcggatcgg 



ctagcaaatt 
tggatactct 
agaccagtaa 
agagccttca 
aagaaactga 
cttcctcatc 
acacccccgc 
tcaatgctcc 
ctctgtacga 
cgagtagaaa 
ttgaccgcat 
tacagacagt 
tctcttcaag 
ctgtatatga 
ctgacagatt 
ccgatgctgg 
ctagagtcaa 
cttctgcgac 
tcgtaaccga 
acaacaaact 
gcgctaaaat 
ctattcttag 
tggctactca 
tcaataatat 
attactataa 
tgttcctcgg 
tacttttctc 
ctcttttggt 
acgtgtcttc 
ttacatcact 
ctgatactat 
tcaagaggac 
cccttgagga 
cagagagggc 
acaggatgga 
cttccgaaat 
atgaggatga 
cctcatcaga 
gccagtctcc 
ccgctaagcc 
atgacctcta 
tccttagaga 
atggcgaacg 
gtccaggttt 
tgacacgagg 
attccctcct 
cctccttcag 
ttaagagagt 
gcatgatcac 
ccaacaagag 
ctaccttgcc 
tgaggaggtc 
atattatctt 
ctcttttggg 
ataggctaat 
caagtaaaga 
agtctgccgg 
atacagttga 
ccgctattgt 
taaaaatgac 
aacattctca 



ctttggcctt 
tttggcttca 
cgatattgtg 
agttggtgcc 
cgtatattcc 
tttgtcgtct 
tcttctcggt 
tctttcagcc 
agttgcagct 
gattatggga 
tgctagtctt 
aaacgattac 
attgggagga 
tggtgttgtt 
ctggtcggga 
tcatggaggt 
cacctacgca 
tcaggaaaga 
tgtggtgggg 
gtcagatatt 
catgaagaat 
ctctctcttg 
atttgctctg 
ccacgagaag 
tgtgtcggca 
cggaaccatg 
ccaggtttct 
taaactggta 
aattagacgt 
ttat tcagca 
taacaaactt 
ttccttggta 
tgtacttgag 
agctgcttct 
atctctttac 
gatggatata 
tattgccttt 
tgaatattcc 
ttattcaact 
tcttactgcc 
tgaagaagaa 
tcttca tgag 
ttcaatgtca 
cctatctcat 
caagaaattc 
cttctctgac 
caaggggaga 
tgtacgtgct 
cccagtgact 
ggaaagggcg 
cgacgaatac 
tcttatcaag 
tgacgctctc 
tgatacactt 
gaaagaccta 
cacccaaatg 
ctatctaggt 
tcgtcttttg 
ttttgcacgt 
ctccgctttg 
gcgtctaaaa 



tcttcggtga 
cctgcagaga 
cataatattg 
tcagcagtag 
cttgtttctg 
gagggacatc 
accgctaaaa 
gtgttggact 
gtttgtagtg 
atggtggaac 
gttagtacag 
aaaaattatc 
atatccccta 
tcttcctcat 
gtattttctc 
agtaacatgt 
gctctgagct 
aagaatagga 
ggagatactt 
acttccaact 
gtgacgagca 
cacgaactcg 
gcgagccatg 
gaaacattca 
gctgccatgt 
ctccaacaag 
gataatatta 
ggattttgcg 
gtacagaacg 
tacagagatt 
ttcggacatc 
catgacgctg 
gctccttcca 
aggcgagtga 
ggggaagaag 
gagtatggtg 
attgattccg 
gattcatccg 
acatcttatt 
atctacgggt 
caagaaagga 
agtgatgatg 
gaaactattg 
tctaatattc 
aggccttctg 
gaatctacca 
aaatgcaagc 
tttgtgccca 
agtgaaaata 
cgtctgatcg 
gtagagggta 
gcagctgctt 
accacaacat 
ttgttcgcca 
actgaaatct 
atggccgatg 
gtccctctca 
gctaccaaaa 
tcattcaacg 
acagacatgt 
gtaaagagta 



214260 
214320 
214380 
214440 
214500 
214560 
214620 
214680 
214740 
214800 
214860 
214920 
214980 
215040 
215100 
215160 
215220 
215280 
215340 
215400 
215460 
215520 
215580 
215640 
215700 
215760 
215820 
215880 
215940 
216000 
216060 
216120 
216180 
216240 
216300 
216360 
216420 
216480 
216540 
216600 
216660 
216720 
216780 
216840 
216900 
216960 
217020 
217080 
217140 
217200 
217260 
217320 
217380 
217440 
217500 
217560 
217620 
217680 
217740 
217800 
217860 
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ctcttttaga 
tataccctac 
tgcaatcact 
atattggaga 
gaggagaaag 
cctctgacaa 
gagaggaaat 
tttctctgga 
ctgatattgg 
ggcttaggcg 
tgggtaattt 
cgttcgttct 
ttatcaaccg 
ttgccaccta 
tgcgtaactc 
tctcatcccg 
tagaaacatc 
tcgccgaaca 
agacacgcgc 
ccagcgagag 
ttgttcaaaa 
aaggttcatc 
accgtatgtc 
ttgtaccaat 
ggctgaagac 
tagctaaaac 
tacagtctct 
atattaagaa 
ctccttctag 
ctcccagttc 
ttgccaaaac 
cttctgacgc 
atgttgagca 
gtaacgtact 
ttggcatctt 
aggctaataa 
ctgaaaatgt 
gagatgataa 
catctattga 
tggccatcca 
cctctgctaa 
ccctcgaatt 
ttgcagccgt 
gatctgctcc 
actatttcaa 
cagaaactgt 
tatcatatgc 
accaactgat 
gggccatcaa 
ttaataactt 
ttttggacaa 
aaactacaaa 
tcgattacga 
ctgctgataa 
aatcagacag 
tcatcaatga 
ctagacaatc 
gagaacgtct 
tgaggaagag 
ctgttaatag 
cacaaatcgc 



ttctattttc 
agcgttcatc 
cgctcttaac 
cactgctggc 
gggaggtttg 
tcggctcaag 
cttctacaag 
aaatgccgta 
catgaccgat 
cgcagaagag 
gtacggaggt 
gctccaagat 
tatgaagtcg 
taagaacttc 
tattagcact 
cagtggttct 
tgctgttctt 
ctiacaacaaa 
agtcgttgcc 
aggagcatct 
tgaactgaaa 
ccaactgcac 
cgcccaaaat 
gactaacgac 
cgatgaagaa 
catcctcgca 
ttgctttgga 
tatggacaaa 
ctcggccatt 
ttccccatca 
tatcccctac 
atcatcctca 
caaaaaagtg 
ttctgttacc 
caacggaagg 
taatgccacc 
ttcctccacc 
taaggtacca 
aaacgcatca 
ggcagcttcc 
ccagactgcc 
attggaaaga 
ttctcttccc 
ctcttcatca 
ctatgcttac 
ccagaaaaac 
tccattcagc 
tgatgccatc 
ggaaacggac 
gcaacaaaac 
caaaaggaac 
gagccg tatc 
aaacacatct 
ctcggggaca 
taaatt tgtc 
gggacaaaat 
caacggtgct 
tcaggctgtc 
gt tggcagat 
tggtatggtt 
tgaaagcact 



aacactagga 
aaccatgaaa 
attcttaggg 
ctgttgacct 
tctctataca 
ggagcagtct 
aggagcaacg 
ggtcctattg 
atcatttcta 
ggcgctggcg 
attgataccg 
tcagactcgt 
aggaaacata 
ccatcctcct 
atcgtgatgg 
actttggcca 
tcaaatacaa 
ctaaaacaat 
gtaattgctg 
gttagtgaac 
aacattgaag 
gaaaaaatgt 
aacctcgaat 
caaggtggta 
aataacacca 
attgtagagg 
aaatatgcca 
ctcattgaaa 
tcttcttcct 
tcatcatctt 
atgcctatcg 
tcaccatcat 
gctctccaac 
accaagcaca 
caacacgcag 
gtttccgcag 
agcatgcaat 
gtaactgtca 
gaatttaagg 
accacagaaa 
atcactgcca 
gttatcaagg 
gttaatggag 
tcctcctcct 
ggaaaattga 
atggtcgaac 
gaaatgatga 
aaaacaaaga 
aaggagcttg 
ttttcaaacg 
gaattagagc 
gaggctggtg 
gaaaatgaca 
gtttacaccc 
gacatgtaca 
actgtaaagg 
tcagagcctg 
gagatgttta 
tcggctgctc 
aacatcaaga 
gcactaaact 



tggctcacac 
tgccctctga 
gagttaattg 
ttattacatc 
gaatgtccat 
ctctagaggt 
atctggtcga 
ctaggtttgt 
gaacagtcaa 
cagcaggaaa 
ttgtgaacct 
tcaatacacc 
aggctctcaa 
ctgaagcaat 
acatttccaa 
agatttccac 
aggccaatct 
tcagccatat 
aatctttaac 
tattgacaga 
aggcaaaacg 
tgagcctgct 
gtaagaaatt 
cattcataaa 
gttctatcaa 
acaatagaaa 
tgaacgacat 
aactaggcga 
catcatctaa 
cctctttctc 
tcttccaaaa 
catcttcttc 
aacttcaaac 
gatttgcgtc 
agacagttgt 
gccaaggaat 
tgcctccatc 
ggcttaacca 
acttgaagga 
caaaggaaat 
tctctcaagc 
cagtcgaggt 
attctatggt 
catcctcatc 
agaacattga 
aagatgctgc 
gacgtgctat 
tcgtgtcaga 
atatggacaa 
aatcagacaa 
ttcagaacaa 
gaggagatgt 
acaatctctt 
ccactgacat 
acaaacagat 
tagacttttc 
tatcttcttc 
tggcaataat 
agtggaatac 
gtgaaaggct 
ccatcaacga 



tgaagcagtc 
ttacacacag 
tagccaattg 
acgtaaattt 
tgttgatgct 
aggaaagtgg 
tttttgttca 
tcccaatgga 
ggatgacgct 
attcattaca 
aactgaaaaa 
aacagaaatg 
aacaccattc 
tgtagttaga 
gticaagggga 
atctgaattt 
gagaaccatt 
tagtaatgat 
ccccgtgtat 
caatactctc 
tcacgtgacc 
cgttgcctca 
gactgaagga 
gcacaaagaa 
ggacaatgat 
tgcaaccatc 
ttttgcactt 
agcactcgca 
cacaacatcc 
aatggactat 
caaacaatct 
ttcatctgcc 
acaagaatct 
tcataatcaa 
tgctatacca 
tcttacccgc 
atcatcatca 
gtacgccaac 
agcagaaagg 
ggtcaccgtg 
taaatccctt 
ttacacccca 
ttcttcttcc 
ctcttcctct 
tgaaaatact 
cgttcgcatc 
tgacaagttg 
cactaaacag 
agaacaggtg 
gataaagatg 
caaaactagg 
agcaaacttc 
ccagagcctg 
gagcaatgga 
tctcgaggga 
aaaggctttg 
agttgtggag 
gatggagcgc 
tgttaataat 
cacagagatt 
cgagattgta 



atgggtctcg 
cgcagagaga 
ccacgaaagg 
gcaggttatg 
ctttcttgcc 
caggatatgg 
aagaacaata 
actaacatgg 
tcaatgatca 
gcctcagcca 
ctatacgact 
gccactgcta 
gggggagata 
gccaaggaaa 
atcaactcgt 
gaaaggatac 
gagaatagac 
ggactttccg 
gcggatgaca 
ctcaaattta 
gccgcaattg 
gccgacatca 
aatagtaact 
acaggtatct 
cagcgtagag 
cgttctcgtc 
gatgatgccg 
gagaaggcat 
tcctcttctt 
tcaaacaacc 
aatgtcaatt 
aatattgata 
aacgatttga 
gctgcaactg 
aatgcaaaca 
ttctcagccc 
tcatcaaatg 
tcaatcttat 
aaaatcgatc 
tctaagtgcc 
aagaaaagtg 
gattcatcta 
tcgggatcag 
aatgtgacag 
gaagaagggg 
cctcttctag 
aacgaatact 
gcttcctcat 
atttcaaaga 
gctattagtg 
agctttattg 
aaggagatta 
aaagcattcg 
agagacacaa 
ggaatcaaac 
gaggctttcc 
aggagacagc 
accgagagtc 
gtagaagaaa 
aggaatcaag 
gagtctcctc 



17920 
17980 
18040 
18100 
18160 
18220 
18280 
18340 
18400 
18460 
18520 
18580 
18640 
18700 
18760 
18820 
18880 
18940 
19000 
19060 
19120 
19180 
19240 
19300 
19360 
19420 
19480 
19540 
19600 
19660 
19720 
19780 
19840 
19900 
19960 
220020 
220080 
220140 
220200 
220260 
220320 
220380 
220440 
220500 
220560 
220620 
220680 
220740 
220800 
220860 
220920 
220980 
221040 
221100 
221160 
221220 
221280 
221340 
221400 
221460 
221520 
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tcaccctctc 
gtatccaaca 
cagaacagag 
cctgcttcct 
agaacaaggg 
tgatgaccat 
aaatgctgta 
tgatgatcac 
acaaaagatc 
gtcctgccgt 
attttggatt 
gaggtacttt 
ttaactgccc 
agaatgtggc 
ctcccaacgc 
tcaccgacgc 
gtgatcaaaa 
agcagggcga 
agaacagtgg 
aattcatggt 
acaggctttc 
caaactctgc 
ataaggaatc 
aaaaatctgc 
agaatctggt 
ctgaatctac 
acgaaaacgt 
gctacaatga 
ggtcaaattg 
cctttggagc 
aagacagcac 
tggatgcttt 
tttctctccc 
cttcggacat 
tcctgtcaat 
gttcttcttc 
ttctggggac 
gaggaaaaat 
ctattctcca 
ttgagactag 
caggaggcgt 
tcatgtcacg 
tggaagctgt 
gtggaaagtc 
ctctcaccgc 
acaacacccg 
tagtaaaaca 
gcgcttctca 
acatcaaatt 
aattcttggg 
ccttccatct 
cattcgcccg 
agcttattag 
gatcttccct 
atggaacatt 
tcaacctcga 
gtcatcgcca 
ccgtcttcca 
caaggaacag 
ttttggaaca 
tcgctgaatc 



tttgggagca 
gcagcaacag 
aaaggaacaa 
gcaacctctt 
tcctctcagt 
cactgacttg 
cgaaaatgtt 
caacgtccac 
cctgaagcag 
gagacacgat 
cactcaccgc 
cagtggaaat 
aggagttgaa 
tgacgcaccc 
cgagcacctt 
catcatttct 
tctggtaaac 
aagaaggatt 
catcccttca 
tgctccaggc 
agacattaac 
atggagaatt 
ggctatgaaa 
cagtcacatt 
caaggaagct 
gttcgccact 
tacaaccgtt 
cgctttcaac 
tgaaaatcta 
ctttgaagaa 
caaatattcg 
gtccgctgtt 
tacttctatt 
gaaatctaac 
gggagacagc 
catccctgtc 
tattctcgac 
ggtcgccggg 
gatgaacgaa 
gcaactgagg 
gggagccatt 
tgtcgatgca 
ggatagtttc 
ccccgttccc 
ccgcgctcta 
cgacactcaa 
agctaaaaga 
cctcatgctt 
gcacatgact 
aagatatggg 
ccaccacact 
cccagaagtc 
tgacaaacat 
ccttaagaag 
tgaaccatca 
cgttattgtc 
gcacgtgtgc 
ggcaaccgat 
atacatccaa 
acttatcgcc 
tgtaccagaa 



cgagtcgacc 
caacagcagc 
caatacgctg 
cacgagacaa 
aacacacgtg 
tctaggtctg 
ccctcatcaa 
gaagccactc 
ctgacagaaa 
gtactaacaa 
caaaaggttg 
gttgcaccta 
aatgacaaat 
aagtcatctg 
tacttcccct 
ggtatgagcg 
attgatcaaa 
gtgcacactg 
tgtacaaagg 
tctcttctga 
aacgtgagac 
ggtgaggtgg 
aagatgcttc 
aacgatccta 
ttcccagacc 
caactcgcct 
tcaaatatct 
aacatctttg 
ctgaaatctg 
gcttcatctt 
tccaaccaac 
tctactgttg 
acttctatcg 
ctcatcaaga 
caagtagctg 
accaccagcc 
aagactgtgg 
agtcctgaag 
aatgaactcg 
gatgctttcc 
tcttctggag 
tccagcggtc 
aatactactc 
atggccaagg 
ctgagcgaat 
ccagaaaggg 
ttcgacggag 
tcttccatgg 
gatactcaat 
gacgaatacg 
tttactccct 
tccatggaag 
.tacgtgatga 
gttagcgaat 
agactcgccc 
aaaccaaata 
accgccgacg 
ggaaacggta 
aagagcacaa 
gatcttggaa 
agcgtatatg 



agctcttgat 
aacagcagca 
cagatagggt 
ttaaacgtat 
gtgttcccac 
tactcgactc 
ttgttcctgg 
atacttctcc 
tgttgaacgc 
tgttagagtc 
cgtgtatcac 
atactgttat 
ttagatccat 
caagcatcgt 
tcaaggacca 
gcgaatcttc 
ctactggctt 
aaaacactat 
accgtcaaac 
atgctaacaa 
attatggcac 
tgagagccgc 
ttctaggatc 
ctgctttgtt 
ctgtttgttc 
accgccagcg 
gccetatgga 
gctctaaaat 
ccatgtctaa 
ctgtcaggaa 
ttgctgtaca 
tcggtcgcca 
caaccagtgg 
caatttcccg 
caggttcttc 
aggatggaaa 
agatcaacaa 
ctcgtgccat 
ctcgtgactt 
aggatttgaa 
caagtaccaa 
ttctgatgaa 
ctttgctcgt 
aaattaggag 
cttcccctct 
cagtcgacag 
ttgacccagc 
attcccatac 
gcttcttcaa 
ccatgtccca 
cagataacga 
aaattagagc 
acatgtccaa 
ggactgaaat 
tctccaactc 
atgcaagaag 
ccaagggaac 
acgaatctga 
tgaacgctca 
aggttatcgt 
aaaacaccaa 



caaggtagat 
gcttcccaaa 
tgtttacgat 
ttcttctgtc 
tagcgatgcc 
ttcttccact 
actctgccag 
tcattcattc 
tgccacttca 
caacaatggt 
ccctgttaat 
ccttcctact 
ggttaacagg 
ggagactctt 
gaggcgacac 
atctcaattg 
cccagtgttt 
ggaaggagct 
ttatatcgat 
ggaagaaact 
tgatgttcat 
ctcctcattc 
tgtatctgcc 
gagcaccaac 
ctctaattac 
cctgttccct 
tttgatggga 
gacatctact 
cgttcctgct 
taggctttct 
ggccatgacc 
gaatggcaga 
ccgtccatca 
catcaataga 
cttctttaac 
tgttgcagca 
gagattcgag 
ccagcgcaat 
gtgcgaaatt 
gaggtctatg 
caatgttccc 
caacaacagt 
taggcacatg 
catgctgacc 
tctcactgaa 
actactaact 
cttccctgcc 
aaagtcatct 
gaacattgaa 
caagcaaaat 
gcatctggta 
cacaccctat 
gatcgattct 
gagaatgaac 
tggcatgaca 
tgtactagga 
tgtcgcttca 
actgatccag 
aactgtcgtg 
gaacgaactg 
ggaaatgatt 



agagtaggaa 
ttgacagcta 
ccttcataca 
tataattcaa 
gacttgcaac 
tcctccaaga 
caatgcgcaa 
aatttcgaga 
tccagtgacg 
tacgtcaaag 
acacttctgg 
tctgagttgt 
acaaccgaca 
gctcgcacgt 
ttcaactcca 
aacactactt 
acaggaagaa 
cgcaaggaca 
atgggcacaa 
ctccgtctaa 
gtggcaggcg 
cctgacggag 
atctctgctc 
acgtctatcc 
ttggggtctg 
aacggagatg 
agtacaaagc 
aacaaaaagg 
atcaacactg 
cccctttatg 
gatactgctg 
aatactcttc 
ctctcttatt 
gacgctagcc 
tctttccttc 
gcggaaattg 
atgcttggag 
acaatgtcct 
gaaaataaaa 
ctgatgactc 
ctttctcttc 
gccaatgtaa 
atgttggata 
caaccaagag 
atctgcctct 
tcagcctatc 
gccctcacct 
ttcatggaca 
cgatttgaga 
tgtaactgcc 
tcctctttcg 
caggccaaca 
agagtaacag 
tccaacttta 
acggcaggag 
atattggaat 
gccatgccag 
aatgctctgc 
tttgctaatg 
gccggcacca 
gatagactag 



221580 
221640 
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221760 
221820 
221880 
221940 
222000 
222060 
222120 
222180 
222240 
222300 
222360 
222420 
222480 
222540 
222600 
222660 
222720 
222780 
222840 
222900 
222960 
223020 
223080 
223140 
223200 
223260 
223320 
223380 
223440 
223500 
223560 
223620 
223680 
223740 
223800 
223860 
223920 
223980 
224040 
224100 
224160 
224220 
224280 
224340 
224400 
224460 
224520 
224580 
224640 
224700 
224760 
224820 
224880 
224940 
225000 
225060 
225120 
225180 



WO 01/38351 



63 



PCT/US00/28888 



gctctgacga 
atagcgaaac 
tctatcacac 
gcgccagcgg 
agaacgccga 
ctactcttcg 
ctgaattcaa 
gtccaccaaa 
ttgtttcctt 
gcgcgttaca 
aaaacttttt 
ccattaattg 
accctattct 
tggaccccga 
acactgcttc 
tgggcttgta 
tcacagacac 
tagtgaaccc 
atgtcttgaa 
gcctcgtcac 
agtacgtcag 
gttctgatgc 
acatgaacgt 
ggctgaccaa 
gcaacaacaa 
ttatcaccaa 
catcatcatt 
ccttctttat 
agggaatttc 
ttggcacggg 
tcatgaaggg 
aactcactca 
ttattaacaa 
acgaggctgc 
ctgctctggc 
ccgcgccagt 
cagctatccg 
gaatcgtaca 
acaacactga 
ttatgcaatc 
aactgacgga 
aagctatcct 
tttaccttac 
cgtctctgac 
ccctgaaccg 
tgtaacaatt 
aagactttgt 
ggttgattgt 
ctcctagtgc 
gcatgaaagc 
tggtcatatc 
aacccaaaag 
agaaaagagt 
atttttgtcc 
aggtacctgc 
atgcaggtat 
aactacatac 
tgaaattgca 
gagaggatct 
accttttcta 
cccgtctgaa 



cctcttcaaa 
aacatccaac 
actaatttcc 
ccctctcgcc 
acaaggtgca 
taagcatttg 
ggatatcctc 
tactaaccaa 
cttaaccatt 
cgagaaagta 
ccaatacggt 
tagcaacaat 
tattatggcc 
tactgccatt 
aatggcgaac 
tctttcttcc 
cacagaaaac 
cttcaaggat 
tgataagtcg 
caatactgga 
gagagacaat 
ttcctctaat 
gagatttggt 
ggaagaggtt 
caacggaaat 
tgccacattc 
caaatgggga 
tggtaaccca 
cctcactgaa 
ggctctgcgt 
atggaacaac 
tcttttgaga 
gatgcacgac 
tctgctcgtt 
catggtttct 
tagaggtatt 
caagcgcctc 
caaatcagcc 
tcttcaagac 
tttgggactt 
ccttattact 
taaaatgctc 
aacaggtaaa 
agtcggagga 
cctttatttt 
ggtttatatt 
attatttacc 
agcatttcct 
tgatacaccc 
gaccatgctc 
agacatgatt 
tataaatttt 
cagtcttgct 
agaaatggat 
cgatetactg 
agaaatcgag 
aataaatcca 
atctgcacaa 
tgaaaggaga 
tcttgatttc 
aatcattgat 



tctaataata 
aatggtcccg 
ggcaaggcag 
tttgatttcc 
gcagctgccg 
gctcgagttt 
aacgatatcg 
aatgcctttg 
cttcgtaaga 
gccatctatt 
ctcagtaatt 
gtcaagaata 
ctcccagaaa 
gaaagcagac 
ggtgcaagag 
cacattcttc 
gtcaactatc 
tctgctaggt 
tgcaactact 
tgctcttctt 
acgcctgtga 
ctcttggacg 
ttcatgcccg 
attaagcaaa 
gggaagaaaa 
cccgacactc 
gatatcaacg 
accgccgccg 
gaaaaacgca 
gcagccgcca 
attgttcagc 
tcgggaggaa 
agcttcaaga 
gccaccagcg 
ccagtacgtc 
agccacttgt 
aacctaccta 
aagacacttt 
gttgtctcta 
aaggggaata 
gtaccagaat 
atttctaacc 
aatgctctcg 
agtggaggag 
cgggtctaga 
tgtatgtgat 
aaatttattg 
ccccctggag 
aaacctagga 
aatgcgatga 
tctggtatct 
attctatact 
ccagccacca 
aaagtttgtg 
tgcgttgcaa 
acaagagtgg 
aatgatgaag 
ggaaaaaaac 
ttaatcagtg 
tccctttacc 
gaagacacgt 



atggaggagt 
tcctcatctc 
ctcgcccgga 
ttctgtcaaa 
tatcctctac 
tcgaagccat 
aacgtaatat 
ctctagctat 
acattacacc 
tgacccttct 
cgtcctcagt 
ttgaagacac 
atttcacagc 
gctcccttac 
ccgctgtggg 
acgagtctac 
attcctctgt 
tgatcgttaa 
tgcaagtatc 
cttcttcctc 
atcttccccg 
tgttctccag 
agattattgc 
tggtttctca 
caaccgtcga 
gtcctctata 
acagaaaaat 
caacagctaa 
agaaaatcgc 
acacccgcct 
ttcaacaaac 
ttccacccag 
cattggagga 
atcttttgac 
cagaaatgac 
tgaaactggg 
catccaacgg 
tgcttgattc 
acgctaggga 
atgcagggga 
atggaaacaa 
ctgaaattct 
caccggtatc 
gaaaactgtc 
ctagtgtaat 
tttttcggac 
atttttttta 
a tttcatagt 
ggaatccggt 
cgacaacggc 
gtttataggg 
gctcatctca 
ccgaggcagg 
ttatatcaaa 
cagaaccgga 
tggtgttttc 
gattttctgt 
ccattggttt 
gaggaactgc 
ctaattattc 
acaatggtgt 



agaatcaatg 
agaagccatg 
aaatgtacca 
gggagataca 
ctattcttcc 
ctctaagcaa 
ttcttctgac 
caagagagaa 
cgcattagtc 
ttcaaccaaa 
tgatcttagc 
attcatgtac 
tctcttgcaa 
caccttcctt 
tgcaggagga 
cgtcacaaca 
tacacaagat 
caacaacaat 
catgccatct 
atcttcgtct 
tgtcacacca 
ggcagatatt 
tgccgtctcc 
gaacaacatc 
tccagtcact 
cactgcagca 
gcacgccaag 
cggagtgcct 
aggcatctct 
ctcatccgac 
attcaagaaa 
aagccaagaa 
atgtcgtagg 
cggtggctac 
tggtcttatt 
aggtgtttct 
gaaaacacta 
agactctatt 
taacaacaat 
tttggtttat 
tcgcgatctt 
agaaaatgtt 
tgctcaggaa 
atcagacgac 
attgtatgtt 
aaataaaaga 
aacacctttt 
gggaca tcca 
aagtgcagga 
caagccgacg 
tttcacccag 
ctcttccgac 
agctcctcgg 
tactagggag 
aatctctacc 
aaggtgcgtc 
ccaacttttc 
gtacatccaa 
atacctggat 
aatattcaat 
tgttttctct 



gattatgaag 
aagaatgccg 
ttcgcctcat 
ttcgaagaaa 
tcttctaaca 
gtaactgatg 
tatactaact 
ttcagcagaa 
gaccctaaag 
tcaaaactag 
catctaaaac 
agaaatgtcc 
caggaacaaa 
aatcatccca 
ggaaacccaa 
tcaaaccccg 
cctgttatgg 
actggaattg 
gaatcatctg 
gataccttca 
gccgttctct 
gtcctcgaaa 
aaattcaagg 
aacaacaaca 
ggggatattg 
aatggaggaa 
gctttcccca 
cttacatctg 
gaaggatcaa 
atggaacctg 
gcttcagata 
acaaacgcta 
gtgatccaag 
ggaggagatg 
ggtgcaatct 
gctgctaacg 
ccagaacatg 
agcaacctat 
ttgggaagaa 
tctgctagac 
accaagcgtc 
gcagatacca 
atggcttgtg 
aatgttcaat 
taatttattg 
attggaataa 
tatgccacgg 
aagagtgtaa 
gtaacagaaa 
atgatgtttc 
aagtgtttta 
ctcacaaagg 
acgctcctgt 
cgcacgttta 
aaggaagaag 
tcggtccagg 
aaggactacc 
ataaaggctg 
ccggcaacac 
gacatttcat 
aacagtgaag 



225240 
225300 
225360 
225420 
225480 
225540 
225600 
225660 
225720 
225780 
225840 
225900 
225960 
226020 
226080 
226140 
226200 
226260 
226320 
226380 
226440 
226500 
226560 
226620 
226680 
226740 
226800 
226860 
226920 
226980 
227040 
227100 
227160 
227220 
227280 
227340 
227400 
227460 
227520 
227580 
227640 
227700 
227760 
227820 
227880 
227940 
228000 
228060 
228120 
228180 
228240 
228300 
228360 
228420 
228480 
228540 
228600 
228660 
228720 
228780 
228840 
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aaaaagagaa 
ttgaagcagc 
atttcgggta 
gtcaatacgg 
tgaagcaacg 
ataaaaatgt 
agtgtgaaaa 
gggatataga 
aaatgcctga 
ctagaatact 
ttttgtctga 
tacacacgtt 
atgatgcttt 
aagccgcttg 
ggaataaagt 
aatgtagaca 
ataaaaaagt 
ttttccatag 
cttccccaat 
aaacagatgg 
ttcaaaggat 
gtagggcgga 
accgtattct 
atatcatcag 
tacaagaaat 
ccgagtacaa 
ctgtcaacaa 
aattagaaga 
acaactgggg 
tcaatactgg 
atgactactt 
ggagattggt 
ttaccaaaca 
gcaaacgcca 
gtaaaacgta 
ttggtagcag 
ccaccaggaa 
gcagataccg 
gggggtgatg 
tatgaagaaa 
actttttcaa 
ttagtagaga 
ttcggaggaa 
gatgaagttg 
cctgcaactg 
ctgtacaacg 
aaatgcaact 
gacccatgtt 
gcaataaaca 
gaacaatagt 
aggaactaat 
tcttcacgga 
cagcacttga 
atacaaattc 
aagctccttt 
gagctgaaat 
acaaggggaa 
aactcttgtt 
acaggagcag 
atgatcatgc 
agcagttgtt 



ggatgcacta 
cataaaaaaa 
cttacgtata 
gatagtaaat 
agtgcgatgt 
ggccgaagaa 
tttatgtttt 
aagagaaact 
aaatatgccc 
tttatgtttt 
gaatagatca 
attaaaaaaa 
aataatttta 
tgcgcatgtt 
agaacatgta 
ctggatttat 
tactgctagt 
aaaaatggaa 
aaattttgcc 
tacattttct 
ggatcttttt 
tttttatgca 
tcactacaaa 
tgatacaaga 
gtacgaagaa 
aagaaatgag 
accatataat 
caagttgagg 
caggtgtgca 
tatgtgtttt 
caacagaaat 
gcagtaatat 
gtctcccgtt 
cagagcattt 
caatcggatt 
atgtatactc 
agtcgggaac 
gaagatatat 
atgacgatga 
acgaccccag 
ttttggtcta 
cagatggacc 
gttctatgtt 
atggagttcc 
gccagaactt 
tcacgtgctt 
cattctttct 
tgcgggcaag 
ttgcctgttt 
cactcatgcc 
cagtgtgtta 
tttggtgcta 
tggaacttct 
cattgttaag 
tactgtaagg 
gtacgctgct 
acttgttgta 
gaccaataaa 
ccatacgggt 
agctcacttg 
agacgatgga 



gtgctgataa 
ataatgctaa 
ccaaaatcta 
gttgaaaaaa 
tccagatggt 
agagttggac 
agggacatag 
gaaagcagat 
tattttggac 
gtggatgaaa 
ctaagaaatg 
ggtgtgtcta 
aagagaatac 
aaccttgaac 
aagtcgcggg 
tttgataaag 
atggcatcta 
ttggaagatt 
aagttaaaat 
gttataagat 
tgtaggttta 
tgtaaaagag 
ttctccaacg 
ggacgtaagg 
gatggaagaa 
gacaaaacct 
attaaccatt 
agtggtttca 
actggaaagg 
ttgagaaggt 
atcaaagaac 
caagacggta 
tacgagaagt 
tatcaccgtc 
caccctcgac 
ttggcaagaa 
atttgtacgc 
gaaaaagaag 
cgacgatgat 
aaatgtattc 
tagagcaatg 
ctcatctcac 
gaacggttat 
tgaaagagag 
tgtatctgtt 
ctccaagcag 
catcatctaa 
tctacctacc 
gtcaaacact 
ggagaagtga 
ccagaagttg 
gtcaatagac 
gaccagcaaa 
ctagcatcta 
tttggtaatg 
gcctgccggg 
cttgaaggag 
aactcgcctc 
aaactcatca 
ttattttctg 
atccatgttg 



gggtgacttt 
ggaaagtgtt 
aactggacaa 
atatccctgg 
acctttacaa 
ctcgccagtt 
acctcagaaa 
atgtggtcgt 
caaagtgttc 
tttcttataa 
tttctattag 
ttcatgctag 
caaaaactta 
agtacgaatg 
tagtgactcg 
acactttaaa 
aaatatgtaa 
tgactgagag 
cttgcccgga 
tctttagaaa 
ttcccgactc 
gagaatctat 
ctccccatgc 
gtatacacat 
gatatgaagc 
tcaaatctct 
tgtatgagca 
tttcttatga 
gggcgtgcat 
gtcagaatgt 
ctttcagatg 
ttttcaccct 
cccaatacaa 
tttgcaaagg 
aacacaaagg 
gagaaaatgg 
tactctaatg 
tacgatgatc 
gatgatgacg 
gaggtcgaaa 
aaaaagtctc 
cctaaacggg 
ggtgcaggtg 
aggattacaa 
aaactggaac 
cgtggagtat 
ccatggctgg 
tgaaaaacat 
ccctagaaac 
ttgaaggttg 
tggatgtgat 
tggctgacaa 
tgttcagtcc 
aatgtatgat 
acaatgataa 
actgttacaa 
gtgacaggtg 
tttatggagg 
atgattattt 
caaatagatg 
tgatggatag 



ttctacgcat 
tttcaaggat 
atttactccc 
ttacatatgg 
caccgactcg 
agtgaaaaaa 
aaaggaagca 
aacactaacc 
agtggtgagg 
tgatgaaga t 
acataaggaa 
atttaccctt 
ttttgaagat 
gctttgttct 
agcagttaag 
tttaaactac 
tgcaaaacac 
cgcatatttc 
tgttaaatat 
catgacaaag 
acacactat t 
gcatatgtgc 
ggccatcgaa 
agaatacgcg 
taaatacact 
tcttgctcct 
atatggaaat 
cacgtatgtt 
ctaggaccca 
gcacattaaa 
tgttatccat 
tcacagaagg 
catgtggttc 
acaaatatga 
agttgacgcc 
tgtttgaagg 
aagataaaag 
cagaaaataa 
acaacaacaa 
aggatgaaaa 
ctcctgtatg 
ccccgtcagc 
cagatgcact 
attttgctct 
atgatggatc 
aaaaggcggg 
tcgtgtagag 
ataccaacaa 
taggtacggt 
tactacagtt 
tctcattgac 
ggggaaaagg 
tattcataag 
ttgtaaaatt 
taatgttata 
aaaaattaac 
cggtaagagt 
agaatatatg 
aactaagaaa 
ggaagtttgt 
atattactac 



gaaaaggcaa 
ggagatcttg 
tattttcgga 
ggagaaatta 
gaatgggaat 
tatggtgcca 
aaggaaaaaa 
cataagcatg 
ttggatgaaa 
gtagacgaaa 
aatgtacctg 
aatggtttgg 
gaggaactac 
aataatagag 
cgtaggagaa 
aaatactttg 
gactgtttag 
aaggtagaac 
gtgcagaaaa 
ggtgatctta 
acacttttga 
acaaacaaac 
caaataacca 
atcgaaaatg 
ggaactttaa 
catttaacac 
tttgatgaag 
actgcaaaag 
taactcaaag 
aacatacaaa 
caaggcaact 
caagggaaat 
aagagaggcg 
gcggaaaaga 
caacagatac 
attttgtgtc 
ttttctacta 
gaccagtagt 
tgttgacgtg 
atatgcctgt 
tagagggtta 
atttaatcca 
agaagaagag 
caagagagga 
taaagcagac 
acgcatcagc 
ctcgtcactg 
gaaaatggag 
tgtggaactg 
tcttctatca 
gaagggcaat 
attgtgattg 
ctattgcctt 
gataccaaag 
tgtgtaggag 
aagaaaaaga 
acccaagcca 
tgctttcccg 
attgaactag 
agtaaaatta 
tcggggattg 



228900 
228960 
229020 
229080 
229140 
229200 
229260 
229320 
229380 
229440 
229500 
229560 
229620 
229680 
229740 
229800 
229860 
229920 
229980 
230040 
230100 
230160 
230220 
230280 
230340 
230400 
230460 
230520 
230580 
230640 
230700 
230760 
230820 
230880 
230940 
231000 
231060 
231120 
231180 
231240 
231300 
231360 
231420 
231480 
231540 
231600 
231660 
231720 
231780 
231840 
231900 
231960 
232020 
232080 
232140 
232200 
232260 
232320 
232380 
232440 
232500 
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ttttctcttt agctagagga gtggataccg ttgagtggtg ctctgctagc gatgagggac 232560 
ttcctcagcc cgatcttgta ttgttgatgc ttttagatgt tgaaaagtgt tcaaataggg 232620 
atacttttgg tgtcgaaaga tttgagacaa attccattca agaacgtgct agagccctat 232680 
tcctagacct cgcaaataag gacgaaaaga atgtatggat taaggtagac gctcgcggca 232740 
ccattgagga ggtgcaaact aaaattataa atattgtata taatattgtt gaagaataaa 232800 
gaattgtaaa ctgtttccat gatgtgtttc tttcctagtg attttcacat gttacctaga 232860 
aagactttgc ccgacactga aaatggttat tttgtcttgg acgagtctct tctggagaag 232920 
gtgtactatg ataacaacaa tgaactgatt gtaagagttg gtgggattta tatgcagata 23298 0 

tgcaagtcaa aatacatctt ccatcacgat gatccagaga ggttctttta tagtgtgttg 23304 0 

gaggattatc accccatcaa agagattgtt gaacgactag cagaagagga tggggtattt 233100 
ttaggaccgt gggagttttt atcgcgcaaa caagtgaacc tccaacacgg gtgctacaaa 233160 

gctcttttgt cattgccaga ggacaaatat tgtaacctat tattacccca gcaaatgaaa 233220 

accaacctgg aaaaaatgga agaaatacag cgtactagac tcattcactc tagaacgtac 233280 

aatacacccc agatagaatt gtctgaccag ctagatggat gtgttatatg ttaacacttt 23334 0 

ttcagtacca tgtaatattt ttgatatcaa attgaaagga aaatggttca gaaattggta 2334 00 

gtaaaaactt tattgtttct ttcctctttc ttttgtctgt cactttattc gtcagacata 2334 60 

acaaccataa tactgagtat gggtcttgga tctttgttgg gacattttgg gtgtgttaaa 233520 

agtatcacag gaaaaatcgt ctcaagaaag gcaatggata attataggct cgtgtcggtc 233580 

ctcggctctg tatgtggagc aggaattttc tgtggatacc agcaatattc tacacctcag 233640 

tctgcaccca tggcgttgct gtctgtcttc tgctgcactg ctactgctgc tacatcaact 233700 

attgtaggat tctggatatc gatgattatc caggcattga aaaggagtaa taaagatata 233760 

atataaaatt tgtgtgtttt atttgtttga taatacaatt ttcaccttgt accgaaaccc 233820 

attctctaca agcactgtgt ctttggacac ggggatcaat atcttggatt ttggatgaat 233880 

gttcctgacc ctcatatctt cttcattttc aaatacgagg agggatgttt tctcagtatt 23394 0 

cctgttgtgt atttggaggt gaacagggaa ggagaatagt tgattgttct ttggtgttag 234000 

atggaactcg tataccatga attgtactcc tccaatatct tcaagataga tgcatgatgt 234060 

atttttgtca taccatctcg tatggtcgtc agtgctcagg tacaatctct tgtggtggtg 234120 

gttacacatc tttccactgg tgggagagga agtattatgg ttcaatattg atggaggatg 234180 

gatttttata ccctcatgcg aatgcgatgt ggaaaatttt atatgtttcc gacaagggga 23424 0 

aaaatatgaa taaactgagc atacacatgt cgtatccaga tatctaggtc gggaagaatc 234300 

cagaaacgat tcaaaccatt tctggaacat tcatttctgg aaagggttac attttattgt 234360 

atctggtgca ttatttctgg taccattttt ggcacctctg tgtaatctga cattgggtcg 234420 

acccagaggt ccaccctccg aacttgacat taggccgacc cagcggtcca ccctctaaac 234480 

tcgagtgagc tgaaaaaatt tttataaaat ttttgatgag gaaaaataga agtaagatgc 23454 0 

ttgtcaatgc atgctccctg tggaaggtgg tggatgagat aggagggaga ggctatccag 234600 

atatctaggt cgggaagaat ccagaaacga ttctagccat ttctggaaca atcatttctg 234 660 

gaaagggtta caatttattg tatctggtgc attatttctg gtaccatttc tggcacctct 234720 

gtgtaatctg acattgggtc gacccagagg tccaccctcc gaacttgaca ttaggccgac 23478 0 

ccagcggtcc accctctaaa ctcgagtgag cagaaaaatt tttaaaaatt ttttgagatg 23484 0 

gagatggagt gaaacttcct aggtgtaagg tgcactacag aggcttgtga tcctttctgg 234 900 

gcggctcact ctctagaaac gtctgctcca gaaacacata aaaaagataa agtacatttc 234 960 

tggaccactc ccatttctgg tgtaatctga cattgggtcg acccagcggt ccaccctcgg 23502 0 

aacttgacat cgggcctacc cagcggtcca ccctctaaac tcgagtgagc tgaaaaaatt 235080 

tttataaaat ttttgatgag gaaaaacact accagtggag cgaccgtaca cactcctcta 235140 

gctaactact aaccgagaga gtgaggcaga gtctgtccag atatctaggt cgggaagaat 235200 

ccagaaacga ttcaaaccat ttctggaaca atcatttctg gaaagggtta caatttattg 235260 

tatctggtgc attatttctg gtaccatttc tggcacctct gtgtaatctg acattggcct 235320 

gacccagcgg tccaccctcc gaacttgaca tcaggcctag ccagcggtcc accctctaaa 235380 

ctcgagtgac gcagaaaaat tttttaaaaa tttttgatga gggaatatag tactccgtag 2354 4 0 

ccaacatata cacatgaaca catgaggcgg tctacaacag aaagagaact gatagctgtt 235500 

ccagatatct gggtcggcca gaacccagaa acgtttcaac tcatttctgg acaagccatt 235560 

tctggaaagg ggtacaattt cttataactg gtatatcatt tctggtataa tttctggcac 235620 

cttcgtgcaa tctgacattg ggtcgaccca gcgatccacc ctccgaactt gacatcgggc 235680 

ctacctagcg gtccaccctc taaactcgag tgacgcagaa aaatttttta aaaatttttg 23574 0 

atgagggaat atagtactcc gtagccaaca tatacacatg aacacatgag gcggtctaca 235800 

acagaaagag aactgatagc tgttccagat atctgggtcg gccagaaccc agaaacgttt 235860 

caactcattt ctggacaagc catttctgga aaggggtaca atttcttata actggtatat 235920 

catttctggt ataatttctg gcaccttcgt gcaatctgac attgggtcga cccagcgatc 235980 

caccctccga acttgacatc gggcctacct agcggtccac cctctaaact cgagtgagct 23604 0 

gaaaaaattt ttgaaaaatt tttgatgaag gaatatagta gtatagtgcc cgccaaagca 236100 

tacacacact cgcccatgct cgtctagctg atagctgttc cagatatctg ggtcggccag 236160 



WO 01/38351 



66 



PCT7US00/28888 



aacccagaaa 
cttataactg 
ggtcgaccca 
taaactcgag 
ttcttgacga 
gcgtcataca 
acgtccagag 
tttttgtggg 
acaaaatctg 
caggcccagg 
agcttgtcgt 
agacgtacac 
actatgtctt 
aaaaatttcc 
ctctgtggaa 
ataacaacaa 
atgcttcagg 
tttgcaaaaa 
gagggagttt 
aagaaatgga 
aactagactc 
aagggtacat 
tcgtttgtag 
aagcgtttgc 
atcactacag 
aatgtgttgg 
gacatttcat 
ttaaatgttt 
ggaaaatggg 
ttgacacata 
ctaaacacaa 
ttgaagaggc 
aggccataaa 
tgaagaaaaa 
gacgactgaa 
acaaagagaa 
aggcgaaaga 
ttttgtttag 
tagccttcaa 
aaaaaggcag 
ctcaatccac 
taataatatt 
tgtcagtatt 
gtcaagagtg 
aaggagagaa 
ataatgattc 
atttagaaaa 
caatgatact 
gaagtttcag 
ctgtatttga 
atgaaattgt 
tgtacaactt 
tgtatagttt 
ctagggatca 
tattgtatgt 
tcttcatcaa 
gtaacacatc 
ctgcaaagta 
agaattggcc 
tcataa tgat 
ggatattggg 



cgtttcaact 
gtatattatt 
gcggtccacc 
tgacgcagaa 
taagaggagg 
ttatatttcc 
acaagtttta 
ttctgtgtgt 
tccatgagac 
accagttcca 
ggaacttctt 
caaacctata 
atcaaacccc 
cttgctattc 
gaagttgtac 
gaactggact 
aaggagtgtt 
caacacacat 
tcttcacttc 
ctctcatgtg 
tttcatacct 
cccgctggaa 
gtatggagca 
aataagaaat 
tttcaacaaa 
atgtttatat 
gcatacattt 
ccaaaccttt 
tataaaccaa 
tttagatttt 
acctgaaaac 
cataaagaaa 
gaaaaaggaa 
ggaagaagaa 
tgaagaatgt 
agtgagaatt 
gaggttgagg 
accgaaacga 
gttagcccta 
tgaaagaaga 
tgctttggct 
gttttaatat 
tggtggagga 
tgataaaaag 
tattggtata 
tacctcaata 
tatcctaaaa 
ggaaactcat 
tacatgcgta 
aacagagaaa 
agctcacacc 
tctaactctc 
tctattgaaa 
atttgttgat 
tttgctggaa 
ttcatatggc 
agcagatgtg 
atggaggaaa 
aatatctttg 
agattttcca 
gtggaatggg 



catttctgga 
tctggtataa 
ctccgaactt 
aaatttttga 
cagtaggtga 
aagaattttg 
gtctgataga 
gtataaaaga 
ctcagtagag 
acccaagaac 
ctcccgtcct 
ccgatttttg 
aatgaagatg 
cactgcactt 
aacaaaagaa 
cctgttccag 
cttatgagtg 
tacttgtttg 
tttgcatcta 
ctcagtaaca 
gccaacggaa 
atagcaatta 
tcttgggcaa 
gatgcaaaag 
aatgtgacga 
gatattgaag 
tgcttgtcta 
gatgacacaa 
acgactaacc 
atttgctcat 
aaaaaggtgg 
aaggaagaag 
gaagaactag 
gaactctcaa 
gtcaagctga 
aatggtgtat 
aaaattttat 
gtagaagaac 
gaaaagaaaa 
gatggagaat 
agtgtgtgtg 
cattaatatt 
gattggacaa 
agagagcaag 
cattcgtatg 
gaggaactag 
ctcaacaaag 
aaaattttaa 
cgctttgctg 
gaagcgttca 
aatgaccaaa 
caccctttca 
ggtaatggta 
actttagtgt 
tcaataaaaa 
atcttgccag 
cagattttat 
gaatctccag 
tcggccaact 
gtggtgtcac 
ttggagaata 



caagtcattt 
tttcaggcac 
gacatcgggc 
aaaatttttg 
ggctgcttgt 
ctgacgtcaa 
tttcttaaaa 
taggtgcaaa 
aacgcaccat 
ttctttctac 
acagtagtga 
ggaacaagag 
tgagttacca 
ataagacgaa 
aattcaaact 
ctatcccgtt 
aaataatgtc 
atatgttaaa 
ggaagaattc 
tagcgaaatt 
aaataccatg 
tggaagacaa 
acacatacgg 
attgtctgga 
aggaagaatt 
acgagaaacg 
ataagtgttc 
tttttagaaa 
ataaggaaat 
ataacgtcaa 
aagaagaact 
aactagcaaa 
caaaaaggac 
aatataataa 
gagatatttc 
tactaaaaga 
tgctagaaga 
gtatattcct 
cggaggacat 
atactataac 
tgttaaacga 
ttaccacaat 
caacattccc 
actactcatt 
aacacacgtc 
aagtactgaa 
gagaaaaaat 
tgaaagggat 
taaataagaa 
attctataca 
ttaaaataat 
atgatggtaa 
tcgtacctca 
attttagaga 
ataagtaaaa 
ttttctgtat 
ccccttcccc 
taatggggtt 
tgtacaaaag 
aaatggggga 
acaccaggtc 



ctggaaaggg 
cttcgtgcaa 
ctacccagcg 
atgaagagat 
ttgatgtgtc 
tggaccataa 
aaaagagggt 
ggtagagaat 
ggttgcttca 
aaactttctt 
tgtagtttat 
tatagtttct 
aatggtttct 
tgaagaagat 
cctcaactca 
tgacagggag 
cacgtcaact 
tatggaacgt 
ttttactaac 
catatgcaat 
ccctgataaa 
ttaccctgca 
ggatcataat 
aattatagag 
tgttaaagag 
ttgttacaaa 
taaagctaac 
atgtccccca 
ggatttgttc 
attagacaaa 
agcaaaaagg 
aaggacagca 
agcagaaatt 
aataattgaa 
aactgcagcc 
ttccgatcag 
agaaacaaaa 
aactaaagat 
aattgcgaag 
ttctcatatt 
ataaataaaa 
gtctacttgt 
attcgacctc 
tttcattact 
aaagattatt 
tatatacaaa 
tatactgatg 
tcttcccaag 
caatgaacgg 
aaatctagta 
aaaagcgtgc 
tggaagaaca 
tttttcaccc 
acatggagat 
ttcattttga 
tgttatctca 
atccctcatg 
agttacattc 
ctccttggtc 
tttatcaaac 
gccgaggagg 



ggtacaattt 
tctgacattg 
gtccaccctc 
tgagtaaaat 
agccacatct 
aaggctttgt 
gggaggtttg 
catcatatgg 
actccgtgtc 
gaagctcaca 
tgtgactctg 
accattggag 
tccgtcttag 
aaaggtattc 
ttgttggttc 
aatatatgtg 
tttcagacaa 
ggcaaacaag 
tttgaaaatg 
gaaaaggaaa 
actaatgatg 
ttgctatatc 
gaatctctca 
tttataagtg 
aagactgtag 
ctcccatgtg 
tttagatgtg 
actatacaat 
aatcgtgcat 
aaatcaaaac 
acagcagaaa 
gaaattgaag 
gaagaggcca 
aagggaaaaa 
ataaacatgt 
gagttggctg 
cttgacagat 
gatgaaacgt 
aaaaacaacc 
gagaaactac 
gtataaaatg 
tcgaatttgt 
gtccatacac 
gaaacgtgta 
gacacgggta 
gctataaacc 
gatgtagaaa 
ggtaaaaatg 
cattactacc 
gattattata 
gcatatttca 
gctagattat 
ataacacacc 
ggacgacctt 
ggcatttatt 
gcaataatgc 
gtatttcctc 
aatacagcaa 
atgacttcac 
ggcatggcgg 
gcaaacttcc 
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cgtgcttgta 
ggacagggaa 
aagaagacag 
tgtacaatgg 
tgaacatgct 
tgcttataag 
cgtgtgttaa 
ccacagataa 
ttgtaagggg 
aaaaactccc 
atcatttcct 
gtccacccag 
atctggcatc 
atgaaaaatt 
actaacaagc 
cctaaaatcc 
aaatactgta 
aaaaagggcc 
aaaagaaggg 
gataaatacg 
taccaataat 
agaaaagtac 
caaaagggat 
ataaatttta 
tttctatatt 
aggaacagaa 
tcctcatcat 
ggatcgctgt 
attgagatag 
tcttccatgg 
ttagtcgtgg 
acctgttgtt 
tagacttact 
tcttcttcaa 
ctcattaaaa 
gggaccttct 
tgatattgta 
gttttccgat 
tggcgaagga 
tgaagacaga 
agatccagtc 
tttaacactc 
ggaactcttt 
gggtatggca 
cgacacatat 
agcgtctcca 
acgccaaacc 
tcccacgccc 
agacattaat 
tgaaatgccc 
cttatctctt 
tactgccgct 
taacaagaga 
gtatgcgaat 
agaagtgttt 
cttatttttt 
tagacacccc 
catcaacgac 
tcctctgggc 
ttcgactaca 
gaatactaca 



aagctccttc 
tcgcctgagt 
tacactctct 
agactcgcaa 
gtccgcgttg 
tgagtccatt 
tttcrtcggcg 
gaatgtggtt 
tgaataagga 
tcttgttcca 
tatgtgcaat 
aaagggggcg 
cagaaacgga 
ggacatcaac 
atttcctgcc 
actatcgaga 
gaggaagaag 
ctggacgcat 
atatctgcac 
tggatagccg 
caactgaaga 
aacaaatatt 
tatgaagcaa 
ttttataata 
tttactgcct 
gagcggaccc 
cgttatcgtt 
ccttctttgg 
caacgatcaa 
tgtaagaagg 
tggccatttt 
ttatcgtgta 
cagcgttagt 
cgccgggtta 
ggagatggaa 
tatagcgttt 
gtgtctattt 
aagttgagaa 
gaagaagatg 
gaagcacatc 
gctgtagatt 
aaggaattgt 
gacttcattt 
aacaggagta 
agtaaatgtg 
atttcattgg 
tttttaacta 
gacacaaagc 
attagccaga 
cgccaacaaa 
ctaccttcca 
gccaatattc 
ctgaaaactg 
ggaaaaataa 
aaccatcttt 
atcaaaatga 
aaacaatttc 
aacaacataa 
tattacaccc 
aacaagagac 
ggacaccttc 



cacaggggcg 
acgtgggaca 
ccaacatcgg 
tcggtgaaca 
aaaaagtcct 
gctgctgctt 
gaggactgtt 
agacttgaat 
ctagactcct 
agaattggct 
ggataattat 
aatacacata 
tgcaagcagt 
acgcattggt 
taacctgcag 
gagtatatga 
aaaggactgt 
gggaagcagc 
tcgttttcaa 
aaagggaaac 
accagatgtc 
acagaagaag 
gtcaattttg 
aaaggtgttt 
cctcttgggg 
agccagaagc 
ggcagtgtcg 
tcccatccat 
aaagagaccc 
gccagcggcc 
ttttctgttt 
cggtttttgt 
tgtcgcagtt 
acgacaatct 
ctgaatttat 
taggacctga 
tgcagacaaa 
aattgggtcc 
gagatgaaaa 
gacgaaaaat 
tacccacgtg 
gcgatttaat 
ttgaagaacc 
aattcaccaa 
tagtattttt 
tactagatga 
gggcaaaaaa 
ttttggctat 
tggcattaac 
aaaaagtaaa 
cctcctcctc 
ttcttccagt 
atgatgggga 
gccctataag 
tgttccctgt 
aatcctttgc 
tcaacgggcc 
actcttccac 
aagagagccc 
tacaaaacgt 
agaagagtgg 



tattcgtctg 
gcactttgaa 
tgacaatatt 
tggcgtcggt 
gctcggcaat 
gctgcttgct 
cttatataca 
ctatttgaaa 
ctaccagaat 
agtgggcggc 
aatttgtaat 
acatttctgg 
atctgaaact 
gtggtggata 
ccaagcaaca 
agaaagagtg 
ctcagcaatc 
gataaaggaa 
cgcagcagac 
gtcagaaaaa 
atctctagtc 
tgccatactc 
ggtgtataca 
taattataat 
taagacataa 
atcatatccc 
tcatcatcat 
acattcatga 
atgcgagcca 
tgggctccgg 
gatgaagagg 
caggacacct 
ggtcatgttc 
tagatccacg 
aaccggagaa 
tccgtgcgag 
taatatacag 
atggattgat 
tgaagacggg 
gatgaagaaa 
gcgagaaaac 
agttgaatgt 
gtgggagatt 
ggaatcatta 
tgaagcagtc 
tatatattgt 
cccatcttta 
cgacgagtgc 
tgaaagggac 
caccttcctg 
ctcttcttcc 
gtacaggagt 
aaatgcatca 
gattatggta 
ctttgcctct 
aagtgcatct 
gtgcaaatgg 
gatgtggagt 
tcaaccctat 
gcagccatta 
gaaattccgt 



cttgtagttg 
gctcatcttt 
cttccctccc 
agggaggagc 
ggagaatatg 
gctgttaggc 
aattcgttaa 
ccatgacaag 
tcatgtgcgt 
tgcatgccag 
atttgaagtc 
gggtgtattt 
ctctctcact 
aaagggggca 
gcatattaca 
gagaatctag 
gattctgtcc 
cgagaaaacg 
gccaaaacac 
agaagaaagg 
aacacaacca 
aacatgcaat 
aacaatgcat 
ttttgtccct 
aaacggtggc 
tggtcctgtt 
cggtgtcctt 
cggccaggac 
tagacatgga 
tcctgccgaa 
tacgtcaatg 
tacgaaccta 
cgtcaattct 
gcgtccgctt 
ccaccttctc 
gacccagaaa 
gtaacaaaag 
aggagcggaa 
ggtggaaatg 
ttgtcctttg 
agtacagaat 
ggatgcatca 
aaagaggctg 
attgactggt 
aactggtact 
tgtgtctttt 
acagtggctt 
gtgcaacact 
tgcttcttcc 
gacacaatga 
aacaacaaga 
aacttttcta 
gcctgtattc 
agaaaatcaa 
aaagacactg 
ttactcctcc 
atgactctag 
tacacgctag 
cagacatgcg 
tacttttaaa 
ttaatctctt 



caatgaagta 
tcgtccgaag 
gacttgttct 
atggtttgca 
ttttccaatt 
gtccagaaag 
agggtgacgc 
gcattttctt 
cattgcaagc 
tctgtatgac 
gtcgggtccg 
agtgcattat 
tttgacatca 
accggctatg 
caatggagga 
aacaatggac 
tggaggaaca 
acctcgcagt 
gtaaagaatt 
aagcaacctc 
aaacactcaa 
acatcaataa 
aagttttcaa 
gatgttgtgt 
ggcggaaacg 
cttatatttg 
atcagtgtca 
gaggatacca 
gccttcaaga 
aggaagagtg 
taacgtaaag 
tttgtctttt 
gttcactata 
cggcagcggc 
ataaaatgag 
gggtatatgt 
aatgggaatt 
ttgagaataa 
ggggaagaat 
ttggaagaga 
ttgcacgtcg 
aatcaaaaga 
ctgacgttag 
tttttgagtt 
tgaaatctca 
cctacataag 
catccttttc 
ttttaaaatc 
ctcttttaac 
agagacctac 
gaaagagaaa 
cagcatccaa 
ttatcgaagg 
ctattattcc 
gtgcgaatat 
ctggactttt 
cagaaaacaa 
cagattattg 
gcaattttac 
cactcttttg 
gcttaatgtg 
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atgaataaca 
aaactcccat 
aacaacggag 
ttattgcaca 
aagggcaaag 
gcaaatggac 
attcaatact 
gggtttattt 
ctcctcgacc 
tggaaacaaa 
gaaaaaagag 
tcacgaggtt 
cttaataacc 
tttcggtcgt 
ttaggtatca 
acatgga tga 
tgactgatgt 
ctgatgcaca 
cactcgaagt 
acacatcaag 
aggcctttgt 
aagttggcac 
gaaatctttt 
atcgtgcttt 
ttgaggaaat 
tt taggtgac 
aggcgcaa tt 
aatagacaaa 
aatcaaaatg 
tgaacacaac 
ctcatcagtt 
atcaatcatc 
cggattcact 
aaagggcaat 
tgaatccgta 
ttgaagcatc 
catgagctca 
ctcgtccatt 
atccatctcg 
tcttccctca 
gtccgtcata 
ttcgagatat 
gtgtgctggt 
ctcccccaag 
aaaaacattc 
cttgcataac 
accaaaccca 
aaacttggac 
cgacacatct 
cggagattct 
gggtttttct 
t tcagtcacg 
gacagacagt 
tggatgagag 
agacgcggca 
atttaatgtt 
aaagttgtaa 
ttttcctccc 
agaatctgag 
ggctgaaggt 
cgattggtaa 



agtggagtac 
ggaccatgaa 
actactattc 
cgcctgccat 
accaccagaa 
aactctgatt 
gcactgatga 
agatctcgaa 
ccaactacat 
cccaaagatg 
taaaaggcga 
gtcatcacca 
aagcaacgtt 
gtcggccatc 
caacactgtg 
aaacctccgc 
gtcctttgac 
gatgaaggaa 
gactgtgggg 
aaagatcaac 
cggtagctcc 
ctttgtgtgt 
cgacatgtac 
tttatataga 
ttataccata 
tggaaaagtt 
tcatacgctc 
aatggaatgt 
gacaacatgt 
tcactgttaa 
ccccaacatt 
ttaggttgac 
aatccttgtt 
tggtttaaca 
atgggtttct 
ctaacaccgt 
gcggctggct 
ttacatttca 
acacaaaaac 
agagcgtgac 
aacttgataa 
aattcagatg 
acaggaatat 
agtgtagttt 
tttgccatgt 
ctgttccagt 
acaggccacc 
gctttttcct 
tgcacgtcag 
agaccggctg 
agcttgaccc 
gtatcacaag 
ggagtatagt 
aatctttttc 
ctaacagaag 
gcaaattgat 
ggtgtgctct 
acctctccat 
ggaggggatc 
atcttgaatt 
agtcccatca 



actcattcca 
aacaatgtac 
tacatgcagg 
aactaataag 
gctatatgac 
attattatta 
ttgggttcat 
atgcaaccac 
attctggcag 
acacatccgt 
cagctcgctt 
aaggtaacct 
cgataaagaa 
ctcgccatca 
accaagacca 
attcctgtga 
agcgacacct 
gaagatgcgg 
cagaatctca 
atcactggta 
aacacctcct 
ggtaccacct 
gtgcacgtca 
tagggaattt 
ttttattgac 
tagcaatatt 
tcggcctatt 
ttgtatagat 
ctattaacaa 
taaattcagg 
taaaattgta 
cagttattcc 
gagacaaccg 
ttctctccct 
tgcatttaga 
acagagtatc 
tggaggaagt 
ggtctccatt 
cttcaacaca 
taaaaacagt 
cttttccagt 
gtccaatcat 
ttctgcggta 
tcttctccaa 
gatcgatgct 
tcaagagagg 
aagatgaagg 
ttgcgtcaat 
aaatgaggcc 
caatattggt 
acttgtcaat 
aagatttcaa 
ctctagcctt 
tgtactgggt 
gcaattgctg 
tttcatcaat 
cgtagtcctc 
tattcttctt 
ggcgagtagc 
gccctcctcc 
cggttgctgg 



ggtgtcaaaa 
gagattgttt 
cgaatggtaa 
tatccacgct 
atctctagac 
cggttacgaa 
tttatttatt 
ccaagagagc 
ctcaaccagc 
tgaaggaata 
gccaattgtc 
ttttttttgt 
aaaaactcgt 
ctgctgtgat 
tcgaaaccca 
ctgctgaggt 
tgggcaaaat 
atcttgtcat 
cctttgaggg 
tgcagatggt 
ccttcacccc 
ttggcgcacc 
cctactctgg 
taatattaca 
ctacttaacc 
atccttgaac 
ggtcttttcc 
agaattggca 
ataagccaac 
agctgtatga 
gtacttttta 
atcaattact 
agtgacaata 
catttttcca 
ttgtagacct 
cctcatccta 
agtcgagggg 
taatacatca 
catctttctc 
ttcaaataca 
ttcaagcatg 
tacgacttga 
actttcccat 
tatagattca 
catggttttt 
gctagtgtct 
gggttgttta 
acagctctgt 
taattttgcg 
atcagcctct 
ggaagaaaag 
gtgttttggc 
caaattgaca 
gtcggcattg 
cttcttgata 
agtaataggc 
gcggagcctc 
tagtctcttc 
gtttttagag 
ttctcctcct 
cgtaagtttt 



taagtgcagg 
cttcgcccaa 
tggaatatcc 
ccagaatggt 
aaatgtttga 
aattccccaa 
ttattaatca 
aaaacttctt 
aggggtccag 
gaccctggct 
ctgttacgta 
cctcgccgac 
catggatctt 
tgctgtattt 
cacagacaat 
tggatcaggc 
caagatccgc 
cactcccgtg 
aacattcaag 
gccaaagatt 
cgtctctatt 
aattgcagct 
cactgagacc 
acaataagaa 
ttcttgctat 
gggaaacatg 
tggtcataca 
gacaaatctg 
ccaaaagtca 
ggatggttac 
catggtacaa 
atattgtcac 
tttacaattt 
acgatagcta 
tcaggcaggc 
cacaagctgc 
ttaggttgct 
atcacagaaa 
acacctactg 
aattcacagt 
taatataccc 
gaagcatatt 
ttcttacatt 
catgctccag 
atccaccctt 
aaaacgga ta 
gtacggttgt 
tttatggtct 
gcctgtctaa 
gccactgcaa 
ttcctcttta 
cccttgacat 
gtagagaatt 
gcgcggtgct 
atgtctgtcg 
ctcttcttct 
aaagtcacag 
atgggcctaa 
caacaagatt 
tctccttctc 
gtccgttggt 



tatcatatcg 
taataataat 
tatcgggggt 
cacctgtaca 
tataatagaa 
agactaaaaa 
tcggaatttg 
ccccaacaat 
gttctggatc 
tactgtaaca 
ctctgtggtt 
aaaacgacat 
tctttcactc 
attgtgattt 
atcgagacaa 
tacttcaaga 
aatggaaagt 
gagggccgag 
gtgtggaaca 
aacccatcaa 
gatgaggatg 
accgccggtg 
gagtaaataa 
aataaaacaa 
acaatgaatg 
caccaattac 
ttttagatac 
cagttctctt 
tggcagtttc 
taaagaacct 
ttaaaccaaa 
tttttatgtc 
ctaccaaaac 
tggctgaaat 
ggccagtagc 
catgatcttg 
gtacaacatt 
tgcctgcaaa 
cgcctttcat 
acatttcctt 
cgactttaac 
ctggttggac 
gggcagtgac 
gggaaagctt 
taccattatt 
ggtttttact 
tagaccacaa 
cgtagttgga 
tcaaatggaa 
agtaacacgg 
ctacatttgt 
gggaacggag 
ttgcacaatt 
tcctcaacac 
catacgttgg 
tatttctatt 
gaggcggtgc 
gagaataagg 
tcacagctat 
ctggcgtttt 
ccccaccctc 
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cattttaacg 
cactcgtgga 
actcttcttt 
ctgggtattt 
ggaaatgagt 
gcctgaatag 
ttcaacatat 
aacaaatcac 
gttacagcgt 
tcaccatacc 
gaacctcctt 
cttcttagac 
ataaatcgac 
aacatcatac 
ttagcagcga 
gcatttgata 
tcacacatag 
actttaaaga 
tcttggtgct 
tcgtcatcat 
gtgttcttga 
ttatcagcac 
tgtttggaca 
gcatatttct 
tttgtccagt 
ggggatgtag 
agaatgtcct 
tatatcagcg 
actgcagtcc 
ggtataacgt 
ttaacagctt 
tcttctacct 
gaatttttct 
ttctggcggt 
actaaccgct 
ttgccaatga 
gaaggactcg 
gatacttgca 
atagga tggc 
ttcaacaaga 
cagggccatc 
tctgtcctcc 
gagaagaacg 
aagccacaaa 
acaagaagga 
ctattatagg 
acgaactcga 
ttttcaggga 
ataacaccaa 
ttaacaaact 
aaattagagg 
ctgtcaatga 
cagaaatatc 
ttaaaggcaa 
aagttatagg 
caacggacga 
gtttgaaatt 
actacaagac 
tggtctttgc 
aatacacaaa 
gctacgcctt 



acgtttttta 
tagttccttt 
tctgtatata 
accagtccga 
gagtaaaaac 
aatcctcgaa 
ttgcacatcc 
agttcctact 
cctggatttt 
ctaaaacagc 
ctctccaaat 
agtttgaaat 
caaacacacc 
aaaatttgac 
tgggtaaaat 
tgtcagtttc 
gtattttcca 
tgtaatcgcc 
tcttttccct 
catcatcatc 
cactgttacc 
tctcattctt 
cgtaagcctt 
tacccgtaac 
gattatatac 
ttttactccc 
tcaataaagg 
aacaagtttt 
tcagaaacgt 
tcttgtacca 
ggggtaaatt 
ggcggcattg 
gttcaactac 
catcttcaac 
cggtgaatgc 
ccaacataaa 
ataacccgaa 
ctgacgtgct 
gtcctcgggg 
aaaacaagag 
tcaagctcca 
tccacctaat 
gctaaagacg 
cgagcgtgtt 
taattacttc 
cgcacacgtg 
tataaaaaat 
ccatgagaac 
atgcgtgggg 
ggtggtaaaa 
cgctaccaga 
aaacaaggac 
gcacgtcttt 
cactttttat 
gtggacagaa 
caatttattc 
taaaaacgtc 
tcaagagacg 
agaaatcgac 
aagcactgtc 
ttatgctcta 



tactctgctc 
atatatttaa 
ttaggtacaa 
atataacagt 
ttgctgagac 
gcttacaact 
aaacgttaca 
atttttgtca 
cttggcccac 
acctctggca 
tctggtagat 
aatgtccaat 
cgcaccagaa 
atcagaagga 
ctgtacaggg 
catggccttc 
ttcagggtgg 
agtgattgtt 
ttctgatgct 
atcttcactg 
aattttcttg 
cattatgttc 
gacgatcttc 
aggtttatga 
agtcgtgtac 
atatttcgaa 
gtccaacgta 
catgtctttg 
atcacttttc 
gtaatcgagt 
agatacagtt 
tcttttgttt 
cagaactcta 
aactggagcc 
gtcttttaaa 
tatcggcaca 
acctatatac 
ggactcgttg 
ggattcttta 
aaaaataaac 
gatccagtcc 
cctctccctc 
tcaaaaatac 
gatgcgttga 
caggttttaa 
aaacaggtgg 
aagtgctctt 
cttatactac 
gcagaaatat 
aaactcccca 
aatttactgg 
attcctcctt 
tccactctac 
caacgaaaac 
agtgaagcat 
gtgctacccc 
ctctataaaa 
ctaataaatc 
aagtgtttgg 
atacaatata 
aataaggcaa 



gggctctggg 
ccaatttttc 
acacacaata 
caaaaggctt 
atatactgtc 
acattaaatt 
gttttcatgg 
gccgcattcc 
agagctttaa 
gtaatggggc 
agattgtccg 
tctttactct 
gaaagggtct 
ggtttcccgc 
ttcacctttg 
attaatttgg 
atgtggggtg 
gtgactggag 
tcatcacaaa 
tccgacattt 
tacatttttt 
ttcagtgaaa 
ctccatgtcc 
gtcagcgagg 
aaggcctcga 
tccatatgta 
ctcttattta 
tatgacacac 
ccacttcttt 
atagattctg 
ctgtcctctc 
gtgtttttgt 
taaagatttt 
atgtctgtac 
gccattacgg 
gcatacattg 
tttgatcctt 
gacgtgtcgc 
caggaataga 
caactcaagc 
cagacccagt 
ctcctcctcc 
gtttaaacaa 
aagagaacca 
agtgctcgat 
ccaaaagtag 
tagtgtacaa 
aaattgccgt 
gtgttaaagg 
atgcaccatc 
agaataattt 
cagaacgagc 
acagactgga 
caacattcga 
caaaacaaac 
attctttcaa 
atagtaccgc 
cccagattga 
atgttctttt 
gaggaaagtt 
aacattctcg 



tcagaggggg 
aaacaggggt 
caggtcaggc 
attttccttg 
tggccgcccc 
tccccatacc 
atactacacc 
tttgcctctt 
tagatctctc 
agtaattaaa 
cttctttatt 
tgtgggaaaa 
cgttagaaga 
tcttacaaaa 
aagctctaac 
ggatattcct 
aaagaacaag 
tgttgattgg 
catccatgcc 
ggtgggcact 
caatagttaa 
gatccctctc 
ttataaagtg 
cttggtagag 
tggtttcctt 
gtagttcttg 
aaattttgtg 
aacatctcac 
ctttcttcgt 
cattggctgc 
cttcaaaaat 
ttccagagag 
ccactttttg 
cggagccaac 
tccatgccat 
aattgcctaa 
tatctaaaaa 
tagaggacta 
tgaccttttc 
accagaaaca 
tcctaaaaca 
tcctcctcct 
agctcttagt 
agcattaaat 
aacaccttct 
cgaaatcgaa 
cgaaaatgag 
ccagttattc 
caacgaaaaa 
ctcatcttca 
caacaaggga 
caacctggac 
cactaaaagg 
taataaattc 
cactaaatcg 
taatttggca 
acatcccggc 
ttcggcgaaa 
ggccataggg 
tagaaggtat 
cgcagtatcc 



ctatagaaat 
tattaaaact 
ctgcaaagga 
atctagagct 
ttccttatta 
gttctgggtt 
ctcgtctcct 
gacagattct 
ggacaattgt 
acagacaaca 
caacaataaa 
gggtcctcca 
tggttcatta 
atcccagaag 
ttctattaca 
attaacaaaa 
ctctagaagc 
gtatagagat 
ttcacagtca 
attattatta 
aacattagca 
cacgttgttg 
tatgactgat 
ct tgaaggcg 
tgccagctca 
tttgttgtcg 
ttcgaattca 
aagacagatg 
gtcaagaaga 
gatggtttct 
cactttgtct 
agtaattgca 
gaagagagat 
tttgttcaca 
atctcaaagg 
tactataaag 
atataaaccg 
tagaaaaagt 
aagacagtga 
gaaccaaaac 
ccaaccaatt 
aagccctcaa 
gata t tgttg 
acagaatatg 
gtaccaacag 
ctggccgtga 
tcgttaaaat 
tctaggcacg 
aacaagtttg 
actgtgctag 
gaaaataaca 
acgaccaagg 
aagcttttct 
aggtggacag 
ctagacaagc 
gaccacttac 
aaacgaaatt 
gagtacaaga 
aagaatgaca 
ttaatattct 
cctctaccat 
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ttaatttctt 
gttttttgtc 
ccccatccgt 
agggggtggg 
tggtgtcttt 
gtgtagaagt 
ggtgtgattc 
cgaaaaaagt 
caaataaggt 
aagaagaaga 
tacaaaaagc 
ctcaatacaa 
ttccagccga 
ctattgacaa 
gcatggaaca 
cattctacga 
atagattgaa 
tgtctatact 
ttctggccgt 
ctaaaggcaa 
tagtgaattt 
acacctttca 
aaaacacatc 
gcacccgaac 
cctatacaag 
atcttgggca 
acgaacaaga 
aagaattggg 
tggctgctgc 
tggacgactc 
aggtctcggc 
ctaatgcacc 
gggggttctt 
tactatgtcc 
ccttcgaaag 
aagacgaaat 
ggaataatca 
aatatgatat 
cggttattgc 
gtttagaaga 
ttgcctactg 
ctgaggatta 
gaaactgcgg 
aaccgcgtgg 
agccgaccgg 
acgtccctta 
gcagcttttg 
tttgatgtct 
acatgcccgt 
cccatgtctc 
attgaactca 
atgtacaagg 
cacacatcta 
aaacacacat 
tccaccacta 
ggtggggaat 
aaaacagacc 
ctggccgctt 
gatgatgatg 
caaaaaacaa 
atgaatgtac 



taaccttttc 
cacattgacg 
ctcagccaag 
tgtgagggat 
cttaggtttt 
gcgtgtatct 
agtcatcttt 
atcactcgaa 
gagggccgcc 
ggcgggcctg 
cctcggatta 
tttaggggat 
aggagaaacc 
atactacggc 
aaacacaccg 
agaagaaaag 
tagagacgtg 
cgataaaact 
gtggtgcaaa 
ctacgaatgg 
tggaatatta 
cagagacagt 
cctatctgat 
ccacgtagga 
aggtgataaa 
cacgaattca 
gaatggattg 
aagaggacgt 
agccattgca 
ttctaaagtg 
cgccagagaa 
agcagctggt 
ttctgaccca 
caatacagga 
tgaaacctct 
ggtatgccct 
agacgctgta 
ggttgaagac 
agcccgtatc 
ccctgaattg 
acgagatcta 
ataataataa 
tgaaact tgt 
agcgccttct 
ctcttta tga 
ttgagggtat 
atgtatcaga 
cttcaatgac 
catgcactgt 
ttcttgcctt 
agaaattgta 
acagctccca 
aaaaggacaa 
caagattgca 
acaactctgc 
ttttatattt 
ttcaatcatt 
cttcttcctc 
atgaagacga 
caacatcaac 
ttaaaaaagg 



tccttcatgt 
ttcgtctatc 
cggctcatgg 
tttggttcac 
gctgaaatgg 
ccagctctcc 
atatatttca 
tcggcgcttc 
aagagatgcc 
acactctctt 
cctgtcccta 
tttgtaggcg 
gcaggatttc 
gaaggagcct 
tatgacacat 
gatgttccac 
gaggaatatt 
aaccaaaaag 
aggaacgtac 
cttggtgcta 
ggtgacttga 
gatagattac 
cgaaaacaat 
agagtgactg 
gacatgtttg 
gctaattttg 
tgggggtata 
ttagggggcc 
atttcttctg 
agaaaaatag 
aaggcgagag 
tcatccagaa 
gacaagcgcc 
tttcttcatg 
a tagtgagat 
tcaacagctc 
ctgacgttgg 
gaggaagcgc 
agtaaagtgt 
caaagtgtgg 
agattttgta 
caacaactgt 
acgatggtat 
aggatcgttc 
agaaga taag 
acttttggaa 
aaagctggtg 
catatggatt 
cgagaaaatt 
ctttaacaat 
cctctcctta 
acaatcttcc 
gactcgttta 
gcctaggtgc 
atcttacgcg 
taaatacaca 
agttaataac 
tttagaagac 
aaaaactaag 
acttccacct 
agctgttaat 



attgtcatgg 

aacacatgtt 

atatcgattc 

cttcaaaaac 

ctatgggaac 

aacaaaggat 

cctttgtttt 

gcctcatcat 

gaatagaagc 

atgcccatct 

agataaaccc 

tggaacaact 

tcggcatgtt 

tttcagacgt 

cttcagcgtt 

agcaggaaga 

taatggcttc 

aacgtttcat 

tgaaaaagga 

aaatgtgcaa 

aaataaccaa 

cctcagttgc 

tggcccttgt 

caacatcatg 

ccgccctatc 

ttccatattt 

ctcgcagaac 

tgaataaggt 

ccttagatat 

cctccacctg 

aagctagtat 

acagtaacag 

agaagcttat 

ctgctgttcc 

tacgtttgag 

ccgaagctaa 

atgatgaaga 

gtgaaagatt 

gtgagcggaa 

atgaacaatt 

atatattatg 

tcaaaaatga 

acagagtacg 

ggaggaggcg 

aagggagata . 

agggctctaa 

tattgtagtt 

gatggcaata 

agtggaggtg 

ctggtagaga 

ctaacgggct 

ttcaacggct 

gaggctgaag 

gtctgttccg 

tacaaggcaa 

atctttgaag 

gagcctgttt 

gacgatgatt 

aagaaacaac 

atcagcaaaa 

ggaaaacgga 



tccgtttctc 
tttccccatg 
cgccctaatg 
aagtctccat 
aatgacggct 
atctaaatcc 
attccacaga 
ggggcagacg 
agcggaaatg 
attgggtctt 
tctcatgaca 
tctaaaggct 
tgataatcta 
ggttgaaaat 
gatgacacct 
aaattctaca 
tcctatgaag 
gtctgttggg 
ttggaacgaa 
ccatttactt 
taaacttgac 
agatcagaaa 
tcactcgtgc 
ggctgtcgat 
ttcatcgctg 
tagtagaaat 
ctctgaaaaa 
aggggtggct 
gggggaagta 
cttaaatgtt 
taaacgtctt 
gtttctcctc 
aaagggtgaa 
tgattttgtt 
actgattaaa 
taagaagagg 
taacatcgtt 
acgccaccag 
aaatccaaag 
gatacgggaa 
tgtcttgcag 
gagacgatac 
attgttgttg 
tggacgccac 
aatgcatacc 
ctaaacccga 
gtaataacac 
atagtaaaaa 
ccgaatctat 
aagaagcctt 
cggcagccgg 
cttggacgtc 
ttttagtcag 
atctgttata 
gaaatttgtg 
agaatggacc 
ctgagacaaa 
gttgtgatga 
ccaagaaaca 
ccaatcacga 
aaatgatgga 



cattccgcca 
ggcacagccg 
aaaggaggaa 
acaagaacat 
ctcttatctg 
ctagaaagat 
ttcagtggtg 
cacgcccaca 
gaaggtgtgg 
ccttactcta 
gcatcttctt 
aagagagagt 
gtgaaagatt 
gtaaaacaag 
atccctaaag 
caacaaagat 
atggtgtttg 
gatattgccc 
tacgctatcg 
ttagctgatt 
acaaataccg 
aaatttataa 
gttaacgtga 
gcgcttcgta 
gatatgtacc 
tacctatgta 
ttggccaaag 
aaaacagaac 
gaagctgtaa 
aatgcagcca 
cttctggcca 
aaagatttgt 
gcagtttctg 
attgagtatt 
cctgaaaaac 
aaattagtaa 
aaatacaaca 
gacaaacaat 
aaaaaacgtc 
ctggctgcca 
acctaatatc 
ttttaaccaa 
cccattggtt 
gtctgtgcga 
ctttaggata 
tttagctgct 
tcaaggcaat 
gtatgaagtt 
tcacaagaaa 
cgccgaaaga 
aggaggaggt 
gttgttgttc 
taacaagata 
tgccctatgc 
tgttattgaa 
tttcgactcc 
ttcatcggca 
tgatgacgat 
aaccaagaaa 
caacatgttg 
ttctttgtca 
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ggaaaaaagg 
tcatccgacg 
actaacaata 
attttttcac 
gttgttatta 
ctgcctcacg 
tttcgcattt 
caatctgact 
tcatacataa 
gattttgtcg 
agagaacggc 
tcaccgaccc 
gacaatcaga 
ctaaacgcat 
cacgggcaat 
gaatattttt 
cagctttgcg 
acatcatcaa 
agcaagtgga 
tagctgctca 
gcgttaatgt 
ccactggttc 
acggtgtaat 
ccttgattaa 
tcctccctcc 
gaagtagtac 
accaccaaat 
aggtgtcttt 
cttctattct 
cagcagcagt 
acaagtacag 
agaaatggag 
tagtgaaact 
agatggctgc 
actatgttcc 
tggctacact 
ccctagaaca 
gaaattcccc 
tcttgaaggc 
aatggacaga 
cacccatttc 
tgacttatac 
agccagaaaa 
gaaaaaatat 
ttcccataat 
tttcaacacc 
ttctggcagt 
attttaggga 
cgtcatcttc 
ctaatctctc 
tgtcactagc 
ggataattat 
tggtggtggt 
aggagcaccg 
tgttttctgg 
cgttctgggg 
aactcgttct 
atgtgcgagt 
gtgggacgga 
tgtgtggaca 
ttcttccttt 



gccaacactc 
ttgttgcagg 
gggatagaaa 
aacacaggat 
atggaaattg 
aaaatattct 
tgactggcgt 
ctgtggtgtt 
gagaaaactc 
tgaaatcaca 
aattatgctc 

gggggtcatt 

tgtgggaata 
aggtgtggaa 
ggaaaaggaa 
agacaccggt 
cttaattaga 
acaattgccc 
taaaacaaga 
agtaatgcac 
tgtcatgtca 
gggatggaaa 
agatagagta 
aaggggcatg 
agcttcttct 
tactacgggc 
caagctatcg 
tacggtagaa 
tcagtggtta 
agtagaagaa 
tgatactgat 
agcgtgcaaa 
ccacaaatat 
ctttgttgcc 
tcctcaaacc 
ctttactgac 
gctagaaaat 
tgatggtaat 
gaagcagtac 
taaggccccg 
aaagagacct 
acctaggtgc 
aataacagaa 
taccaaccaa 
tgtcgcctct 
ttctgccacg 
tgaagaggtc 
caataagcgc 
tgaaggtggg 
taaaaagcgc 
catgtaacgt 
cataatagaa 
ggtgtatcat 
gcagtcccac 
gtcgaaggaa 
aagaagattc 
attcaatggc 
catgaacatc 
tcaagaaaaa 
tctttatgga 
acttctacgc 



taaaaaattg 
agaaaatgag 
agactatgtg 
gaacaataac 
tttaaacttg 
tttttgccca 
ttcttgcttt 
gagtaataat 
attgaagcga 
ggacactaat 
agcttcaacc 
ttacccgttt 
atgacactgg 
gaaagactag 
ctaaataata 
tcggcaatct 
tcccccgctt 
accataacca 
gagaagatag 
ctcacagacg 
gagaccaatg 
attatgccta 
gaaacattag 
gatttagtgg 
acttccaata 
attggatcct 
gaatatagga 
ccttcaatag 
cctcctctag 
gaaaacgggg 
ttttggtcta 
ataaacgatc 
gattggacaa 
ctcatgacct 
ccttctcagg 
gtgtacggtt 
gaaaccggga 
tctgtcaggg 
tttacagaaa 
tcatctgtaa 
agaaaagata 
atataccaca 
cacgtatctt 
taccaaacaa 
atgtcatcaa 
accaagtttg 
agaaacttta 
aacacaactc 
cgtttttcat 
gcactggaag 
gtctcgatat 
aattaggtaa 
tgttattggg 
ctttttcatc 
atcgtgtttt 
cttattatgc 
cagaaacttc 
aacaccgtac 
ctaccaccat 
gggataacga 
agaaagaagt 



aaaacctccg 
gaagagaaca 
cttccatgcc 
aagttggcag 
tttgttactc 
cctttagtcc 
tttgatagaa 
gctgcccatt 
agtgtgagga 
ataggcatac 
ctgagtatgc 
ccaaggatgt 
accctcacga 
ctcaatacaa 
gtagaaatat 
tcgaagacaa 
tgaattctgc 
aatctctaag 
tggtggattc 
gagagatgac 
cttcgtccat 
acaatttcaa 
tggagtttgc 
atatgcaaag 
atactcctag 
tgtccattct 
ctggattatc 
acggcgttca 
ttaaaaggcc 
acaataaacc 
atgtccccgt 
gggcaatgat 
ataaaactac 
ttagaaaatt 
gaagtgaata 
ttgaatcgaa 
ttgaaagcat 
tcgtcagact 
tggccatacc 
aggagtacaa 
acaacgacgg 
ctgaacgttg 
tcaacaagga 
actacaaaag 
caaaaacaat 
ttcaggatcc 
aactcaaatc 
tttgttcaca 
gtccatcaga 
aaattgcccc 
atgacaattt 
aatgggagta 
tgttgttaca 
gtcttcctat 
aagtggaacc 
taattccata 
ccctttgggc 
tcatgaaaca 
aattctacct 
ccagaagtgt 
ccaaggggaa 



ctgctgctgg 
acccctpttc 
ctcaaataga 
aaagtgtagt 
aacacagaaa 
agcatgtagg 
ttgaaattgt 
cggctattct 
ctgcttcggt 
ctttaagcaa 
tagctggttt 
tgtactcttt 
tttggatata 
tactctacct 
gaaagagtca 
catgttcaac 
tgtattttca 
gagaagtcaa 
tttcagcata 
gtacgtcccc 
ctacttgatc 
taaaacactt 
gtgcaagtgc 
aactataagg 
agtagcgata 
tgcagaagat 
cattactgaa 
agcagagcat 
agaagtggta 
ttctgataaa 
cacacctcta 
tagtagttgg 
aaaggttgat 
ccaagacata 
cgcagtgacc 
tggaaataag 
atacgtccta 
ggaaaaggaa 
tcctattaat 
gtatttttgt 
cggtgtggag 
tttagtccat 
tttgaacata 
catattcgaa 
gactgtaaac 
gccaaaaaca 
tgttcttgtt 
aataactgaa 
gtcacttatt 
agagactgag 
aaacaacctc 
cagaagaata 
cttctaggaa 
tcctttactc 
aagaaggaca 
ttcagacatg 
ttgaacctta 
acagaacctg 
aaaaaatggt 
gggtgcgggc 
tggcttgctg 



tggtggtgct 
agtgagtcct 
agaagtcact 
caaacattct 
aaagtatatc 
atttaacaaa 
tttttccgac 
aaggctattg 
gaaaggaatc 
caaggaaata 
gggaaaatga 
ggatcacgag 
aaaatcacat 
atggatttta 
atattcacgg 

ggaggaggtt 

agcaagaact 
gctagagata 
cttagtgcta 
gatgggcact 
ataaacgacc 
gaaatgagag 
gtcgcatcat 
tctatggatt 
atgacatctg 
ggatcaacac 
aataatagag 
cctctatccc 
gcagcagcag 
gataacgaag 
attacaccca 
aaaaataatc 
tattttgata 
ctagcggata 
atgtctaacg 
ccattgtttg 
aatatcatag 
atgagtttcc 
gaaaaatgca 
gatctaacag 
cattctgcgt 
ctttactctg 
ttagaaattg 
attgtggacg 
aactacataa 
gggaaacaac 
cctcctcctt 
caaaattgcc 
ctcaagtact 
actagcattt 
ttaataaaga 
tcctggtcgg 
cagtaacaga 
ctgaatcgag 
ccttaatcaa 
actgttctga 
ttttctgttc 
acgatttatt 
ggtctgatgt 
aggcatttgt 
ctcacactaa 
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cggaaagaca 
tacactcaag 
aggcacaccc 
aggaagtact 
gatagctgtt 
gcgggaatat 
gagagtggaa 
acttatcgga 
tactaatcct 
agaaatcttt 
gagacttggt 
gtcatcaaaa 
cagacttagt 
tgtaggttat 
gtcaagggac 
aaaacaaact 
ttctatattt 
tttattcatt 
cttcccttgt 
atccattact 
agtaaacatg 
ttcagtcatg 
ggcaaagttg 
cagtgctcct 
ttgtccacag 
tgtctttctt 
aatacttttg 
gaacattaac 
atctacatgg 
atgtttcgat 
gtgctgcata 
gtcactaact 
aggaagacgc 
ctcgtaaccc 
ctgattcagg 
tttttcatac 
atggggtgca 
aaaaactggg 
acacaacttt 
acgaacgata 
agccacatct 
tagcccatgt 
agattttccc 
gtcataggca 
ggaagattct 
attaaaatca 
tttaatgtta 
aggattacta 
aacaatagtg 
agattcaaat 
tggattaccg 
cacgtacaca 
gtccttagtc 
cacattttgg 
caaaagaatc 
aaacttgt ca 
cttgtgagag 
ttttacttcc 
agtagaagag 
atccactaaa 
catcatcttg 



tctgagggtg 

cctatcatca 

atgaacccta 

cctcagtgtc 

tggggtgttg 

gggagcgaag 

gaagaagaag 

cctcgtagaa 

attgttagag 

ggcagtaact 

atcatgagca 

gccttatttt 

gtatggaaga 

tcctctgatg 

ctgatagaaa 

gtagttgcta 

tcaattgtaa 

ccttattcct 

tccttaaaca 

tcattacctt 

ctctttctga 

gcaatgatta 

ggagagaata 

ccagttctct 

aacaggacat 

ggtgccaaaa 

tccttaatgg 

atacaatata 

atattttctc 

agaccacaag 

ctcttacata 

tttttcaaaa 

aatttgacat 

agatttggta 

cacggattgg 

actaaaaata 

atatcaaaca 

tgtattttga 

gaagagggct 

gcagtgtata 

ccattattaa 

tgctgaataa 

gatctactta 

gaatacattg 

gccacctctc 

gcaatgcctc 

cacaagttac 

ttgcgggtta 

tctctaaaga 

ggaagatatt 

ataatttgtc 

aatactgtca 

acgaaccttt 

ccctgataaa 

gtagttaatg 

aaaagtggtg 

atggttttct 

actcttaatg 

gaggatgagg 

ttcgagactg 

tcagttttgg 



atactaattc 
ctgacgtcac 
aggatatgac 
ttgtatcgaa 
cagactctaa 
gaagaagaag 
aagtagaagt 
gacctttgac 
aggttgtgga 
caaaacttcc 
gagtttcttc 
gtttggcagt 
actggaagcc 
ttttctggca 
caaaagtgac 
tgtttatttc 
tatattacaa 
actactagtg 
atatgtttga 
ctcctgatat 
tatccaatgt 
tagcaccata 
tctttgtggg 
ttacatgcaa 
cttcagacat 
gactaatcgt 
ggaattcact 
cattagcttc 
ccttctcttg 
tttccgtagg 
caacatccat 
taccttcaat 
ctctttctcc 
catgagtaga 
gagatggcaa 
tataatggac 
acaacatttc 
tatagtcttt 
gttttacaat 
gctcaatcat 
caccaaagaa 
ccttagcagt 
ggatgaattc 
ttactcctcc 
tagagacact 
caataggcat 
cccgttcaac 
cagcaacata 
aagtgtgccc 
gaacgtatac 
gtccttcttt 
attcaccatt 
cctgagtgcc 
gagtaataga 
ccctaggaga 
tactgcgagt 
cgtggctaat 
cgggcacaga 
aattgttatt 
cattctttaa 
aagaaattgc 



agcctacctt 
agaagataat 
atattttgtc 
ttcggacatc 
agacctccta 
aaatcccggt 
tgcgctacct 
aacaacaaca 
agattttgat 
cttcattaga 
ttccatcgcc 
ctgggttggg 
ttttacttct 
tgaaactctt 
aaagaaaatt 
aggtattgtt 
aataaaaatg 
tatagcacat 
tacatttttc 
cttttccaaa 
tgaatacttg 
cattttacat 
aagaccttct 
agaaaattgt 
tgaaaagtgg 
atcagatggt 
aacaccagaa 
gtctgacata 
agctgtgcat 
acgacaaaat 
gtatttatta 
gttatttgaa 
cccgtccttg 
attggtggta 
aaaggcgaat 
ataaaattgg 
aatagtggac 
cttgttcttt 
accactggct 
tttcttgtta 
aaagtctcta 
gttggccatc 
ttcagcggca 
atctccatta 
aactgtatcg 
agtataagcc 
agaatgattt 
cgcatcctgg 
ttggcaagaa 
attgttccca 
aatttctagg 
cttgtacatt 
atggatcatt 
tttagcccta 
ttcggctgcc 
tttaatggcg 
gtcagacaga 
attttcaact 
gttcttttta 
aacgtgtttt 
atccttttct 



ttcataagcc 
atgatgatgg 
aacgattttt 
ctgaacaaga 
actaaacatc 
gttgaggagg 
tacattaaga 
accactacta 
tacgagtctt 
tttcttgatc 
aactttaaaa 
gatgaacata 
gcacctatta 
agaagcaaaa 
ggggaagatt 
tgtataacag 
cctaaattct 
gaaacaaaga 
acaataaatt 
tccaccacag 
gcaaattcat 
tttttgaata 
tcttcctcaa 
acccttttca 
agattgttag 
gtttctgggt 
gctgctgcta 
cacagccatc 
gtggacgtta 
ttaccagtcc 
accttatttt 
aactcatccg 
caatgggcaa 
attgattttg 
gaagaattca 
aacacaattg 
tcctttggtg 
tgtttcgtct 
atagagtagt 
cataatttct 
atgtttttaa 
caagacgcat 
gctaccacat 
tagtcatcat 
tggttaataa 
ttgtctttac 
tgaataatga 
gtagatgcac 
tatatcgtca 
aatcctccat 
catttttgtc 
aaattagtaa 
ctattagatg 
gttaccttct 
ctcaaaattt 
ttaattaatt 
tccatattca 
gccgcagccg 
gtttctttga 
ttagtgagtg 
tgtttggctt 



ttcaacgaac 
gaagaatgtc 
cagacgatat 
gggaagaatg 
aactggggga 
aagaagaaga 
aaagtggaaa 
ctactactac 
ttaatgaacc 
aaaaaaattg 
ttgaacaaga 
cccctaaatt 
ttgtccagaa 
ttgttgatcg 
gggctaataa 
taacagttat 
aatacaatgg 
aagagacacg 
tgtcaacttt 
ttccaacacc 
gcccttgcat 
gaggatatgt 
aagtactttt 
ttggatgaac 
taggtgaaat 
tactttcaag 
tttttgacat 
ccttttctgc 
caatagaccc 
tacgcaagtt 
gattagatgt 
cgcttgttat 
aatctttaga 
ggcgtacata 
aaacgttcaa 
tagccttgtt 
ccctattcac 
catattcttc 
gtgtgattga 
cataatgttt 
gttctgtttc 
taaacgataa 
ctttagaagg 
ccattgcagc 
agtcgctccc 
tcatggcgca 
acaagttttg 
cagaaaggtc 
tggcttgact 
tcttgatatt 
ctagccgttc 
ccactccagt 
tagtgaaaat 
ttttctcatc 
tcctgtctgt 
gttgccggta 
aaactacaaa 
ctgcaagtga 
.tgatagaagt 
tgattgcacc 
tagagttggc 
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catgaggcgt 
cttcttcctc 
agatttttca 
gtgagcagtt 
accattattg 
gatgttatca 
tgatctaccg 
tgaaataccg 
catgatatat 
cttccctaca 
atggagaggc 
ggcccattct 
gagttttcca 
acgggctcgt 
aaagtcaact 
tatatcagtt 
actactt ttc 
atcgaaataa 
tgatgcacta 
tttcgctagt 
caaattatct 
aaagaaggcg 
tggtactgca 
ttcatccatt 
atacaat ttt 
cacaatactg 
ttcagatatg 
agatttcatc 
cttttcgttt 
tgctctgtcc 
gaaatcgtca 
cggggaagaa 
tttaatagca 
aattgcttca 
tctagagcaa 
ttcttggcca 
tttcttcgtc 
ttcttcgttt 
cttcttgttc 
tattttcttc 
tccaagtagt 
taaaacagca 
aaggaacctt 
ttggattttt 
gttgagggta 
tgcacacatg 
tccagaatca 
tttcccagct 
gacagaaatc 
caatacct tt 
ggaggaagaa 
ctgggttctg 
attattgttg 
acgtttactc 
tttatactag 
ggatcgtcct 
atgtct tatg 
cgctaccgtt 
gtccttgg tg 
aaaggctgat 
acgccgccag 



tcacacatat 
ctctcatttc 
gagtatttaa 
tgaagcactc 
aaaacatctt 
aaatattgca 
gccattcgcg 
tacattaacg 
gttctcgaaa 
tctcctcctg 
ttatctgccc 
gagggcgagt 
ttatcaaatt 
accaaatctg 
aaatgaccta 
ctatatatca 
ataccagaac 
caatagtcct 
ttattattac 
agttctttgt 
ttaaagtaat 
ttcctcacac 
tctatacact 
aaggaggtca 
tctcggccac 
catacatttt 
tgagctagct 
atatccaaag 
gcatcagaaa 
ttttcaagtg 
tcttcttcgt 
cttggtgccg 
cttgagacta 
tggcgtttat 
aacattttca 
aaattatctt 
tcagcttcat 
tcgtctccct 
ttgttgttct 
tggatatggt 
ataacaaaca 
tcaataaaca 
aaatttgcca 
tctagagtta 
gaaaacacac 
agactctcct 
atcaagccgt 
ccaggaagac 
gagttgcaca 
ctggcctcct 
gaagacccag 
gtagccagaa 
ttgttcttct 
cccatgatgc 
ttctacctcc 
cttttgctta 
gcatttttta 
gcagcggcag 
cagtcagaat 
aaggaaaagg 
acaaggaagc 



ccttaggctg 
cctttgaaat 
gtgagttatt 
tggaaatctt 
cctttctgta 
ctgggtcaaa 
tatctaccaa 
aataagcgtg 
gggcagaaag 
ttgatatggg 
ctgctcctcc 
ggaacaagcg 
taacctgttg 
aaggcttatt 
cacgatctag 
tttctccact 
ctccacttcc 
cgtcgtcctc 
cagtttcttt 
atgttaaagt 
tagcgatagg 
acgctgccac 
tgggatcaga 
gatgtgtatc 
tgtaatcaat 
cagtgtactt 
ctgaacatat 
aagacttctt 
tgctcttgca 
catttctttc 
catcaaattc 
tcacttcctt 
aattgttgtt 
tgacaatatt 
gtatctttat 
gttcgactga 
cgtcactttc 
cttcttcttc 
tgttgttctt 
tagtgttatc 
ctcctccgaa 
ccaattccct 
acacttgtat 
tttcatcttt 
tttctgggta 
ttgcagaatt 
ggtcgttaag 
cacccagatg 
gatacctcac 
tgcacgtatc 
aaggggatgc 
aggggtatga 
tttgtttttt 
actctcggtg 
acgaggttgc 
agattttgtg 
tacatagcaa 
gagggatatc 
ttttttcaga 
catcaaaaac 
agagcccgcg 



cccttgttct 
actagtcacc 
gtgcctattg 
taaactaccc 
cataagaaat 
gttgtccatg 
gtcttgaagg 
cttaataatg 
ataggaagaa 
catagataat 
tcctcctcct 
tccagacagt 
cattctcact 
ttcagtgagc 
agtagatgcg 
ttcttcatca 
tccgtcttcc 
ataataagct 
cttcaagaga 
ttcattctct 
agtttcatcc 
agcagacgca 
accgtgcaca 
cttaaatctc 
ttctctcatg 
ttctagattt 
tccttcttct 
gtctgataca 
gcgtaaagag 
catcagtttc 
gtcacaattt 
tttattagaa 
aactgacttg 
ccgcaggtta 
aaaaacttca 
cccttggaag 
ttcttcttct 
accacctttc 
ctgaggtttt 
ttccactact 
aggtatatct 
gcaactggcc 
cactttccat 
cctaaatata 
catcaaggaa 
agatttgcaa 
aactgcaatt 
gaacacagat 
tgccagagac 
tgggagcttg 
cgctgcagct 
tttgatgttg 
ggtaactggc 
aatgtcggac 
cagatggtac 
tgtaaccctt 
ttcctcatct 
ttcatctgct 
agatatcgat 
cgataacaat 
cgccagaaac 



tttgttataa 
atttgcatgg 
ttaatgttgt 
ttaacaccct 
tcgttaacca 
aaaacactcc 
ttggcagtca 
tctattctct 
tatgcaggta 
accctcaacc 
actcctactc 
gcagatttat 
agtgtataca 
cattttttcc 
tcacggtgca 
tcgtcatcac 
aaatccattg 
gctgctgcag 
atagatctta 
aaaaatgaag 
acaaagtctt 
cggttatgtg 
tacttgacgt 
tctgataggc 
gctttggcca 
gcattgctca 
attagagcgg 
gaagt tgttc 
aatccattca 
aaaaatgctt 
tcttctgccg 
gcattaccac 
acgttattac 
cttgggttgc 
gtgtatagtt 
aaggtactct 
tcctcctcct 
tctttcttct 
tcttcctcac 
gcagaatttt 
ggcctgtgtt 
attgtatact 
tttagtttat 
actggaatct 
gaacctcctg 
cacaggagca 
agttccttga 
attcctcctc 
tgttgcaagg 
accgcagaag 
gcaactgcac 
ataactggaa 
tcttggccgc 
acgggggtaa 
gatact ttat 
atcgttacga 
tcttcaggcg 
ttatttggta 
ttaccagaaa 
aacaacaaca 
aggactttaa 



actcttcaat 
ccaacactat 
tattttcagt 
ccatcatttg 
ttgaagggaa 
catttctgga 
tatcaaacaa 
ttaacgacga 
ataacatact 
aatattcgtc 
ctgtgcttgt 
caagagaagg 
aataccacac 
acacaacatg 
atttatctgg 
ctcccccacc 
cttcttcttc 
ctgtacgtga 
tattttcaga 
caactgtacg 
ttccatctct 
cagaattgaa 
tcaatatctc 
tagaaacaat 
tcatgtcgaa 
ttttcttgag 
aaagggggag 
cacttcttat 
aagctcccga 
cattgttaaa 
tggattgggc 
agtcttcaac 
tcttcattaa 
caaaaaagtc 
ttgcattatc 
tttcttcctc 
ccccctcttc 
ctttcttctt 
tcgggttctt 
gcctgttatc 
tcgctatacg 
catcgataac 
cggccacaaa 
caaatccttt 
gtttcttgca 
tgtctgcaga 
ccattgtagt 
cagcataact 
aaacagtgct 
cagaggagga 
ttctgagcac 
ctggatttga 
tggagcacgg 
aaccaatcct 
taccaaaaat 
ttgttacaat 
ccactgccgc 
cctacatagg 
aacaaaagaa 
acaagcgtca 
gaaggaagag 
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gaaattatcc 
tctcagagga 
tggaaaaagg 
atcatgttga 
aagaagaata 
tctggaggag 
tctagggtac 
tatgcgacat 
gacgtggagg 
aacggtgatg 
ccatattata 
tctactgaaa 
gatgaaaggt 
gaaaataggt 
tcagtcatgc 
gttaaaatta 
taaaatacat 
tatcctcttg 
gttagttttc 
aaaggctttc 
gccggccagt 
agcaataaca 
tcctaaagac 
gttattatcc 
aaacatataa 
tgttgattct 
gcatatggat 
ggggttttgc 
ttgaatacaa 
aaaaattcaa 
cgttaatgac 
ttaaaacaat 
cgatatttac 
gagctatcca 
tcctccactc 
acctaaagat 
agctattatc 
tacatggtat 
aaaatatgta 
taatgaaaat 
tacagcagat 
gtggaagatc 
ttttttccca 
tacaatatct 
agaaggagat 
aattgccact 
tatatcaatt 
cttgttcggt 
ctcagttacc 
atttattagt 
tccgtagggt 
atctgacgac 
tagttctgca 
atttgaaaca 
cattatcctt 
aacagaccac 
aacttccata 
tcatcgagcc 
cagtattcct 
tccactacct 
atcgcacact 



aaaagaaaaa 
ggaagaggat 
acatttggat 
ttatattagc 
actggttctc 
taggaggatt 
aatattccac 
tagtaaggaa 
atagccagtt 
aaaataaaga 
atggagggtt 
agccgtatat 
tcttctgcct 
actttttacg 
ccactattga 
agcctgtaac 
tttaaagtat 
aatatggaac 
ttggccccac 
ttgaatgcac 
gctgtgcccg 
atgacaatta 
gccaccaatg 
ccttccatgg 
attttggctc 
gaattcaggg 
agtacaaatt 
tgggaacaat 
gtagttatgg 
gtcaactttc 
gtagataatt 
tttataatga 
tgatggtcct 
ttttttttca 
cctcctcctc 
cttcttccac 
agaaaaatca 
tggaaaattt 
accatgcagt 
tttgcattac 
gaacctcttt 
tttacaaagc 
ttatctactt 
tgggccttag 
aaattgtagc 
gtaaaatcac 
tcaacgctag 
acaaaaaaag 
caggcactaa 
gctctaattt 
ttaatatgtc 
ctcgtgtcca 
agtctcgcct 
aaaggtaa ta 
atcacacaca 
taccatcatt 
catcattctt 
tgtaccccaa 
ggagagaaat 
ttacttcccc 
cgccaa tggc 



tacacgtcaa 
taaagcgagc 
tgtttttcct 
cccattaata 
tggggccatt 
atgggataat 
atcaacgaaa 
cggaatgtcc 
ctatttacat 
ccaccttgaa 
tttatcatac 
taacacggag 
cgagaaggaa 
ccacatggca 
aattagtgat 
tggtacttct 
tatgtttatt 
gacgggtggg 
catctggtgt 
cgcccactcc 
ccaggagaga 
ttgttattaa 
gcgaagccat 
ttgttgcaca 
aaggcacctt 
ttgtgatttt 
gtcttgaatt 
gtcctatatg 
aaatggagat 
tcgtcatcca 
gttgcattca 
tttttaaagg 
gctactacct 
tccgtaaaag 
ctcctcctcc 
cattattcga 
ccaggatcaa 
tccacccttt 
acaataaaag 
gcgtattctt 
tttcaggctt 
aagatttatc 
caaaattctc 
ccagtgttct 
tggtttcata 
tcatgtgaag 
ctggcgtttt 
ttaatgccga 
tgttgtcttc 
tccctctgtt 
ctccatacaa 
aaacatttgt 
ccattattat 
acaaaataaa 
tattttttga 
cattacaaca 
ctttacatga 
ccatgggttt 
gaaggggtct 
gagtccaatt 
agccaacaag 



gagaaagaag 
atctagaggc 
ccccttttag 
tttgttgggg 
agtaatgaca 
gtttctgatt 
aatttagtca 
acaaatcaaa 
tttttccacg 
agggatgaaa 
aatatcaaca 
ataacttcca 
tacgtcgaag 
agtaattatg 
ctgactggca 
ataatttatg 
gttttatttc 
gggttttttc 
ttttggtcct 
tggtattgat 
tgctaagaga 
aagtaaacca 
cttggctatc 
taataatagt 
caaggctgtg 
tggtcgcacg 
cctcatcttc 
ttcccttttt 
gcgaaccact 
ctcttctcat 
aagcactgat 
gataactata 
ttaaagtcat 
tatcgtcaac 
cccagtcttc 
aaatacaaaa 
aacaggggca 
aatacataaa 
gtgatcaact 
tgtaactata 
aattgcgata 
cgtttgatta 
aattgttaga 
gattagcccg 
cctcccttgg 
attttcaatc 
tagtacgtcc 
tttcggatca 
agacattggt 
gccctgcatt 
atctacccca 
caccctagag 
tatttttaat 
atacataaga 
cggacaagta 
acagaaaatg 
atcaagaaag 
acaaatagac 
ttcccgtcca 
agattgactg 
gaatcgggaa 



actctaatta 
aaaggactgc 
aaaggatgtt 
tgaatttctt 
ctactattaa 
cagagtcagg 
atatagccat 
ggccatatga 
ttagaaattt 
gttttgtgtt 
acccaaatcc 
tcgtcagcac 
atggtgaaga 
ttgtaaaggc 
catacaataa 
aaaaattaca 
ttctaaaatt 
ttacccttct 
cctttcccct 
cccagcgcag 
ggagccttga 
aaaagtatat 
ttattgttat 
gtgtattata 
aaaatattac 
tttactgacg 
tttgtcctct 
tatiatagtaa 
aaaagcttgt 
ttttagggta 
gtgcttttca 
aagagaccct 
gattgtgaac 
cctccgcttc 
tgccttttat 
aaacctactg 
ttcgtattag 
aatttggtta 
ggtttttcat 
tcggaatata 
tccttttcag 
ttgaacaaaa 
gaaaaatccc 
tgatttattt 
tcatcacatt 
aattgaggaa 
catgcaaacg 
gggaaaacat 
aaagttgctg 
ttattattga 
gccatggaag 
gaaatttccc 
gtgtacaata 
aagattcaaa 
cattaacatt 
gggaagtttt 
tattccaatc 
tattaaattt 
ggtcatttgg 
tcgtcttttc 
gatcactcct 



atcggcgcag 
ttagagggat 
tcttttaatt 
cgtatacata 
ctccacaaat 
gacgtttcca 
ggatggagat 
aacctacaaa 
taaaccttta 
gatcgaatca 
catttacaat 
cactggtaca 
aggagttaca 
tagtttcaaa 
taagaatagt 
aggaacaaaa 
ttattttagg 
ttggtttttg 
ttccttttcc 
aagcgatcgt 
ttattattac 
tttttcctcc 
tattgttatt 
acattaaaag 
atgcagataa 
tccaacatag 
gtaccatcct 
ttattcgggt 
atatttatca 
atatctctcc 
tgatattcgt 
tcttctaata 
cctctcaaaa 
cgcttacccc 
tatcgcttcc 
caacaagagc 
acattcttgc 
caaatagtta 
ttctatatcg 
cttcatccca 
attccatttc 
aatcgagccg 
tcttccagac 
ttcctcccat 
ttactctaaa 
ttctcttccc 
cattgaggct 
ccgcagccac 
ctgctttttc 
atacagtctc 
cagttgcagt 
catctttaaa 
aaaacaatta 
aagttttatt 
atcaacattc 
gttgcacaag 
atcccattgc 
atcccatgaa 
tccaaca ttt 
ttttgtctcg 
cattttcttt 
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ggcggtgaag 
agtttatttc 
aactcttcac 
tccaccatag 
cctctcccca 
ggcttgtccg 
catacagcaa 
ttcccataca 
tctgttgggg 
tcaattttct 
tctctggtca 
ttgcatctca 
catcttttgc 
gtctcatatt 
aagtgttcta 
aagcacctca 
tcaagaatag 
taatactttg 
acttcacagg 
ccttctggca 
aagtcgataa 
tcttctcgaa 
ctgttggagg 
catgatacca 
tccagtgata 
attttgccag 
tgttcaggtg 
atgtgctttc 
tcagaaagga 
taaatagccc 
atcatcattt 
cgagacaact 
gtacaaccct 
aaacacgcgt 
tgggcccttt 
tgttgttgtt 
tatgcacaat 
actcttccaa 
gcgtcgccat 
ttcaattctt 
ttcttgaaga 
cacgatcttc 
ttgatcacat 
aggatgcgta 
ggtttttgtc 
gagttggcac 
acaaaaatga 
ggaatcatgt 
ccgcctccgc 
gaacagtctg 
tagtaaacta 
gaacaactat 
gagatactta 
ttagtccatc 
ttaaaaaaag 
cttgaacgga 
taggaaacga 
ttcgtgtgac 
ttttcaaaaa 
taagacagga 
agcggtccac 



aaggagaagt 
cttcggcaat 
cctctcccag 
ccaaagaagg 
ttgcgacgag 
gaccaagagt 
ctggaatatc 
aaatccccac 
gaagattgag 
tggcattcaa 
atttattgtc 
agcacaattt 
atatatattt 
tctccccaat 
gtttattggt 
aaaatactac 
agggcaaaat 
tgaatgccat 
aggcagccca 
gactcagacg 
acatcatttc 
gcacttcctt 
tttttgacgt 
atttcttgat 
ataatcctcc 
cttcttccaa 
tgagatctgt 
cttcttttag 
tggtagagac 
caatttcatc 
tgaagagttt 
tggtttttct 
tacaatcaca 
tcatgtgcag 
gagtcactac 
gttgctgctg 
tcaagaacac 
aatctaaagg 
acttttcaaa 
cttggttgat 
gggaaataga 
tgtcaaagga 
cttcttggca 
gccaacaatc 
cccaagtctt 
ataattttcc 
ggatggggaa 
tgttaaacac 
acacttcggt 
gtgcacgcac 
tagcattttg 
cagccacagc 
gaccagtagg 
catcttctaa 
atgcagattt 
tacggcccaa 
tctcgacaac 
gtcaattgac 
cgtttatata 
tgtttttttg 
ccttcgaact 



gacctgaaga 
tgcagtgttc 
ttcttcatac 
tgctccacat 
tttgttggcg 
agacatcatg 
aatattgccc 
agccttcgct 
agctccaatg 
aagattgagg 
ccagtgatag 
ttcatcatta 
caagtccttc 
ctttgagtcg 
aaagttcttg 
accggaaaga 
aatatccaaa 
ttttgcaata 
agaggcggct 
tacatcatac 
tgccacatct 
ctggtcatct 
ctccaaaagc 
gaaatcaaag 
tcctgtatat 
ggcatcattg 
attgttaatc 
gaaggcatat 
cctcgagcac 
aggcacagca 
ttcagcaaac 
tccttgatca 
gaggtttgta 
tgccagagta 
agccagtttg 
tggattaaag 
gctacgcttg 
tgccatattt 
tcctgcaggc 
ttccctcaag 
ttgcctcata 
ttcaaaaagt 
caattcactc 
attctccact 
catgatgaat 
agtgtcacta 
tttttcatcc 
gtacatcagg 
cttgcgcata 
aacaggtcca 
aatgcctgga 
cgatttcaca 
ttttgcagtt 
aagttcaatc 
cacagtagaa 
cattgtagta 
tgta tgctga 
gctcgtacgt 
atggtgcaat 
gccaatacgt 
tgacatcagg 



ggcgctggtc 
ttgagtttca 
atcacatcac 
tcaatcaaat 
tacctgcacc 
tacttggatt 
tcattcaata 
gcagctgaaa 
atatcactag 
gttgtgggtg 
accattggtt 
tcatattcca 
tctgataggt 
ccagcctttg 
tcaactccta 
ttattctcga 
tcagtctcgg 
gagaggatca 
gcacacacag 
aaagttccca 
tctcgtcctc 
gatccacaaa 
acatcttcaa 
cactgtccaa 
gaattggaga 
ggcacataga 
acttggagtg 
tctgccttgt 
actcctgcat 
ctgcaagcgt 
atcttggccc 
gatgtagctt 
tggtcaattc 
gaggcggcat 
ctgataccac 
gtttgcttag 
tggttgttat 
aaaaccctag 
tctgatcttc 
atggggaagg 
ataaaggttg 
tgccatctgt 
tggttactgc 
agcgccttcc 
tcttcttctg 
ttggcaatac 
ctggtaatgc 
gaagaatgaa 
tgcttcacaa 
ctttcaatgg 
gcaatcacat 
cttcccattg 
ggagtatctg 
ttggaggaag 
ctagaactct 
tatctttatg 
ggtgcctccg 
ataaacaggg 
ttcctattac 
ttctggcgca 
ccgacccagc 



tcttattatt 
agaaatagtc 
cagatgtcct 
ccatcagcag 
agggacattt 
tgcagcaatc 
gcttgtagag 
cagcaatttc 
ccatctcctt 
atgcagtttt 
tgcccatcct 
tctttagaga 
caagtcgacg 
tcaagattct 
tcttattggc 
tctctccatc 
aactgctatt 
tgtttgaagt 
atttgattgc 
atttttgaat 
caaatagatc 
agactgggat 
catttctagt 
actcgaacct 
aactaaatga 
accacataaa 
catcatagct 
ttccgcattc 
tattgcctgt 
acatcattga 
agtgtacgcg 
caacacttgc 
caaaatcgtc 
ttcctccccc 
caccaccgct 
agcaacaaaa 
atcttcctcc 
acttgaaaga 
cttcactttc 
cactaaattg 
cgatattctt 
tgttgtcgta 
cactacatct 
tcatgtcaag 
gtgcagtttc 
gcacatcctc 
catgcatttt 
gagcttcact 
aatctccccc 
agtgtttgtc 
tgagtgtagc 
aaccactaat 
tatcagcagc 
gtgcattttc 
tgctcatggt 
gtagaagttg 
caggccgctt 
tctattttta 
tcaaactagt 
atctgacatt 
ggtccacccc 



attttgttgc 
gataatgtaa 
gatggcttct 
tttcttccct 
tctgttctgc 
acattctaca 
aagacaagtc 
attgtcttta 
tactttgtca 
ggcacgagga 
gtagtttccc 
ttttacacca 
tacagggaac 
cagagccaaa 
ccatgtaatt 
ggcagttcct 
ctgctcttgg 
gttggagggt 
tgcataacct 
cacagtctcc 
gctggccaat 
ggcaaaatat. 
aatctttaga 
cttgtattcg 
ctggtggaag 
catggaggat 
agcagatttc 
agttcccaga 
aacaaaggca 
ttgttccttg 
agcatacttg 
catcaatttt 
caacgactgg 
tgtaattgac 
agaagactgt 
ggcactatca 
tccctgtaca 
ttcacgttga 
aataaaaaca 
atcattcagg 
gcgcacattg 
tccagaaaaa 
agagaaattg 
aatagtcaag 
cgaatcgtac 
atcggtgaac 
cttgaatgca 
gccaccgccg 
ttcccactct 
gaaatcattg 
catgatgcca 
atccatcaac 
agcagctttc 
catcaaagag 
ggaagaagcg 
atgttgacgg 
ttatacccct 
gagatgacat 
catatccggc 
ggcccgaccc 
ctaaactgga 
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gtgagctgaa 
tactatacac 
gggtcggaca 
gttacaattt 
tctgacattg 
gtccaccccc 
gagggtgaaa 
ccagaaatgg 
catttctgga 
gtatcttcgt 
aggccgaccc 
tttgaatcga 
gttagatcga 
acgattctaa 
gtacaataac 
gcggtccacc 
tgagctgaaa 
gaaaggcacg 
aatctgggtc 
aaagagttac 
tgcaatctga 
cagcggtcca 
agaaagaggg 
cagaaatggc 
ctggaacagc 
atggtatagt 
acttgacatc 
ttcaaaagtt 
gggaggtacc 
atcaccccat 
ggaagagttc 
attttcttcg 
atctccagtt 
agacctttgt 
actacaacgg 
ccttccacgt 
atctagatat 
atattcaaga 
gtttctctaa 
attgtgattt 
aacaatatat 
cccccctctc 
gttcctcttc 
tatatgcttt 
gattctttcc 
caccagaaaa 
attctggttc 
atggataata 
ttggcccgta 
aataatttga 
cattgaatct 
ttttgtctcc 
catccaattt 
ggtatacctc 
cggccgaatg 
atgtggggta 
attggtcatg 
gatgtgctat 
acagtttctt 
caattatgta 
atagagggga 



aaatttttca 
tcagagggtg 
gattccagaa 
cttataactg 
ggtcgaccca 
taaactggag 
acacctgtta 
ctgtccagaa 
aagggttaca 
gcaatctgac 
agcggtccac 
ggatgagggt 
gccggtccag 
ccatttctgg 
ttaccataca 
cttcgaactt 
aaatttttca 
atctggatgg 
ggccagattc 
aatttcttag 
cattgggtcg 
ccctctaaac 
tgaaaacctg 
tgttccagaa 
catttctgga 
acacctctct 
aggccgaccc 
tttgagacgg 
ctcgcacacc 
ctctaaataa 
atggatgtcc 
tatgaacact 
aggtttcaga 
cgactcgtgg 
ccaagtcatg 
gtttacctat 
ggatactgca 
aataaaagtg 
cactaagaga 
gttgtgccct 
aaaattcagt 
tagtgagaga 
ttcctcctct 
aattatcctc 
ctcaccacca 
aattctaata 
tggtgctaac 
ttgcacattg 
ggcagacatg 
acacatgaat 
gtttgtgagg 
tattcttcct 
gttggtttcc 
gtttggcata 
tgtaacaact 
catattactt 
acaactaccc 
atcgtgtaaa 
ttcgactttg 
cttgcagttg 
gaaagtgaaa 



aaagtttttg 
gcagatcggc 
acgtttctaa 
gtgcattatt 
gcggtccacc 
tgagctcaaa 
gaaagtggtt 
atcagattcc 
atttcttaga 
attgggccga 
cctctaaact 
gaaaacctgg 
aaatggctgt 
aacagtcatt 
tttctggtat 
gacatcaggc 
aaagtttttg 
tgtcgtccgt 
cagaaacgtt 
atttggtaca 
acccagcggt 
tggagtgagc 
gtaggagaag 
atctaggtcg 
aagtgttata 
gcaatctgac 
agcggtccac 
aggaagaggg 
ttagacacac 
tccatcatgt 
aagttgatag 
tctgactact 
gaaaccagag 
aagaaagagc 
gagatctaca 
acagatgaag 
accaagtgga 
acccctagag 
aacttgtata 
tttgcaaggc 
tgtattttta 
tcccaatcac 
tcttcctctt 
aatacattta 
acatcgaccc 
gacggatttt 
tgatcatcgc 
aattgatcat 
atcaaacggg 
ctgattcctc 
aaaaagtctc 
atttcgtt tc 
tctagttttt 
tggatgacct 
gcattcaaaa 
gaaaaaatct 
actataggat 
ggtaccattg 
ttgttggtta 
tgaattgctg 
gacttgtcat 



agatgaagaa 
cctgttccag 
ccatttctgg 
tctagcatat 
cttcgaactt 
aaatttttga 
gctggtcggt 
agaaacgtct 
tgtggtacaa 
cccagcggtc 
ggagtgagct 
taacagaagg 
ccagaaatct 
tctggaaaga 
cttcgtgcaa 
cgacccagcg 
aatcgagtaa 
tcactgctgt 
tctaaccatt 
ataacttacc 
ccacccttcg 
tcaaaaaatt 
gtacgcacgc 
accagaatcc 
ttttctcgca 
actgggtcga 
cctctaaact 
tgaaaaccct 
aacctcatca 
atatcttcgt 
atacaggatc 
acaactggcc 
tggtggatgg 
aaggaaaaga 
tcgcagaatg 
ctgtcaagag 
tggctgaaat 
tagtcttcaa 
actttggaac 
ataggattgt 
ttgctcaaat 
tttctactgt 
catcttcatc 
aaatggtgcg 
tacgtccatc 
ttatagaacg 
caataaaaga 
ataaagtatt 
gcctgaaacc 
tttctgtacc 
gtgcagataa 
catattcttc 
cttctaggta 
tattacgt tc 
ataatgccat 
tccctaacat 
tttcta tgcg 
attcgccccc 
tcccacaacc 
cattgaatag 
ggggtaaagc 



gagggtgaaa 
aaatggctgt 
aacagtcatt 
attctagccc 
gacatcaggc 
aaagtttttg 
agcatccgtt 
ataaccattt 
taacttacca 
cacccttcga 
caaaaaattt 
catgtctacg 
gggtcggcca 
gttacaattt 
tctgacattg 
gtccaccccc 
gagggtgaaa 
tccagaaatg 
tctggaacag 
atacatttct 
aacttgacat 
tttcaaaagt 
atactggcgg 
agaaacgttt 
tcttgtacat 
cccagaggtc 
cgagcgaccc 
tgcaacagag 
ccctccgtcc 
cgaaggttcc 
atgtggaatg 
tgccgaaatc 
aatgtttgaa 
gagtttgaag 
gttgagacaa 
tggattcttg 
tattagagag 
tggcaatgtt 
aaactataac 
acatttctta 
aagttactac 
cactgttgct 
ttctttagtt 
ctcttctcgg 
ctggtatact 
tacaacaatg 
taaaacaact 
gaagatgata 
accctcatct 
agaatatagg 
tgcaaacacg 
ccttgcttct 
tttcctcctt 
atgcttaccg 
gcattgatca 
actcactgga 
gaggcctttg 
taaaatacta 
catcaataaa 
tctatcagtg 
ttctcttttc 



agtgggtacg 
ccagaaatct 
tctggaaagg 
ctttgtgcaa 
cgacccagcg 
aaacgaggat 
cactgctgtt 
ctggaacagt 
tacatttctg 
acttgacatc 
ttcaaaagtt 
acggcttctg 
gattccagaa 
cttagatttg 
ggtcgaccca 
taaactggag 
acaccctctt 
gctgtccaga 
tcatttctgg 
ggcaccttcg 
caggccgacc 
ttttgaatcg 
cacatgcctc 
atagccgttt 
taacttgatt 
caccctcgga 
taaaaaattt 
ggggtataaa 
aatcaacatc 
cccctcacag 
tctttcctca 
gggacagaac 
cctgtcctaa 
gaatatctgg 
aggccactag 
aacgaggagg 
aagaggggca 
tgtagtgcat 
aatgttgtac 
taaaaaataa 
acccaattct 
gctgtttgtt 
cgttcttcat 
ttaaacttct 
gtttcaccat 
tccactaatg 
ccagaaacac 
gggtaagatg 
ctcattgtac 
ggtgtgtctt 
agacacgtac 
tcatcgatag 
gcggcgtcaa 
tccatttttt 
aaatttttaa 
acaactaagc 
taggataatc 
cccatactta 
tcagcagtta 
agaattctat 
actatattgt 
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tgatattcct 
tccttgtata 
agcaggggtt 
gagtattttt 
ttttgataga 
caaacccgga 
acttataagt 
tttttatcct 
gtcaatggaa 
ggattcacac 
aacttccaac 
ggcaagtaaa 
atctaggaag 
tgctggtgac 
atgtgagccc 
attcttcttc 
atactaaaaa 
ctgtgaataa 
caatgggtat 
atggcaaaaa 
taaaaaaaag 
gagtaaaagg 
ggggacaata 
attgaaaata 
aaatatgata 
gttaaatacc 
gctgaaattg 
ataaagctca 
tcagttggtg 
gtcagtagcc 
gatgcaaaaa 
acgaatcctt 
gtcggctgat 
tatagaatta 
aacaactaaa 
atattggttt 
tcttggaacg 
ccgcatcact 
acaccagtaa 
aagacccaga 
ctgtttacac 
gtaatgtcac 
tggtcaaagg 
atctagaaga 
tgttgaggaa 
aagatgtttg 
ataatcccat 
taaaaacttg 
tatctggatg 
ggataattgg 
aagaaggaac 
agaagaatgg 
attattacaa 
ttgtcccaaa 
gaatttgtaa 
atatacttta 
gtccaaactg 
ccagagaagt 
gaggagatga 
atccgggaag 
tgagaaaaca 



caaatcttct 
gaaattcctc 
gcctgctgag 
atgcaaagag 
ctcgataatg 
cgccatcttt 
gtgtgtgtta 
caacttctga 
atgccgattg 
tttgtctgtc 
aagaatgggc 
cgactacggg 
aggaggaaga 
tgtcatgact 
acagagctac 
cacaaccacc 
agttgacgac 
tggaggtgtt 
caatacagat 
ataaaatatg 
agtaactccg 
tactgtgcca 
aagattatga 
aaactgaatt 
aatgtttcgt 
tcaaagagca 
acaagagtaa 
ttgatgaaga 
ttataaatat 
acggaaaaca 
aaatatgata 
caacaaaaag 
attgatagga 
tccattgata 
taactcttta 
gcaaataaat 
gaagttatga 
tctaactgag 
ggaaaatgat 
accgtctgtc 
ttctgtaaag 
ctcaaccaaa 
ttcactgttt 
cgaaatcgat 
gggtttagac 
gaattccccc 
aaagagacat 
cttcgaaaca 
gttagctgaa 
atgtacctac 
gaaaaatttc 
tacaatttcc 
gtttctacaa 
acacaaacta 
agatttggga 
caagcaagga 
gaactttcac 
ggaagaagta 
ggataggaga 
aactaatgtc 
cagaatgact 



ggtagtcttt 
atttgtgaat 
aggcgtattt 
acgatatata 
gacggtattc 
catgactcat 
acgcacgtaa 
gaattgctca 
acgtcatgtt 
ctgagcagta 
tctgctaact 
tctaccaaca 
attgcatgtg 
ttgttgattg 
tcgccgccgc 
actacttcca 
gactatgatg 
ttcatcaatg 
aatggaggat 
gtatattaaa 
tctacatagt 
ttacatgtgg 
aacaagactg 
atgtgaaaaa 
agaggaggag 
gggaatccct 
caaagagtta 
gttggaaagt 
atccctgcaa 
cacttgaaaa 
aattatttga 
agaagggaaa 
gtatgaaaga 
cattattgga 
tcgggcaagt 
aatccacaca 
ctcacataga 
acatcgttgg 
acaatggaaa 
agtaaaaggt 
tgttaccttt 
gaaaggtttg 
ttgaggttag 
aaatggccag 
aagaagaaat 
atcgtccaag 
ctcatggatt 
aacctacgca 
aacgggttcg 
aagagtgatt 
gtgactcttc 
tattcacctg 
acagagtatc 
aagatacact 
gtgtggaagg 
tggtacctat 
tctatccaga 
gagactgttt 
aattacccgc 
gtttcctgtt 
gcgaaacaat 



tgagccaatc 
gatgttgaag 
tattcacaat 
caccttcagc 
ttctatagtt 
tttggagggt 
ataccagcca 
acgtactcat 
aggggtataa 
gagaaggacc 
tgaacgaccg 
agaaacaacc 
tcatcgcatc 
tggtactggg 
cgcagccgcc 
ctactactac 
acgacgtcaa 
gaagaaaact 
ttgtttggaa 
tatgtgttgt 
tttttttaaa 
agatcttgta 
attcaaagaa 
atcaatgagc 
acggaaaaat 
ctagacccag 
gatgcccttc 
atgaaataaa 
ggagatacgg 
atacaccaga 
agatgataaa 
ccctctagac 
gattgatgat 
tgaatgtgaa 
gactgtattt 
aaacgtgttg 
gtgtataaaa 
aagaggaaca 
aaaagactga 
ccagaaataa 
cttccataat 
aggagaggtg 
tagtggacga 
atatgacgaa 
tgaaagaagg 
aaacattcct 
tcaataccat 
cccatttccg 
ataaaaagta 
gggtggacag 
ataggaaaca 
aagagaaata 
cacaaaacaa 
attgtacgtt 
atatggaaga 
tattcgacgt 
cggacggcga 
ccaagaaaag 
ccactaatgc 
cggtatttga 
actatcaaga 



agattcagat 
aaataatgac 
ttcatccacc 
gagttcgctt 
tttaacgatt 
gagatcatgc 
ttgacgtcat 
ctcttttcct 
aacgactggc 
cagtgaagaa 
aactctactc 
agtgtgctgt 
tgttctgggc 
aactgcacca 
ggtgcagttt 
tactacacca 
tattggaggg 
ctcaaaagaa 
gaatggtgtt 
tttattgttt 
ttggaagata 
ttagcaacat 
ttaatgattt 
agatgaaaaa 
tccgcaagat. 
aagaaagacg 
ttgaggaaaa 
ataaaagagt 
tctcaaaaaa 
atagatgagc 
aggttccgag 
ccagaagaaa 
tgtctcgagg 
aacttgcatt 
gtacgtatgc 
tttttctata 
ggaattggat 
gagaagagag 
gacggctgca 
agaacccaaa 
caagagtgaa 
taaatccgta 
gtgtctgaga 
ggataacttt 
atctacacat 
atcccagcaa 
cacctacgcc 
gacacgacaa 
tacgaaactc 
tggtgatttg 
tttatgtgtt 
tccgataccc 
gaaaatacag 
tgaccaaacg 
acgacacaaa 
taaaaagat t 
aggcgtctct 
taagaaaaat 
caagtacgta 
tacccgtcaa 
atct tggatg 



tcagcacctt 
gttttttcca 
tcatgttttg 
ctccgtatac 
cccttgatag 
aatgaaaaaa 
atccgcctag 
ggtacatgac 
tcgagccctc 
aatactcctc 
gtggaggaag 
cacatttgct 
tgcgtcctcc 
gttaattgtg 
catccttacc 
actccaccag 
caatcagtta 
gaagaaaaag 
ttttctcaat 
ctttaaaaaa 
taaatagaat 
tagagaaatg 
ggaatcagaa 
tacacaacta 
ggaggaaaga 
tacaatgttg 
tgaacgtata 
aggagacttt 
tccatgactt 
agacaggaga 
aaatagagga 
gacttgtatt 
aaataaacca 
atggtcttca 
aattaggtcc 
tcctataaat 
gttcctatgt 
gttaacaaca 
acaacagaaa 
acaacttcta 
agtagtagaa 
agcaagatga 
cgttacaacc 
tacgtccaat 
cctgttgtag 
ggagaaggaa 
gccaaacaac 
cagagggcca 
gtacaacact 
gaaagggtaa 
attagtgata 
tcaatactaa 
aaaatgatag 
accattcaag 
caatcagaag 
aagaaattgc 
gtactatttt 
aaaaaaccta 
gtgggtgtag 
aaaagggtag 
acagatagaa 
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gaaaggcaaa 
taactaggta 
ccactaaaaa 
acgaaaaggg 
agtgcatatc 
acattgcttt 
ctgccaggat 
tcgtggacga 
tggagaagga 
caacaacaac 
tcagagaaaa 
caaaccgaaa 
cctcttcctt 
gggacttcaa 
taatggaaaa 
aggatgtaaa 
atgctatttt 
tcaacaacat 
ccaaaaaggt 
tgccttcacc 
ggctgttata 
ttcacgggca 
tatctgcaaa 
tctcggacgg 
ataaattagg 
ttttatgtct 
tgatgaggcg 
atcgactctg 
tccttcagtg 
tgctgctgct 
tgatgttgta 
gatcctgcag 
ccaactagta 
tttttggata 
caggaggaga 
gaatagtgaa 
gtctcataaa 
gatcgaagag 
caaaaacgaa 
agaagaaaga 
catcgaggaa 
aaaggaggtt 
aaaaatacat 
ttttatttgt 
aaggggttat 
ggagcaactg 
gtgtattacc 
ctccaaaact 
ccatgtcatg 
gcaccaaaag 
cctcatcctc 
ggagggagag 
gcttgttact 
cctttacttc 
ctgcagatga 
ctcgtgggta 
tagtttgcga 
cgcagataat 
aaacgaattc 
aaaatttgtt 
ttgtaaagta 



cgaaacgtac 
cgataatggc 
attcgaagct 
aaagaaaaag 
taactttata 
tggggatgcg 
tttcgccaag 
gaaatatacg 
ttgtttttca 
aacaacagaa 
tagagatacc 
agtatcgaca 
ttgtactcct 
tgcaagcacc 
gaaagacaag 
aaactcgtgt 
cgaccttacg 
aaaaatggta 
gacccttcaa 
ttttgcacaa 
ggtccaatct 
aggaaagcaa 
tcactgaacg 
aggcggttag 
cgagaggcta 
cggaccatgg 
aggatctgca 
gggagcttac 
aggtcctccg 
gcagcttttg 
tatgaacttc 
gccttatata 
cctttgtcac 
cattttctgg 
gcctgtgtca 
tctcgttcta 
actctttact 
ttacatcaca 
gagcgaaaga 
eg tgcaatat 
agtgaacgtc 
gaagaggaac 
tatatgattt 
tttttaccaa 
atcttccttc 
gttaactgta 
ccacctagaa 
aaaattgaca 
ttcccttagg 
attttccatt 
ctcttcctca 
gtga tgtgtt 
tgacgagtcc 
ttctggctcc 
agatgttgtt 
gaacaagtag 
aataegggea 
gttagatttg 
gatttaaaaa 
aatgctacaa 
catggttata 



aagaagaaca 
gaagaaatta 
tacttgaagg 
tacaggaaaa 
gatgaattaa 
aaatttgect 
aagataaagg 
tcaaaagtat 
ccgaataaga 
gaagacgaag 
agaagatget 
ggagatatct 
acttccatta 
aatattgttc 
atgccgttga 
cactagtgaa 
gtcattgtgg 
caatattcta 
gttcacttga 
cagactctat 
teatategtt 
tatcagtcag 
tgcggtctgt 
tcatgttttc 
tttctttcct 
etteggcaag 
catcaccatc 
teggegggtg 
cctttctctt 
cttccataga 
ttagtgttca 
caatttcaag 
ctacaccaga 
gtccagttgt 
ttctgtgtgt 
gtggtattga 
tgtcaaatag 
aaacaaataa 
agcatcagca 
tggctgaaat 
taggtttact 
atcttttgaa 
tttggattaa 
caaggatcat 
actttctgta 
agagatccaa 
acggcttcac 
gtattttgta 
gggtttttca 
ttctctgcag 
tattcctctt 
tctttgtgtt 
tctaccagtt 
ctcttcctct 
gatgttctag 
tagtagtatt 
atgttttatt 
atgaaaatcc 
tttgcgaggt 
atagattaat 
taaacccacg 



ataaagagta 
taaatgatgg 
tagtgaacga 
atgccatgaa 
tccctaaaag 
geaegggaag 
aaagagtegg 
gccatcgctg 
aaagaaaacc 
aaaatggaaa 
cgtccgaaaa 
ctatggaaac 
catgtgtctt 
ataaatttct 
agtatcactt 
atagtggcgt 
tatatcaact 
ggtgtgtttt 
tacactaact 
gaatttatta 
gtagaatctc 
tatagggaat 
aatggtcatt 
cagttgaggc 
cgcttccgtt 
tctttctctc 
aacattttca 
tgtttctact 
actggtcgtg 
cctggtggaa 
aagagactga 
atggggaagt 
aatccacaca 
gtgggggggg 
atgtcttgta 
aattgtacat 
aactgaaaga 
gcaatttgaa 
acagcaacaa 
cgataaacac 
tgtagatgea 
acaaaaagaa 
taaaaacaag 
caatcaaact 
ctacagcctg 
tctccatgtc 
ggttgatcag 
cctcaaattc 
gtttgatgtt 
tgtcgtcgtc 
cttcaccatc 
cttcctcctt 
cttcttgttc 
tggtggggag 
aagagaccat 
agactcttca 
atttatagat 
tggatctata 
taatatagca 
atacataaaa 
gaaattatta 



caaagaggcg 
taacggtgat 
gcactacagg 
agtatactct 
ggataaaatt 
aggtgagcaa 
aggtgataag 
caatcaacct 
gecgacaata 
atggaagaag 
gaegcaatte 
gccagtacct 
gggaggaaaa 
agggttttgg 
tattcgagtt 
agecctgaag 
cttgcccgat 
actattcttt 
tcaacaaatg 
cacatcaaat 
gaaccctcga 
tcaaaagtga 
tcgttgtgcc 
aaaagatata 
gcggcttttt 
tccattctgg 
tcaatataga 
gattcttctt 
ggagaaactt 
accattttga 
acaagtgtgt 
ggggggagaa 
tgcagaaggt 
ggtggtataa 
ctgaacatta 
aaaaatggtg 
catgcccaga 
caggctcaaa 
ataataattc 
atgaaagaga 
gaaatcaata 
gactaaacaa 
tggaataatt 
atctattgta 
tacaaaatta 
gccaaactct 
gaacccaata 
atgetcatag 
tttggtagta 
cactctgtct 
gtcttcttct 
ctcttcctta 
catttttact 
aagagtagag 
tttgggggtt 
ttgtttgact 
aatgtcacat 
aaaaagaaat 
aataatagta 
cctccatctt 
ttagatgcaa 



ttagaggaaa 
acttctacac 
ttactgtgga 
agaaaacaaa 
gaagattacc 
tacgcatcac 
aggtttactt 
ttaaatatgc 
gtaacaacca 
gctacacctc 
ggttacagtt 
tcttccactt 
ttegtcgaca 
gataaaaagt 
gectgagaat 
gatgagaagt 
aaagggttat 
gcttgaaatc 
gggatatttc 
tcaatttgta 
cacgtgattt 
ctttgttgaa 
tgagaagtga 
ccatttcttt 
cttcctcgat 
cgacttcgat 
cttcatcttc 
cttcttctgt 
ettttgettt 
ggttcttgga 
gctcaggtgg 
aataaggttg 
ttgatgetag 
atatagecca 
ctgaaatggg 
caccaaaaag 
tacaaaagca 
aggtcctcga 
ctctagaccc 
ttgatggttt 
acttggaaga 
taaaacatct 
tttaggtata 
acttcaatag 
tcacacttgt 
tttggtgggg 
tcagatagta 
tgtgaaactg 
ttgtccagtt 
tcatattctt 
tctttacatg 
attctggggc 
tccactttca 
gtggctgcta 
atttttggaa 
tctttgttac 
atggcaacta 
caaaacactt 
gattaccaca 
tacacgatgt 
aagggaccca 
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ggaaatcgtt 
cgatggttta 
gttagaaatg 
gaatacttta 
taagtacagg 
tttttgtccc 
gccttgttca 
cctttactgc 
ccatctccac 
cgttctcgcc 
tctttgtatt 
tttccattat 
agttcccttt 
agtctcgtct 
gttatggaca 
gaagagacat 
ctgcatagca 
tagcgggatg 
tcagagacgg 
ttctctgcaa 
gcgtatatga 
gggtgtgtac 
ttggagtaac 
catatatcat 
atgttacgaa 
gtggttatta 
agcctgtaga 
attttgttga 
tcacacccat 
ccgacttgaa 
tgaagtactt 
gtaaccctat 
ccagaaatag 
ttaaatgtgt 
tgacacagag 
gaaatcggca 
gtacacaaaa 
tggacgtcat 
agaccaataa 
ataatcttct 
ttaccaacac 
ttgtacacat 
acgtcaataa 
acatcttctt 
acacatgctt 
tctggaaact 
aacattccat 
atggtccatg 
gactgtttgc 
ggagatcctt 
gagctctctc 
gcattaataa 
aacatgtcct 
ctcaaggctc 
ggaagaaacc 
gaaagaaatg 
ccaagaatag 
tgttattttg 
ttacagggcg 
agagctcccc 
gttcttcccc 



tatatgttcc 
atgaatactt 
tacttagata 
aggaatctca 
gctaattata 
aaacccattt 
aaaggactac 
tggtgctgcc 
gttcttcatt 
gcaaacgctt 
ccttagatgt 
ccatgaaaat 
ctcctggagg 
tgaagtggtt 
tgatcaaacg 
acctgattcc 
aggagaaatc 
tcgcaatcga 
cacaacgttt 
actttatttc 
cggttgcatt 
tgttgtagaa 
tgcctgtgat 
acactgatat 
gtgttctagg 
tcatatagtc 
gaggagaaaa 
tattttgcat 
catcttgcaa 
gagtattgtc 
ccttatcttt 
caacagattt 
tgaatccatt 
catactcggg 
tgggccgata 
aaaaattacg 
tgtggcactt 
atttaactgg 
aacaacaaaa 
cagcaataaa 
ggtacacgta 
ttctggggat 
accagttatg 
ccctgtaaac 
tgaattcttc 
gttctgtgtc 
ccacgagcct 
ggaagatggg 
cctttccggc 
cgacgaatac 
tcttttatag 
tatataattt 
cctcctcgtc 
tggctcaccc 
agaagtgccc 
gaggaaatct 
tgggaagtct 
tcaaggatct 
ttctggacaa 
tgatcccaat 
ccagatctaa 



gctcattttt 
tcggtttact 
aagacccagg 
gagaaattga 
gaggtttcag 
aaaataaatg 
atattcattc 
gtttccatct 
atacatggaa 
ccgttcgtct 
cggatcaatc 
tttgatgtct 
gactgcaaag 
tcgtacactg 
gggcctacgt 
ttcttctgtt 
tcggtcagaa 
tgattttttg 
cctcaattcc 
tcttctgaaa 
gaaaagagat 
aatcaaacct 
tgggtttgca 
gcgggaagaa 
gcaatggctc 
acattttttc 
tttaaagttt 
atcaatgggg 
taaactccct 
gcctgtcaat 
tagacggact 
gaatgataat 
ttttaccttt 
tgtcacagat 
cgcaaattgc 
tctttagaat 
ttcaagccac 
aatatgtgta 
aacattttgt 
cttacatttc 
attcttactc 
aacttcaatc 
tgcatcattc 
catcacatgg 
tccatacatc 
gtgcctccac 
gggacgttcc 
taaatagtcc 
agttctcatg 
aatcatgttg 
aggcaaaaat 
ttatattttg 
gtcgtcgttc 
agatctggtg 
catccagttt 
gttccct ttg 
tcacgggaga 
g tataatgga 
gaaaatatct 
aactgatgtg 
agcgtaccga 



tggactactt 
agaatttgtg 
tgaaatattg 
aaaagagcca 
ttaataaagg 
catggttagc 
ccttcatttt 
ccatcttctg 
attgcccttg 
tcgggcaagt 
acattgattt 
gttcctgtta 
gttatactat 
tccctaatac 
tctcccacat 
gcagaataaa 
aaagcaaaca 
tgcacatccc 
ttcagttctt 
ttatccatct 
gacatataat 
agtttatcca 
atgtaggccc 
aagccacatt 
ggcgtgtagc 
acagctgcat 
ttgtcactaa 
agtgattcgt 
aacctgggga 
tgtctcagtc 
atatatagtc 
agaatatgtt 
aaagtcttga 
aataaaccta 
cacgttgctt 
gtctcggaat 
cacctctaca 
ataatatagt 
aatttattta 
atatcacagt 
gtcgagaaac 
tttacaaaca 
aattcttgaa 
aaaactggct 
tccaaatact 
atgcctgtga 
tggtaatcgg 
ctataattgg 
ttatccaccc 
aagggtaaga 
gtagttgcca 
tatcattcaa 
tcgttccgca 
gacaagatta 
ctggccgaca 
cacccgttca 
acattggaca 
gtattttcct 
ggatctggat 
gatttgctgt 
gtcatcactg 



gtatatcacg 
acacaaaatt 
tatcatgcag 
attagggatt 
actgttttgt 
aaacttttat 
cttctctttt 
aagtttctcc 
acgtattgaa 
tttcttccct 
cggcaccatt 
cctctctgac 
tgccactggt 
atataggggc 
ctctcttttt 
aggaaccatt 
ctaagcaggc 
tcaaagaaat 
cttccttcat 
gtaaattatc 
ttccaatact 
ttggaacaat 
ctttgtatgt 
gttccaccaa 
cacaccctac 
tagaaagtct 
gaagggcctt 
tccacttttc 
gattgtgagt 
tatgcactcc 
cgtcccttaa 
cgtatttgtc 
attcttccat 
cagaccttgc 
gaattactta 
gtcatatttc 
tcccagtgag 
ccaattctta 
ttcaactaaa 
gtacaatact 
atgaactaca 
acgctctctc 
taattggttc 
tgtatttgac 
ccctcaatcc 
aaaagttcaa 
atgcacgcaa 
tgtacatgaa 
atgaagtttt 
cttgtctaac 
gccaccacct 
ccgtagtcca 
tctccaccta 
cccagaaatg 
tctcgcacct 
agaaccaacc 
atgacattga 
atgtgaacgg 
ctggagaatc 
acattttcgg 
aagctgttct 



gatgggatgt 
tcattaacca 
aatatcttat 
taggttggat 
ttttttttat 
gtctttacaa 
ccttttctct 
tccacttcct 
aatcatgcgt 
tttgagtaac 
ttgatatact 
catttccatt 
gcgggagaaa 
atccgatcca 
atattcaggt 
tccgttggtt 
tccaccatca 
cctttgttct 
atttacttca 
cctttcattt 
tttagctgat 
taggcacatg 
taacctatca 
taattgttcc 
taaaaagtca 
ttcgtgtaaa 
ttctttaacg 
cccataatcc 
atactttaat 
gttttttatg 
ttcacttttt 
agggttttca 
tttgatatac 
cattgaactg 
atctgtagat 
tttgtgcgta 
gcggtggagg 
aaaaatcaat 
atctcatcaa 
attcgtgcat 
aatattccct 
ctccttgaga 
cattttcttg 
ccactcggcg 
aatttttgat 
tgatgatcca 
atgctcttga 
gttgaggaaa 
cccagttagg 
tctctcgagc 
taaaattcgt 
tctaagagtc 
ccagaccttt 
tgacgagaca 
gatccaagga 
acatctggaa 
agaatcatac 
cgtcaaggag 
ctcctcttct 
tactttggta 
agcactcccc 
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ttcaatgaat tcagtaacaa ctggcctcct acaaatatca aaggagcata cgtgtctaga 2874 60 
gatttcagga tgtttaatct gttggccggt ttagatcata tagaaggaga agttggggga 287520 
gaaagtgaat gggaatccat acacgcatct gtcgtcaagc gaatggtcac cattatgcgc 287580 
aacaaagctg agaagaaacc tccatcaaca tctagaattt ttagagtgta tgtggctgaa 287 640 
ccagttaatg atgcagtgac aaagatccct atacgtgtac tcagtaaatt attcggttca 287700 
agactcgccg gtattctcca gaaagtgtac tcttattcaa tgctaaatct tccatatctc 287760 
ctgtcttcaa attctataga catcaagcaa ggagtaaaag gaattacatt atctatacca 287820 
tctgcaagaa aattgggatt ctatttactc caaaaggata caacattaca atcttcccta 287880 
tcacaagatg ttgccgactg catagtttca atcaacgctg gtattattgg tgatgatttc 287940 
tctgaaaaaa tacgacagtg cattgaggag aaaaacaagc cagaaaactg ttgtatgtgc 288000 
ttttgtgaaa ttgacaagac gcccgatttt tcttatagtg aacatgtggc aaggcacaat 288060 
ttcttccccg tccacgcatt ctcctcatca ca tgatgaca agtgttgtgg agcaaagatt 288120 
tgttccgaat gtatattccc gtacatcatt tccctgtatg agaaaatgac tggtgtggca 288180 
ggtgtaaaag ttgtagattt gttccagtgc cctggttgta aaagtggcat gctcaatcta 288240 
aagggaagat gttacgagtt ttcaaatttg tgtaagagaa tgatactacc atacacatcg 288300 
actcattgtt cttctctctt tgatgctacc ataaatcgcg cagaggcttg tttttactcc 288360 
ctagagtttc tccagtatga ttttgaaact gcgaggagaa ttgcgcatgg agctaaagac 288420 
attccccatg tttacaataa ggtagtaaag aatgtaaaag atttggatag actgtgtgcc 288480 
ttgtactgtt acaaatgcgt atctcctgtc gtatgtgatg agccgaatga aagtacggac 288540 
tatgaaatgg tggatgtaac tccccctcta attaatctta ccgagattgt tgattcggaa 288600 
gagtatgatg atggtcccgg aaatcatatg tggccagcaa aatttacctg taactttatt 288660 
gcgggttcca gtggagaaac acccaccatt agtacgtgca gagatgctgt aacttttcta 288720 
ggaagagcac cgaggaagaa aatggcagga tgggatgatc aatcggcagt ggggcaagcc 288780 
attatagcgc tagccaactg gagaaagagt ggggaattgc ccaaaaatat gtttgattta 288840 
ctagaagggg taaatgccgt actttataga ggcgacagtt tcttgttacg tgcgataaac 288900 
tacccctgtg ttattggtag atccatgagc cctagtctgg aactcgttaa aagaaaggtg 288960 
aataaaattg ctttaataaa ggccttcttc cacgagaaaa gggtgcgtcc agacgcatct 289020 
aaaaagttac ttgaatgggc agaactatta gtcaaaagtt atctcatgga agttttactt 289080 
cagacgccag aatgtgtcat acaccgcgcc cattcatttg taggcaaaac tctcctcatt 289140 
actgacgaat tggttcacat gcgtccagat gatgccacaa gaaacgccta tatccagaac 289200 
ctaaatgcgg ccagacagaa tgcggctgct gcagcctcat tttctgggtc gctccccaaa 289260 
cctgaatttg tcccctgcaa agaaaggacg attgaatgga tgtatgaaaa ggacaatgat 289320 
gatgttagag ttgtaaattg tccttcatgt aaaaaggcta tccagaaata tggaggttgt 289380 
gtgaatgtgt tttgtgaatg tggaacaaac atgtgctgga tatgtgaaga gaaggtttct 289440 
cctgctgatt ctaatcattg tgtggagaaa cacaggattg tttatagtaa ctgtgttagg 289500 
gttaaatatg ccttagaaag tatgtacggg tttgagattt gtaccatgaa aaatgtagaa 289560 
gaaggagtta aaaattatta tgtaatggag aatggatttt tctttgatgt acaagaaatg 289620 
gttgctaaga aataataaaa cacttgtgaa atgttataac ttgttttatt gtccattata 289680 
ttcaacaaac tttaatttct ttgcatgcct tgttcaaaag ggcaacataa tcgccttctt 28974 0 
tctcctccct ctttctcttt gttcctccac tgcttgtggc tacttcttct tcttccccac 289800 
gttcctcgtt gtacgtctca attgccttaa ctgtgttgaa aatcatgcga cgctctcggc 289860 
gcagacgctt cctttcctcc tctggtaatg tttcttcctt ttttattatg tccttgcctt 289920 
ctttagaaga ggggtcaatc acttttattt ctacaccatc ttgatatact tttccattat 289980 
ccatgaagat tttgatgtct gatccagtaa cctctctgac catctccatc aacaaccttt 290040 
cattcggagg atcgatgaat gatatttcat taccctttcc ggatgcaatg aaatggttat 290100 
taaaatggtt tcgtatacta tctctgatgc atataggtgc atctgagcca gttatggaca 290160 
tgaccaatct gggcctacgt tcatccacgt ccctctttct gtagtcgtcc gaagagacgt 290220 
atctgattcc ttcttctgtg gcagagtaaa aggaaccatt tccattggtt ctgcacaaca 290280 
aggagaaatc tcggtcagaa aaggcaaaga ccagacaggc accgccgtca tatctggatg 290340 
atgcaattga ggacttttta tgcacatctc gaagagaaat cctttgttct tctgaggcag 2904 00 
catatagctt tttggattct ttaagttttt cttccagaac acttctttcg ctaactagaa 290460 
ggttgacttc attttttgca aatctgattt ccttttcgaa attatctaat ttttgattac 290520 
tcttttctag cgtgtataca actgcagcat tgaaaaggca tgacatgtaa tttccaatgc 290580 
ttttagctga cgggtgtgtg ctgttgtaga aaagtaatcc aagtttatcc attggtacaa 290640 
ttaggcacat atttgaaaaa ctgccaacta tggggtttgc gatacttaag ccattgtacg 290700 
ttaacctatc acagatatca tacactgata cacgagaaga ggtaccaccg tgttgcacta 290760 
acacttgttc catattctta agagtacggg tacaatttct aggagaaaac ccacacccta 290820 
ctaagaagtc tgttgtgatt atcatataat cacatttctt aacggccgca tttgaaagac 290880 
gttcatgtat gattctgtat aggggtgaaa atttaaagtt cttgtcatta actagagcct 290940. 
tttcttccac aattttgttc atttcctgga atgggcttgg tagtgattcg ttccattttt 291000 
ttccataatt agggtcttca tcttctaaga aactaccttg ttttgggaga ctggatgtgt 291060 
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attttagtcc 
tcttgatgtg 
cacttctcct 
tattttcccc 
ttataccgtg 
gatctgggaa 
ttacaggaaa 
atatcaaatt 
ttttggagtg 
gacattaaat 
tatataaaac 
tgacatctcc 
accgatgtgg 
aattcaagtg 
atgttgttac 
actggtctaa 
ttacacgccc 
tgaagtacac 
aattgagcaa 
tctagaaaac 
gactggattc 
cgccctaatt 
caatatggac 
ccacgagaag 
gagaaaaatc 
tgttggtgga 
cctagaagct 
aacaccatgc 
atgtattgcc 
cttcatatca 
acgaaacaat 
ggctgatatt 
caatggcagg 
tacccattct 
ttttgatatc 
aaattgtcac 
ttctgggaag 
aaaatgtaaa 
tcgtctacct 
tagaggggaa 
gaatttttgt 
ctcttcagaa 
taatcgtgcc 
aaagtcacgt 
aaacgttaac 
caagatactg 
catcacatct 
aggagtattt 
aaattcgtac 
aattttcacc 
gcctcacgtg 
tatcgtcaaa 
tacagcagaa 
tagattgagc 
gggaaaattt 
ggccggggca 
taataaagga 
gttggatctc 
gaacgccgtg 
agtagaccta 
aaaagtacac 



aatctgaagc 
gagtacttcc 
gacactcttg 
agacaaacaa 
taaccctaac 
gtgtactcgt 
agatgataaa 
tgggcaaatt 
aaattactaa 
tggtttgctc 
cagcctgcac 
agctccatca 
tttcctcctt 
tgaacccaaa 
ccagatatcc 
atctggtcat 
ttcagaagtg 
acgttccttg 
aattagtagt 
tttgtcaagg 
aacatattta 
tttatggcgg 
ttgtacacgt 
agtttttctg 
atgcaagccc 
gcacctgcgt 
agtttcgatg 
ccccattctt 
tgcttcgcat 
acggatgcct 
gaaaatggag 
acccaaaagg 
cagtggaatt 
ctagtaaaca 
ttgtacaact 
attttaactg 
tggtggacta 
gttgaaaagg 
aatccaggaa 
aatctaggga 
ctcatggttg 
tcgatagtgt 
caagatacta 
gaaggtgctc 
cacgatacgg 
tactatgata 
attgggttct 
ggactggttg 
gatcctgaag 
aacgagtttg 
attagtgggt 
catttgagtg 
tctgtccttc 
acccctcaag 
gtagacaaga 
gaaagtcagg 
tcagcaggat 
atgaaagtgt 
ctcgccaaag 
cactttcatc 
gtctattgtt 



gtaggatctc 
ttgtctctaa 
actgatttga 
ataccatttt 
aatggtgtgc 
gatttctgta 
atgaagacgt 
acatattttt 
aaaaggctat 
acccccccaa 
cccctcggaa 
ccctcttcca 
gacaacaaca 
aatctggaac 
cccgaccaga 
gaacgcttca 
attgaaagtg 
atggaccaca 
tggtgtacaa 
aagaacatga 
atagttcccc 
ctttcttgaa 
ttgattatgc 
tcgtccgagc 
tggacaataa 
tcacatttat 
tgattggggg 
cagccatgca 
caatgttctt 
ctaaaattgg 
gagaagaaaa 
ctcatgaaac 
ggcctactca 
cattgaaaaa 
tgcttaatcc 
gacataaaaa 
ttaattttgt 
atagaaaaat 
gcactaccgt 
gtgtaggtga 
aaaataggaa 
ctgatccctt 
ttaataatgg 
gtgtatggtt 
ccatctctac 
ttgaaacaac 
gtttgtgtac 
cacctggatc 
aaaaggaaga 
aaatgttgct 
ggaacaatgt 
atcacaccaa 
ctagagcaac 
aaagaataca 
aaactggcat 
ccaataccaa 
ggttccagaa 
gcgaaaaggc 
tgagtagtgt 
tattgggatt 
gcaaggatgc 



ctgttaattg 
gacgaactat 
atgttaatag 
ttacttttaa 
cagtataata 
gaaattgcta 
gcgatgagat 
tagcgtcaac 
atttcatgta 
catcgttttc 
cccacagtcg 
cccccaaatc 
aggaagtggt 
taattggtgt 
tatttgtggg 
gtgacatgag 
tgataaagaa 
ccgagaaata 
cccctatacg 
aactgtcgtg 
cgtgtatttt 
gcacaatagt 
gggtgctttt 
acaacttatc 
ctacaacaac 
gagcggggac 
aacaagagga 
actaaaactg 
tgtattggag 
acaagcgcaa 
taatgtcttt 
tgcctattcc 
cactgtggaa 
tctaggggat 
atttggaaaa 
caatgaaaat 
gggtgtgaac 
atccgatttg 
cgatgacaga 
agtcgtatcc 
atttagtgtg 
cttttcacta 
ccgagttagt 
ggccaaggat 
ggacgctatg 
agataaagat 
gggaggcgat 
cgacgtggaa 
cattatgaaa 
tggttttgga 
agcttttgac 
agacatgtct 
agaaggagga 
actagcaagc 
gttgaaacct 
gtttaaggaa 
aattattggc 
ctataaagaa 
cggcgacaag 
cttgaagctg 
ctacttgact 



tctcaatctt 
aaaatgtcct 
tacccgttca 
ttgtttaaag 
aatctacgga 
tttcctattc 
atgtgttttt 
actcatcttc 
caacataaaa 
atcaactctt 
acttgtaccc 
cagttgtacc 
catctacgac 
actttctgga 
aacatatatg 
taacaactgt 
aacgtccagc 
ctatttttct 
acagtgggta 
gtgcacaacc 
gaagtgcaca 
ttatggggag 
ctggacgaaa 
aactcgtatt 
aagaataaga 
ggagagggag 
ggaagatttg 
gacaatgaag 
aacccaggtg 
gcatggatag 
aaaaagacct 
aataccatcc 
cctattgccc 
agaaaacttc 
atgttgctag 
gtggtgcctc 
atgtggactt 
gcctgtatgg 
atagttttta 
gacattacac 
gataaagaaa 
gaagtgactg 
gctcgtgtaa 
gaaaatgcca 
gagcgagcta 
ttcaccgaca 
atgacacatg 
aaggtgaaag 
cagtgccctc 
aagtatatag 
gacccctttg 
tattgtgtag 
ggaggaggag 
actggtattt 
gaaatgactg 
cgcaacaagt 
ggtatgtgca 
tccctctctg 
gttaaaaatg 
aagaaggccc 
ggtatagttt 



tcgttaccgt 
tcctttagct 
taattttcgg 
tcttccattt 
ccatgctctt 
attatttcca 
tcaatatcgt 
atgcaagact 
cacatgtgca 
tcctcacctc 
tccgtcgaga 
actattgtaa 
accaattcca 
gtctctgata 
gtcaaatata 
ctggacaata 
gactttaaaa 
ggtgaccaaa 
tgcaactccg 
cttctggaat 
atgagatgga 
aaattaacgc 
gatggtgcca 
acaagtgcag 
agaggaagaa 
gaaaggaagc 
gtgttgattc 
gaaactatgg 
atgaatcttc 
atgaacgact 
tccatatgct 
cacttggacc 
atgaatttgt 
cccgattcaa 
tgtttattca 
gaggttctgc 
ttcaagtaac 
aaactctccc 
agggattctg 
agagtgtcaa 
ctggtttcat 
gctgtagatc 
tgaggatcct 
tcatctttga 
tagggcagca 
aaaaatcagt 
gaggagagag 
agactataat 
aagtgattga 
ataaagtgaa 
tctttactcg 
cagatgcatc 
aaactccata 
tcaataaatt 
cagatttatt 
tatcctccag 
gtgctattcg 
aatttaattt 
taaaagatga 
aggatcaggc 
ctacctccat 



291120 
291180 
291240 
291300 
291360 
291420 
291480 
291540 
291600 
291660 
291720 
291780 
291840 
291900 
291960 
292020 
292080 
292140 
292200 
292260 
292320 
292380 
292440 
292500 
292560 
292620 
292680 
292740 
292800 
292860 
292920 
292980 
293040 
293100 
293160 
293220 
293280 
293340 
293400 
293460 
293520 
293580 
293640 
293700 
293760 
293820 
293880 
293940 
294000 
294060 
294120 
294180 
294240 
294300 
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294420 
294480 
294540 
294600 
294660 
294720 
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caacaaggaa 
gacagccaac 
agaagaaagg 
aaagggaggt 
gacaagtttg 
ccatcgacaa 
caccgactct 
gtacagaccc 
tccaattact 
gaagacaaat 
aggtatggat 
gaatgatgat 
aatgttaatc 
ggaggatatt 
aagaacccat 
ccaaatggct 
tgaaaaattc 
tgattcggcc 
gttggttagg 
agccacttac 
gggagacatg 
gaattccatg 
tggatctgcc 
aggcacaagg 
caatattggt 
agattctgta 
tgaacaaact 
aattattccc 
cgctggtgtg 
tgccaagaaa 
gataaaattg 
ccacgcagat 
acccgctggt 
tatggaggaa 
caacttgtat 
ttcccgcccc 
tttcacccct 
aaagggtctc 
ttatttggat 
tatgatgatt 
taatcctcta 
gcagaaattt 
agtccctgct 
aagtgttcct 
atacaagatg 
agtgttgtgt 
taatacaggc 
taaacaaatg 
cgacgaagaa 
ctgtgtcacg 
taaaacaagc 
ggaggaaatt 
tgaccgttta 
cttacaacaa 
ccaaagagta 
ctccaacaag 
tgctgcagtt 
ccattggaaa 
tgtgcctttt 
cgacatggct 
tgagaacgaa 



ggggagattt 
ctggccactc 
gcagatagaa 
atggtgagtc 
tacccgatga 
attatgcaac 
ttattgttgt 
attgatattg 
cgcatagagg 
aataaaacgt 
tacttccccg 
atgcatatag 
gacagaccgt 
ctctcagaac 
tggacattta 
cgtcacatta 
gagaggttgg 
aatgatccag 
acatggaacg 
agagccgatc 
aaaagagctg 
tatggtcatt 
aatactgcct 
cactcggtga 
tgtggattac 
ttctgcgtgc 
ggcaaatatt 
atcctagtca 
ggcatgatga 
acataccata 
aaatcaaccg 
gggtatgttg 
aagaaattga 
tggtttacct 
gcctcacaaa 
atagaagcgg 
tacaaaaagg 
caatcattga 
gtgatgaaga 
actagttccc 
gctctagcga 
aagacggtga 
ggttatttta 
gcattttcag 
gttgagagcg 
tctacatctg 
gtcattgtta 
ttattgtttg 
gcagaagaag 
gaaaaaaaga 
gagaaaacgg 
gctagttcgt 
ctaaaagcaa 
tatagcaatt 
gcagaccaaa 
aagaggcaag 
aaggttaagt 
aagtgccccg 
gaagttgcac 
tgttgtcaat 
agat tggccc 



ttaggctgtg 
ctctatgtat 
gaggtgtggg 
aacctatcgt 
ccatgtgtca 
tgagggatag 
tggacgttat 
cagtcgcatc 
aaagtttggg 
ggtgcaccaa 
agattgtgtg 
ccccagcaag 
acattggcgc 
tcgaaaagga 
agggtcaaaa 
ttgaatctac 
ttagcttatc 
ctgttagact 
taaaaactga 
gagttgtgat 
gtctaaacaa 
tggccctaag 
caagtatttc 
cggccaatca 
agatggctct 
ataatattgt 
attatgtgat 
actcgttaac 
atatcgccca 
tgcttcactt 
ataacaataa 
ttccccataa 
aaagcttttt 
cttcacctac 
ttgtaggggt 
gtacagaaat 
gagcctttgt 
ttgcaagatt 
atcatgtgga 
gagtcaacaa 
taaataacca 
catcagtttc 
acgctggtag 
tcaagaattt 
ataagacggc 
ccaatactgg 
caaaggacgt 
ttggaggggg 
aggacgaaga 
tccctggacg 
agggagaaag 
tgagtaaatg 
cacattcttc 
gtacattatc 
tggaaaattt 
aaaagggagg 
ttttcccagt 
tgtccatccc 
tcaattctct 
cattgtattt 
ggcaaattgg 



tatggactct 
aggagaagga 
agtaagaaga 
caatcatgtt 
gaataatctg 
attggtactt 
tgacgagtgc 
atggaagaat 
tctaagattc 
tacatctccc 
tgacattaat 
tttagagtat 
acacataaca 
cttttctgtt 
acaatacgat 
gggaagaaat 
agacagaatt 
gtggtcttct 
cattcttaag 
gcagaacaag 
agttggacaa 
agcacgttcg 
caacatgtca 
gattacagaa 
tcctggtact 
aggtgatgga 
ggatattgct 
aaagggcatc 
tgaacgtcta 
taatgaaaat 
taagtttgca 
tccttcattg 
ggaagaggaa 
gtggatggcc 
ggagaagggt 
gatggaggcg 
gaagaaggga 
tttaccaaaa 
gaattttgca 
gtttgatacg 
cctgaaccct 
ttcttggagt 
cgtgcgttgg 
atctgttgtg 
aataaaatcc 
attttctctg 
ggctatggct 
aaaggattac 
aaatggtgaa 
aagcactaac 
aaagggctct 
tgggaagaaa 
ttgcaccaac 
ttcctatata 
aatatctcaa 
gccttttaag 
tttagatgcg 
agaaacgcgt 
aataggaaaa 
tgtcctcttg 
cctagatgac 



gctttaaccg 
gcaatctgta 
cactctattg 
ccctatcaaa 
tgcaccacta 
gaaaaaatga 
aatcagattg 
agcaactcta 
atagaaaatt 
aatatgaatg 
atgcagtttg 
atgcttcaag 
gctggaaaat 
gaaaaagatg 
ttctgtcata 
atccgtgatt 
tatcgccgcg 
cgtctaatca 
ggaatcatcc 
gccaaggagt 
aatattatga 
agccgtaaag 
gccaccggag 
aatgctcgat 
aagcagacgt 
ggaatgatac 
ctaaaaaata 
cagtttgtag 
gctgtcgctg 
agtgcagcgt 
tcctttatca 
attcttagag 
ggaattcatg 
atggatgctt 
aactggattg 
gtgacgcaag 
attacaccca 
atagaggaaa 
tcccatataa 
tcaaaagaac 
tcttcagaaa 
ctttcggcag 
gatgccacca 
cccaacgcca 
atgattgcta 
agaagaggag 
tgtatacgat 
ggtgaagacg 
aacgaagaga 
aaggatgttg 
aagacggcaa 
gatgcgagag 
aatgaagaga 
acttcagtca 
ttggatcaaa 
tctgaattgg 
tctagaaaat 
gaagaaaaac 
cacaagtgca 
tacacgctag 
tctgtagatt 



aggcggtcgt 
gaaatatggg 
ccacagacac 
cgattgacat 
cctttgtgac 
aaaacaaaac 
tgttgtccga 
atagacaaac 
tggatgccga 
tcactgccgc 
cggccaaggt 
tattgcccat 
gtcgtacatt 
aggaaattat 
gtcctgtgac 
atgaaggcaa 
ttggcgcatt 
atgttggaat 
ctcaaatgca 
ttgccaagat 
agctcggtat 
agtttgcgtc 
gaattggagg 
gtgtatttgg 
acggggatac 
cagaatatga 
aaatggctgc 
agcgccgaga 
gtcttttgtt 
tcaatgacat 
agagacctag 
cggccgaagg 
acgagaagag 
ctgttatcaa 
acgccatgac 
cgaatgcagc 
ccaccaaact 
agaaaagctg 
caaatcctgc 
agagtagacc 
tttcattggg 
aggaagggga 
acatgaaggg 
tcacatctgt 
aaaatgtaga 
cattgtcatt 
ctctaaataa 
acgacgacga 
acaaaggtga 
gtgaagaaac 
agggaaagac 
atgtcattct 
gaaccagagt 
tgaaattgga 
tacgtaa tct 
acgccatggt 
tgactcaaga 
cattaatggg 
cagatacatg 
ctttaaaatt 
tga tggctga 



294780 
294840 
294900 
294960 
295020 
295080 
295140 
295200 
295260 
295320 
295380 
295440 
295500 
295560 
295620 
295680 
295740 
295800 
295860 
295920 
295980 
296040 
296100 
296160 
296220 
296280 
296340 
296400 
296460 
296520 
296580 
296640 
296700 
296760 
296820 
296880 
296940 
297000 
297060 
297120 
297180 
297240 
297300 
297360 
297420 
297480 
297540 
297600 
297660 
297720 
297780 
297840 
297900 
297960 
298020 
298080 
298140 
298200 
298260 
298320 
298380 
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gatgttgttc 
tcaagataga 
aattatattt 
tagtgaaata 
gaattatact 
acgtgtcaat 
agaattggct 
taatttcatt 
gcagcaatag 
tcattttcta 
gcaatgattg 
acattctggc 
ttcgggaagt 
tcccacaaca 
caattctgta 
catgtatttt 
ctctacttct 
gagaggttgt 
actgttaata 
atgttcaaat 
ttccatggca 
gttgtatatg 
gataatgcat 
ggtagcgttc 
agagatattg 
gtcaggaaga 
tgttgcataa 
ttcagaacaa 
gtaagacgat 
atttctcacg 
attgaaacta 
tctgtaaaca 
agtgggtacc 
cgattttgct 
ttcacctccc 
tattttacta 
tgatgcagca 
tggcatcatc 
cagacatatt 
ttagtgttac 
aagatttgcc 
taggggaaga 
caaaaaacat 
taaaagtaag 
cagttggccc 
gagcagttgg 
atggagcagt 
gagatggagc 
gagcagttgg 
ttggccctgc 
cagttggccc 
gagcagttgg 
atggcgtgga 
gagatggagc 
gaaaggatgg 
acggttctat 
gggacggcac 
gaccaccagg 
caggacccgt 
caattggccc 
gtgtggatgg 



ggaggggata 
aagttggtga 
ttgtttgagt 
aaagacgctg 
gatgaaagat 
ggaaattgta 
gctgcatgtc 
ctggttttta 
cagcattatt 
cagcccatgg 
caatgatcaa 
taaaataaaa 
tttgcagcat 
tcacgcgtca 
ttgattatag 
acaatcgaat 
ctcccacgtg 
aaaatagagt 
tcaatagaaa 
cttactacat 
tcctctcctc 
ttcattgagc 
gcaccgacag 
tcttcatcat 
gaagccggtt 
gtctcgaata 
gatagccgcc 
agatctccta 
acatgaagtc 
tcaactcgtc 
aaatctcttt 
ggagaccaat 
gctctgcggt 
ccttcatttt 
attttattta 
cactgaaaaa 
acaatacctt 
ttctcttcaa 
tgttgtgata 
aaaaaatgtg 
tgataaggtg 
tgaggaggaa 
aagacttaaa 
gaaattacat 
tgcaggccct 
ccctgcaggc 
tggccctcaa 
agttggccct 
ccctcaagga 
aggccctcaa 
tgcaggccct 
ccctgcaggc 
cggttctgtg 
agttggccct 
tgcagttggt 
agggcctcaa 
tgcagcagaa 
aaaggatggt 
tggacctagg 
tgcaggccct 
ttctatcggc 



aactattggc 
aatctttatt 
ctttaaggtt 
ttagaggtct 
tttttggacc 
aattgtcctt 
tatcttgtgt 
ttttatatcc 
ttcagcgaca 
tcgtttaagg 
tattaatagt 
aagtttatct 
gacacactat 
tatttaaaac 
ggtttacaac 
atgagttgtc 
tattgagcga 
tatatttagg 
ctgagctata 
aacccaagaa 
ttggctcttt 
tatgatttcc 
aggtgtagaa 
agtaaccgag 
cagtagcgga 
aaataccgac 
ctgaactact 
taaaattgtg 
ctggtccagt 
tcaccttcac 
cgtggggatc 
ctgaatagta 
gcattcggta 
cccagagcgc 
ttttcctcta 
a tggcgtaca 
cttccaccct 
ccttcctctt 
acttgtgttg 
gtaaaacttt 
tacaaattgt 
gaaagggagg 
agaaagtgga 
ggcccgagag 
caaggagaaa 
cctcaaggag 
ggccctccag 
caaggagaaa 
gaaagaggag 
ggagaaagag 
ccaggagaaa 
cctccaggag 
■ggccctcaag 
gcaggccctc 
cctgcaggcc 
ggaatacaag 
agaggagaaa 
gtggatggtt 
ggtgaaccag 
ccaggagaaa 
ccccaaggaa 



ccaggaagtg 
gcctttaaat 
tgctcagaaa 
ggccttttct 
attgtataac 
tataactggt 
tttgtaagaa 
tcaagagttg 
gcttcttcat 
aatattcttt 
agcgaattaa 
ctcaatgaaa 
tactccaaaa 
atgcaccttt 
ctttaggcct 
ttgtgcacca 
gagatcatta 
tttaattagg 
tgctgacgac 
attataatct 
tagtacaatt 
cacaacatca 
taatcctcca 
cttgattttc 
aaaaatagac 
aggggaatta 
tgtattataa 
attagggcag 
tggcacgaaa 
tggtactttt 
acttccatcc 
ccttaaagct 
cacctccaaa 
atcagaatcg 
tatgataaaa 
ttgaccaagg 
ctaataggct 
ctaataaacc 
ttttaattgc 
ctaaagaaca 
ttgaaaattt 
agagggagga 
aagaaatgac 
gagaaagagg 
gaggagcaat 
aaagaggagc 
gagaaagagg 
gaggagcaat 
caattggacc 
gagaaaatgg 
gaggagcaat 
aaagaggagc 
gagaaagagg 
aaggaagaag 
ctcaaggaga 
gcccaagagg 
gaggcttccc 
ctgagggccc 
gtctagctgg 
gaggagcaac 
gaagaggagc 



ttaaaaaggg 
tataaccatg 
cctgtagctg 
accactacag 
atggatgaac 
atttatcata 
atgataataa 
tagtagtagc 
ttctattgat 
gaagcagaaa 
ttaaaataag 
aatttttatt 
ctatgctcac 
aaagcctgtt 
tcctttagag 
gacgggagcc 
agcacaactt 
aaacatttgt 
atggcttgca 
gcttcttcac 
tcttttggtt 
tcaaagaaaa 
tatctaatga 
acatgactca 
acttgttgga 
tcttcaacac 
attgttatat 
accaacatcc 
atggcagccg 
ccaaatcgtc 
tttaaagaga 
gttctatttt 
tccgtttcct 
tacctgtcgt 
accacaccta 
ggcgtttgga 
ctccaaacaa 
taggagtaca 
tgcttttata 
aaccgagaaa 
aaaggatgga 

aggggaggaa 

tgaacaaggg 
agaaactggt 
tggaccggca 
aattggaccg 
agaaaatgga 
tggaccggca 
ggcaggaaag 
acgcccagga 
tggaccggca 
aacaggtata 
agaaattgga 
aggagcaaca 
aaaaggagaa 
agagactgga 
aggaccacca 
tcaagggaaa 
cctcccagga 
tggtctacca 
aacaggccgc 



taaaagatgc 
acacaaatac 
gtatgagtgt 
gtactgtgtg 
tttgtaacga 
cggcagcagt 
aaaacatgta 
agcagcagca 
tgatatctgg 
cacataccca 
cattttgtcc 
gaccaatcac 
tcgtatctag 
ttttcttata 
atactatttt 
tgtcatgaaa 
ctgaagaaac 
ggaaacttga 
ttttcaaatc 
catcatcttc 
ggttagtcat 
ccccttctgg 
caccgactgc 
tgggtccgat 
acattttgtc 
tattaaagtt 
cggatgctat 
cgtctccagt 
ttcgtgaagg 
gtggatctac 
agagaggcac 
tttcttctag 
ttacttgaat 
cgaacgaatc 
aaatgtctaa 
gccaaatctg 
caacctccac 
tcaagagtaa 
agcaactcat 
atactagaaa 
acatttggta 
gagcttgaaa 
gatcaaggaa 
ccagcaggag 
ggaaaggatg 
gcaggaaagg 
cgcccaggaa 
ggaaaggatg 
gatggagcag 
agagatggag 
ggaagagatg 
ccaggaaggg 
cgcccaggaa 
ggacgcgcag 
gctggtaagg 
ccaccgggaa 
ggcgaaactg 
agaggagaaa 
agagatggag 
ggaaggaatg 
gcaggaaagg 



298440 
298500 
298560 
298620 
298680 
298740 
298800 
298860 
298920 
298980 
299040 
299100 
299160 
299220 
299280 
299340 
299400 
299460 
299520 
299580 
299640 
299700 
299760 
299820 
299880 
299940 
300000 
300060 
300120 
300180 
300240 
300300 
300360 
300420 
300480 
300540 
300600 
300660 
300720 
300780 
300840 
300900 
300960 
301020 
301080 
301140 
301200 
301260 
301320 
301380 
301440 
301500 
301560 
301620 
301680 
301740 
301800 
301860 
301920 
301980 
302040 
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atggggcagt 
gggatggtgt 
gaagggacgg 
caggaagaga 
aaaatggacg 
gtgcaatggg 
caggccgcgc 
aaactggacc 
gattaactgg 
aaggagaaaa 
ctgctggccc 
gccctatagg 
atggagcaac 
tggacggttc 
gagcagttgg 
atggagcaac 
tgccaggaag 
gaaaggatgg 
taggtcctgc 
ctgttggccc 
gccctgtagg 
atggaaagga 
aaagaggaga 
gcccccaagg 
agggacggcc 
taatgggtcc 
gacctgaagg 
gaggattaaa 
aggacggaat 
gattaagagg 
aaggcgatag 
ctcaaggtct 
gccgttcagg 
aaggaagaac 
tacaaggaaa 
gagaccgtgg 
gaggaattcg 
ctagaggggt 
gaccaacagg 
ctggtcccat 
gaaagggaaa 
aatattcaat 
catacattgg 
ggatgacaaa 
taacaactag 
ttttagttac 
cgaggtcaga 
ggccgtctga 
cagacgcaag 
caacgtctgt 
ctgatagagg 
tgcaagc 



tggccctgca 
ggacggttct 
ttcagttggc 
tggggcaact 
cccaggaaga 
aaagaatggc 
aggaaaggat 
agcaggaagg 
cagcccagga 
aggagaaaat 
tcaaggagaa 
tcctgctggc 
tggccctata 
tacgggtcct 
ccctgctggc 
tggccctata 
agatggagca 
ggcaactggc 
tggccctcaa 
tcaaggaaaa 
tcctgcaggc 
cggttccgtg 
aactggacga 
actaagagga 
aggaaaagat 
taggggtctt 
aagagatggt 
aggtatccag 
agaaggccct 
ttttcaagga 
agggctaaca 
tagaggagaa 
acctcaagga 
aggaatacaa 
ggaaggaaga 
agaacaaggc 
aggccctatt 
gagaggtgtt 
tcctcaaggg 
gggccctcct 
aaatggccct 
cacccatgct 
aagatcttat 
tgaggacaaa 
aggaagaaca 
tttactgatg 
gagggtgaat 
ccaatctata 
catgagttac 
gcttttatta 
aactgtaggg 



ggccctccag 
gtgggccctc 
cctgctggcc 
ggccctatag 
gatggagcaa 
gtggacggtt 
ggggcagttg 
gacggttcag 
agagatggag 
ggacgcccag 
aaaggagaaa 
cctcaaggag 
ggtcctagag 
caaggaagaa 
cctccaggag 
ggtcctgctg 
attggtcctc 
cctatgggtc 
ggagcaactg 
agaggattaa 
cctaaaggag 
ggtcctcaag 
ccaggaaggg 
gctacgggag 
ggtgaaacag 
cgaggagaaa 
gcacctggtc 
ggacgaccag 
agaggtcaag 
agaacaggag 
ggccctcaag 
aggggagcac 
agtaacggcg 
ggcctcactg 
atggggaaaa 
atcgctggag 
ggtgctcctg 
cctggctatc 
ccagcaggtc 
ggagcagtgg 
aaaggagcgg 
cgtgcagaga 
ggaactgata 
aaaaacgaa t 
tcgggtgttt 
ccagaaagtg 
gctgttagag 
ggtactcatt 
tcagttgata 
gcagaaacca 
aggttgatgg 



gagaaagagg 
caggagaaag 
ctcatggaga 
gtcctgctgg 
ctggccctat 
ctatgggtcc 
gccctgctgg 
ttggccctgc 
caactggccc 
gaagagatgg 
atggacgccc 
aaacaggatt 
gagaaactgg 
gaggagcaac 
aaagaggaga 
gccctcaagg 
aaggagaaaa 
ctccaggaga 
gtcttccagg 
tagggcgcac 
aaacaggatt 
gagcaattgg 
acggtgagga 
ctccaggacc 
gtcctccagg 
aaggagcacc 
ccgctggccc 
gaagagacgg 
atggaacaac 
aaactggtgc 
gaagagacgg 
ctggccctag 
tgcaaggacc 
gcatagaagg 
ttggacatcg 
cagacgggga 
gtaagcctgg 
ctggcgcaca 
ctcaagggcc 
gaccaagagg 
acggaaaaga 
taatgtggga 
caatccctgt 
attgtataca 
ttgtggtaag 
tttcctgtag 
aaagagaaag 
cacgttcaaa 
cattagacgc 
tacacgggga 
tacctcccac 



agcaacaggt 
aggagaaact 
aagaggagaa 
tcctcaagga 
aggccctaga 
tcaaggaaga 
ccctccagga 
tggccctcaa 
tataggtcct 
agcaactggc 
aggaagagat 
aactggacgc 
tgcaatggga 
aggccgcgca 
aaatggacgc 
agaaacagga 
gggagaaaat 
aaggggagag 
aagggatggt 
aggaagggat 
agctggcctg 
acctataggc 
tggttccaca 
gcaaggagaa 
gcgacaagga 
tggtaatgat 
tattggacct 
agaaatggga 
tggcgctaaa 
acaaggatct 
tccacccggt 
aggtcctaga 
tcgaggtccc 
tcctcgaggt 
aggagaaaag 
aaaaggtcca 
cacggaaggg 
aggggaatta 
gatggggcgt 
agagaaagga 
tgccgtaaat 
aggaaatgaa 
gatgatagaa 
agtaatgaca 
caataagaca 
aacagatgtc 
caaatcgtac 
aattgccgtg 
tgatgtggcg 
aaaagataga 
tgaagaagag 



ataccaggaa 
ggaccagcag 
aatggacgcc 
gaaaaaggag 
ggagaaactg 
agaggagcaa 
gaaagaggag 
ggagaaacag 
gctggccctc 
cctataggtc 
ggagcaactg 
ccaggaagag 
aagaatggtg 
ggaaaggatg 
ccaggaagag 
ttagctgggc 
ggacgcccag 
actggtccta 
gtggatggtt 
ggggcaattg 
ccagggatag 
ccacgaggag 
ggccctatgg 
agaggattaa 
agggatggaa 
ggtctagagg 
caaggaataa 
ccagccggca 
ggacctagag 
agaggagaaa 
gaagaaggtc 
ggtattcgtg 
cgaggaacaa 
cctagaggta 
ggtgataaag 
agaggtttac 
gttagaggtc 
ggtccccaag 
acaggagata 
ggtagaggaa 
atcatacaaa 
atcggagaag 
aatagaatag 
atgcactcaa 
gattatatcc 
agtacaaatg 
agatttatta 
gtaatgtatc 
cgaagagaaa 
ggtttctatg 
ttattggtat 



302100 
302160 
302220 
302280 
302340 
302400 
302460 
302520 
302580 
302640 
302700 
302760 
302820 
302880 
302940 
303000 
303060 
303120 
303180 
303240 
303300 
303360 
303420 
303480 
303540 
303600 
303660 
303720 
303780 
303840 
303900 
303960 
304020 
304080 
304140 
304200 
304260 
304320 
304380 
304440 
304500 
304560 
304620 
304680 
304740 
304800 
304860 
304920 
304980 
305040 
305100 
305107 



<210> 2 
<211> 855 
<212> DNA 
<213> SHRIMP 



<400> 2 

atggctgctg caaaaatgga cgctatcctt gcggatatta atgggaatga tacagattta 60 
tccaagctaa tcacagacgt gattcaaaag agagccaagg ctgtcatgga tagaaatagg 12 0 
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gctaaaatgg 
aaacaaaaag 
gtccctccaa 
catattgaac 
aacgtacaag 
gccatggtag 
gctcctacct 
tttacaaagg 
ggtaaaagta 
tgtagcgtct 
cgtgaatatg 
tttattccaa 
tcccgtatta 



acatgaatag 
cattagtggt 
cattatcggg 
ttgtagagag 
acatagacga 
tggtaaaagt 
ttggtgtgac 
ctgataaatc 
gacaactgac 
ttgcagagtt 
acaaacctaa 
tacaacccgc 
tatag 



aagagtagat 
atttgataaa 
atccgattac 
tgttgaggga 
ccaaatcggt 
cgactgtgaa 
acaaaataat 
tgtgcgtatt 
gtatgcggta 
tgaatctgac 
aaatgaattc 
agggacgaaa 



gaggctattc 
ctcgtggaag 
gacgcgtggg 
gattctttgt 
gatgagatag 
actggggata 
aaaatataca 
tttaaacttg 
aaaaataaca 
agtgacacta 
gaatatgagg 
ttattgttat 



aggaagccgt 
aaactgacag 
tagacagagc 
atgataaact 
atacaccaat 
tcgaagaaga 
gagatgaaag 
ctaaattgga 
atgaatatac 
ctaaatcagg 
agcgagagat 
actttttggt 



agcggccaag 
cggacaaagt 
catgccttcg 
ccctcctttt 
atcttacctt 
gtacaatctt 
agaccagatt 
tagtatatca 
agagtttgtc 
tataggtatt 
ctttacattt 
ggacgtcagg 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
855 



<210> 3 
<211> 284 
<212> PRT 
<213> SHRIMP 



<400> 3 



Met 


Ala 


Ala 


Ala 


1 








Asp 


Thr 


Asp 


Leu 








20 


Lys 


Ala 


Val 


Met 






35 




val 


Asp 


Glu 


Ala 




50 






Leu 


Val 


Val 


Phe 


65 








Val 


Pro 


Pro 


Thr 


Ala 


Met 


Pro 


Ser 








100 


Leu 


Tyr 


Asp 


Lys 






115 




lie 


Gly 


Asp 


Glu 




130 






Val 


Lys 


Val 


Asp 


145 








Ala 


Pro 


Thr 


Phe 


Arg 


Asp 


Gin 


He 








180 


Leu 


Ala 


Lys 


Leu 






195 




Ala 


Val 


Lys 


Asn 




210 






Ala 


Glu 


Phe 


Glu 


225 








Arg 


Glu 


Tyr 


Asp 


He 


Phe 


Thr 


Phe 








260 


Leu 


Tyr 


Phe 


Leu 






275 





Lys 


net 


ASp 


Til ~t 

Ala 


5 








Ser 


Lys 


Leu 


He 


Asp 


Arg 


Asn 


Arg 








40 


He 


Gin 


Glu 


Ala 






55 




Asp 


Lys 


Leu 


Val 




70 






Leu 


Ser 


Gly 


Ser 


85 








His 


He 


Glu 


Leu 


Leu 


Pro 


Pro 


Phe 








120 


He 


Asp 


Thr 


Pro 






135 




Cys 


Glu 


Thr 


Gly 




150 






Gly 


Val 


Thr 


Gin 


165 








Phe 


Thr 


Lys 


Ala 


Asp 


Ser 


He 


Ser 








200 


Asn 


Asn 


Glu 


Tyr 






215 




Ser 


Asp 


Ser 


Asp 




230 






Lys 


Pro 


Lys 


Asn 


245 








Phe 


He 


Pro 


He 


Val 


Asp 


Val 


Arg 



280 



lie 


Leu 


Ala 


Asp 




10 






inr 


Asp 


11.1 
vai 


lie 










Ala 


Lys 


Met 


Asp 


Val 


Ala 


Ala 


Lys 








60 


Glu 


Glu 


Thr 


Asp 






75 




Asp 


Tyr 


Asp 


Ala 




90 






Val 


Glu 


Ser 


Val 


105 








Asn 


Val 


Gin 


Asp 


He 


Ser 


Tyr 


Leu 








140 


Asp 


He 


Glu 


Glu 






155 




Asn 


Asn 


Lys 


He 




170 






Asp 


Lys 


Ser 


Val 


185 








Gly 


Lys 


Ser 


Arg 


Thr 


Glu 


Phe 


Val 








220 


Thr 


Thr 


Lys 


Ser 






235 




Glu 


Phe 


Glu 


Tyr 




250 






Gin 


Pro 


Ala 


Gly 


265 








Ser 


Arg 


He 


He 



T 1 n 

l le 


Asn 


(aly 


Asn 






lo 




Gin 


Lys 


Arg 


Ala 




30 






Met 


Asn 


Arg 


Arg 


45 








Lys 


Gin 


Lys 


Ala 


Ser 


Gly 


Gin 


Ser 








80 


Trp 


Val 


Asp 


Arg 






95 




Glu 


Gly 


Asp 


Ser 




110 






He 


Asp 


Asp 


Gin 


125 








Ala 


Met 


Val 


Val 


Glu 


Tyr 


Asn 


Leu 








160 


Tyr 


Arg 


Asp 


Glu 






175 




Arg 


He 


Phe 


Lys 




190 






Gin 


Leu 


Thr 


Tyr 


205 








Cys 


Ser 


Val 


Phe 


Gly 


He 


Gly 


He 








240 


Glu 


Glu 


Arg 


Glu 






255 




Thr 


Lys 


Leu 


Leu 




270 







<210> 4 
<211> 1461 
<212> DNA 
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<213> SHRIMP 



<400> 4 

atgttcgtca 

ataacaattt 

aggaagagat 

aacggaggag 

ggaacaacaa 

ggttctggaa 

ggaggaggaa 

gacacagacg 

tcttcaaaac 

caattatctc 

gacgcctcag 

tcatcggcct 

aatgaggaag 

gaatacgatg 

cttgttacta 

gctttccctc 

ggaactatta 

gaaggatcaa 

gtagtaaata 

tgctttgtaa 

aaaggtgatg 

catttttcaa 

ttctcagaaa 

gacgccaacg 

aatttattgt 



taagcatagc 
tagatggtgc 
atcgtaccag 
gaggaggtgg 
acggaggagg 
caacaaacgg 
atggaaatgg 
attttgagcc 
ctaaagaata 
gagatgattc 
acactctaca 
ttttagaagg 
aaattgcaca 
gaaaatatgc 
aatacacaaa 
atgagattga 
cttcaaacac 
caacagatcc 
aggaacaaca 
ttattcctcg 
aaattaaaca 
tcatcacaga 
gaattctccc 
tactgtcagc 
ccagccaata 



tacgtcattg 
aaaaacaatc 
tgagagtggt 
aggagaagga 
aggtggagga 
aggaggaggt 
aggaggaaat 
tacgccagcc 
ctatgaagca 
tacccaaact 
aggaaaacca 
taccactatc 
aacaatatta 
cacacccgag 
ccatgaagtg 
gtttgagaga 
tatgcagtac 
aggggaagtt 
ttctctagaa 
tttatccgca 
agaaacctat 
caaggatgaa 
cactttgagt 
ctacggaaag 



gtgttattct 
gactctcaac 
gacggcatag 
ggtggtggtg 
gaaggaggtg 
ggaggagaag 
ggaaatggaa 
cttctaaaag 
ttcgtatctg 
ataataattg 
agggattatt 
cgcaaagcgg 
agtcaactaa 
gaaagagctg 
ggtcttttag 
tgcataattc 
aggtccaacg 
gtccctgatg 
atatctgcaa 
ataggaaaaa 
ctgtttgtgg 
tctgttggaa 
gaccctgcaa 
cgcctaggtg 



tcttccttct 
cttttaggaa 
acggaggaac 
gaacaaatgg 
gtggtggaac 
gaggtggtgg 
atggaaatgg 
aacgtctact 
cagaggtaga 
atgacgacca 
tgttcaagct 
aagaccgtgc 
gagaaaaaca 
atttttccaa 
taggagaaac 
tagtagagga 
cttacaaaat 
attgtttggt 
ctaacagatg 
atgctaccat 
ccaataagaa 
tagaattaaa 
ccgttccaag 
ttggtgcctt 



tttcgtctca 
aaggaggaag 
tggaacaaca 
aaatggaact 
aaatggaaat 
tggaacaaat 
aggggataca 
taattccata 
aactgcctta 
attagaattg 
cgcaggtgtt 
ccgtaatata 
catcaacgat 
tagtcttaat 
tattgaaaag 
ttttaatagt 
cagagtagta 
ttttgccgta 
ccaagacata 
ggtaataagg 
tgacactact 
catgctcatt 
acctttgact 
tacagacaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1461 



<210> 5 
<211> 482 
<212> PRT 
<213> SHRIMP 



<400> 5 



Met 


Phe 


Val 


He 


1 








Leu 


Phe 


Val 


Ser 








20 


Gin 


Pro 


Phe 


Arg 






35 




Ser 


Gly 


Asp 


Gly 




50 






Gly 


Gly 


Gly 


Gly 


65 








Gly 


Thr 


Thr 


Asn 


Thr 


Asn 


Gly 


Asn 








100 


Glu 


Gly 


Gly 


Gly 






115 




Gly 


Asn 


Gly 


Asn 




130 






Phe 


Glu 


Pro 


Thr 


145 








Ser 


Ser 


Lys 


Pro 


Glu 


Thr 


Ala 


Leu 








180 


lie 


Asp 


Asp 


Asp 






195 





Ser 


He 


Ala 


Thr 


5 








He 


Thr 


He 


Leu 


Lys 


Arg 


Arg 


Lys 








40 


He 


Asp 


Gly 


Gly 






55 




Glu 


Gly 


Gly 


Gly 




70 






Gly 


Gly 


Gly 


Gly 


85 








Gly 


Ser 


Gly 


Thr 


Gly 


Gly 


Thr 


Asn 








120 


Gly 


Asn 


Gly 


Asn 






135 




Pro 


Ala 


Leu 


Leu 




150 






Lys 


Glu 


Tyr 


Tyr 


165 








Gin 


Leu 


Ser 


Arg 


Gin 


Leu 


Glu 


Leu 



200 



Ser 


Leu 


Val 


Leu 




10 






Asp 


Gly 


Ala 


Lys 


25 








Arg 


Lys 


Arg 


Tyr 


Thr 


Gly 


Thr 


Thr 








60 


Gly 


Gly 


Thr 


Asn 






75 




Gly 


Gly 


Glu 


Gly 




90 






Thr 


Asn 


Gly 


Gly 


105 








Gly 


Gly 


Gly 


Asn 


Gly 


Gly 


Asp 


Thr 








140 


Lys 


Glu 


Arg 


Leu 






155 




Glu 


Ala 


Phe 


Val 




170 






Asp 


Asp 


Ser 


Thr 


185 








Asp 


Ala 


Ser 


Asp 



Phe 


Phe 


Phe 


Leu 






15 




Thr 


He 


Asp 


Ser 




30 






Arg 


Thr 


Ser 


Glu 


45 








Asn 


Gly 


Gly 


Gly 


Gly 


Asn 


Gly 


Thr 








80 


Gly 


Gly 


Gly 


Gly 






95 




Gly 


Gly 


Gly 


Gly 




110 






Gly 


Asn 


Gly 


Gly 


125 








Asp 


Thr 


Asp 


Asp 


Leu 


Asn 


Ser 


He 








160 


Ser 


Ala 


Glu 


Val 






175 




Gin 


Thr 


He 


He 




190 






Thr 


Leu 


Gin 


Gly 



205 
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Lys 


Pro 


Arg 


Asp 


xyxr 


Leu 


Phe 


Lys 




210 










215 




Leu 


Glu 


Gly 


Thr 


Thr 


Ile 


Arg 


Lys 


225 










-D \J 






Asn 


Glu 


Glu 


Glu 


Tip 
x ic 


Ala 


Gin 


Thr 










245 








Hie: 


J. -Ltr 


AST* 


ASp 


ulU 


Tyr 


ASp 


oiy 








cvv 










Al =i 
nla 


Asp 




OCX 


Asn 


Ser 


Leu 


Asn 






77 ^ 










9fin 




vai 


biy 


Leu 


Leu 


VdJ. 




/till 
ulU 




















rfiS 




Arg 


Cys 


lie 


Tip 


Leu 


305 










•sin 

JiU 






Tin 


Th r 


Ser 


Asn 


Th v- 

i nr 


net. 


bin 


Tyr 


















Veil 


veil 


Vj-LU 


oiy 




Th r 

1 nr 


Th r 

1 nr 


Asp 








j *j u 










Cys 


L€U 


Vdi 


rue 


nlo 


val 


vai 


VdJ. 






j j j 










J Dv 


Tip 

x Its 




TV 1 -» 

nld 


Thr 


Asn 


Arg 


Owe 


bin 


















Arg 


Leu 


Ser 


Hid 


lie 


t*iy 


Lys 


Asn 


385 










390 






Asp 


Glu 


Ile 


Lys 


Gin 


Glu 


Thr 


Tyr 










405 








Thr 


Thr 


His 


Phe 


Ser 


Ile 


lie 


Thr 








420 










Glu 


Leu 


Asn 


Met 


Leu 


Ile 


Phe 


Ser 






435 










440 


Ala 


Thr 


Val 


Pro 


Arg 


Pro 


Leu 


Thr 




4 50 










455 




Gly 


Lys 


Arg 


Leu 


Gly 


Val 


Gly 


Ala 


465 










470 







Ser Gin 



Leu 


Ala 


Gly Val 


Ser 


Ser 


Ala 


Phe 








220 










Ala 


Glu 


Asp Arg 


Ala 


Arg 


Asn 


Ile 






235 










240 


Tip 


Leu 


Ser 




x^eu 


Arg 




Lys 




250 










255 




_ 

Lys 


Tyr 


Ala 


inr 


Prr* 
xrx (J 


ri„ 


V3l LI 


Arg 












£r t V 






XjVSU 


Val 


Thr 


Lys 


xyi 


X J1X 














9ft S 








XXI XT 


Ile 


Glu 


Lys 


Til a 
nlo 


Pha 

trne 


f ne 


Tip 

1 ie 








-3 r\r\ 










Vol 


Glu Asp 


Xriic 


Asn 


D€x 


oiy 


Th v- 

inr 






315 












Arg 


Ser 


Asn 


Ala 


Tyr 


Lys 


Tip 

1 ie 


Arg 




330 










335 




Pro 


Gly Glu 


Val 


v dl 


Pro 


Asp 


Asp 


"3 4 ^ 










*^sn 






Asn 


Lys 


Glu 


bin 


nxs 


C^r 
OCX 


Leu 


ulU 










J OD 








Asp 


Ile 


Cys 


Phe 


Vol 


Tip 

1 ie 


Tip 

1 ie 


Pro 








380 












Thr 


Met 


Val 


Tip 

lie 


Arg 


T wc 


oiy 






395 










400 


Leu 


Phe 


Val 


Ala 


Asn 


Lys 


Asn 


Asp 




410 










415 




Asp 


Lys 


Asp 


Glu 


Ser 


Val 


Gly 


Ile 


425 










430 






Ile 


Leu 


Pro 


Thr 


Leu 


Ser 


Asp 


Pro 










445 








Asp 


Ala 


Asn 


Val 


Leu 


Ser 


Ala 


Tyr 








460 










Phe 


Thr 


Asp 


Lys 


Asn 


Leu 


Leu 


Ser 






475 










480 



<210> 6 
<211> 903 
<212> DNA 
<213> SHRIMP 



<400> 6 

atgggagata 

gattggctat 

gagacagccc 

aaggttgaag 

getgetgata 

acagtggcaa 

tggacattag 

cgtttatcat 

tttttaagaa 

tccttgtcag 

aaggttgaca 

atttgtatgt 

aaatttcatt 

aatcgatege 

agttctgctt 

tag 



agcaaaaggt 
eggctaatgt 
aagttgttaa 
aaatgagacc 
tgataagaga 
gaactatcaa 
cctacaaacc 
ataaagacct 
taaaaaagat 
tgtcttatcc 
tacacatcag 
tagatactga 
ttgaagcagg 
atatagtaaa 
ctacaacaga 



ggaacagctt 
tgatccaatt 
acaggcegtt 
ggatataatc 
ggtagacact 
taatttaaga 
atggagaaga 
gactgtccat 
cgectatatt 
caacaagtcc 
aagaaatgat 
tttttcagga 
gaacaggact 
ctctaaaata 
actggataat 



ttaagggaat 
gttgagagat 
gatgaaaaat 
aatgaagcat 
agtagtgtta 
gataaaagga 
tatgtacaag 
gccaattccg 
aataacgacc 
gagtggggaa 
ttacaggaaa 
tatgataaag 
atgtttttac 
tgcactatag 
gtctattaca 



taaaggcaga 
ttgtcacaac 
acgatgaatt 
ccgaaacata 
ttgctcctgc 
aagaatatga 
caattacagt 
atacatattt 
gcgccagccc 
atgataaegg 
aggatcttta 
ctgtggaagt 
ccaaaacctc 
tgttccctcc 
ggatcacatg 



ggctaatgat 60 
taaatctgat 120 
attagaagat 180 
tgataaactt 240 
aatagctggc 300 
aaagaggcta 360 
gatggaattt 4 20 
aacattccct 4 80 
cgtgaactgc 540 
agttggacgt 600 
cctgtctgtt 660 
tgatgeccat 720 
taatctattt 780 
cgcttctgcc 840 
cacatgctcc 900 
903 
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<210> 7 
<211> 298 
<212> PRT 
<213> SHRIMP 



<400> 7 



Met 


Gly 


Asp 


Lys 


1 








Glu 


Ala 


Asn 


Asp 








20 


Arg 


Phe 


Val 


Thr 






35 




Ala 


Val 


Asp 


Glu 




50 






Met 


Arg 


Pro 


Asp 


65 








Ala 


Ala 


Asp 


Met 


Ala 


lie 


Ala 


Gly 








100 


Arg 


Lys 


Glu 


Tyr 






115 




Arg 


Arg 


Tyr 


Val 




130 






Lys 


Asp 


Leu 


Thr 


145 








Phe 


Leu 


Arg 


He 


Asn 


Cys 


Ser 


Leu 








180 


Asp 


Asn 


Gly Val 






195 




Leu 


Gin 


Glu 


Lys 




210 






Asp 


Phe 


Ser 


Gly 


225 








His 


Phe 


Glu 


Ala 


Leu 


Phe 


Asn 


Arg 








260 


Phe 


Pro 


Pro 


Ala 






275 




Val 


Tyr 


Tyr 


Arg 



290 



Gin 


Lys 


Val 


Glu 


5 








Asp 


Trp 


Leu 


Ser 


Thr 


Lys 


Ser 


Asp 








40 


Lys 


Tyr 


Asp 


Glu 






55 




He 


He 


Asn 


Glu 




70 






He 


Arg 


Glu 


Val 


85 








Thr 


Val 


Ala 


Arg 


Glu 


Lys 


Arg 


Leu 








120 


Gin 


Ala 


He 


Thr 






135 




Val 


His 


Ala 


Asn 




150 






Lys 


Lys 


He 


Ala 


165 








Ser 


Val 


Ser 


Tyr 


Gly Arg 


Lys 


Val 








200 


Asp 


Leu 


Tyr 


Leu 






215 




Tyr 


Asp 


Lys 


Ala 




230 






Gly Asn 


Arg 


Thr 


245 








Ser 


His 


He 


Val 


Ser 


Ala 


Ser 


Ser 








280 


He 


Thr 


Cys 


Thr 






295 





Gin Leu Leu Arg 
10 

Ala Asn Val Asp 
25 

Glu Thr Ala Gin 

Leu Leu Glu Asp 
60 

Ala Ser Glu Thr 
75 

Asp Thr Ser Ser 
90 

Thr He Asn Asn 
105 

Trp Thr Leu Ala 

Val Met Glu Phe 
140 

Ser Asp Thr Tyr 
155 

Tyr He Asn Asn 
170 

Pro Asn Lys Ser 
185 

Asp He His He 

Ser Val He Cys 
220 

Val Glu Val Asp 
235 

Met Phe Leu Pro 
250 

Asn Ser Lys He 
265 

Ala Ser Thr Thr 
Cys Ser 



Glu 


Leu 


Lys 


Ala 






15 




Pro 


He 


Val 


Glu 




30 






Val 


Val 


Lys 


Gin 


45 








Lys 


Val 


Glu 


Glu 


Tyr 


Asp 


Lys 


Leu 








80 


Val 


He 


Ala 


Pro 






95 




Leu 


Arg 


Asp 


Lys 




110 






Tyr 


Lys 


Pro 


Trp 


125 








Arg 


Leu 


Ser 


Tyr 


Leu 


Thr 


Phe 


Pro 








160 


Asp 


Arg 


Asp 


Val 






175 




Glu 


Trp 


Gly 


Asn 




190 






Arg 


Arg 


Asn 


Asp 


205 








Met 


Leu 


Asp 


Thr 


Ala 


His 


Lys 


Phe 








240 


Lys 


Thr 


Ser 


Asn 






255 




Cys 


Thr 


lie 


Val 




270 






Glu 


Leu 


Asp 


Asn 


285 









<210> 8 
<211> 2376 
<212> DNA 
<213> SHRIMP 

<400> 8 

atggaggaag aaagtcaacg agtacagagg 

tcccagatcc tgaaagacac caagttaaga 

tactctgttt cagttataaa aagtgctctc 

ataactgcgt ggtcaatcag ggaggcttat 

gaaaacgtaa aacctgctgc aaaatcatta 

atagctgtgg aggataccag tcctagaagt 

ctagaaaaat atgaaaaggg gaattttagg 

aggctagttc acgcctccag ttctagagtg 

gaagaggaca gtgataaatt agggggtatt 



aggatcggag tcctcccaga agaggctgcc 60 
gtatcatatc tgggcgtggg acattgggga 120 
cagaaaggat gcaggaggaa cgatgaggat 180 
ctctattacc atcttggatt gaattacatt 240 
aatacaaata tggtaaacag aattaaaatc 300 
atggtggctt ctaatgagtg tgtgagaact 360 
caacccagtt acttgatgga tgctgccatg 4 20 
tgctcccaca tgagagcttt gtgttgcaag 4 80 
tattatgcta attttaatga acttgaaact 540 
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cagtgtgtta 
attgaaagtg 
gcttaccatg 
ggaccattca 
cagcacgtaa 
aattggagaa 
acatctattg 
attcagaaaa 
gctacatttg 
ctggacaaac 
gtttctggag 
cgtctagatt 
ttgaagcgag 
ataaaggctt 
ataaagaaaa 
gctactacta 
ccaagtggaa 
gaggatttag 
acagttttaa 
tggggttctc 
tatttgttgc 
atagtaaagg 
aatgccatgt 
ataacagatg 
aggcaattaa 
tcaaacattc 
gcaaaggatc 
aagacttcct 
tggctatttg 
aatatgggaa 
ggggtcgtga 



gtgcagttaa 
taaaattggg 
tgttaagata 
agcgaaaaga 
aaacagaccc 
gggatttttt 
tggaattaat 
gggatttaaa 
attggataga 
atacgaacaa 
gagacaagcg 
ctccccctcc 
aagctgctag 
ctagttttac 
gaaaaattga 
gtagttctat 
gcaataaagt 
atagagtact 
tgaaggatgg 
atcagaattt 
catgccctga 
atgaaatttc 
gttatttcat 
tgcgcctttc 
tacacatttt 
tcgtaggtga 
cgaggacagc 
ttaaggtaaa 
atacatcaaa 
ttggtcccac 
atgacctgct 



tttttctccc 
gaagaaaagt 
ttatggggat 
gtttgagcag 
tgaattacgt 
ctgctcaaaa 
tatagccatg 
acgtttcaat 
aggacatgtt 
aaacacacat 
ctggtcccca 
cccagaagtg 
tcattgcgta 
tgctaatatt 
gattaaggat 
tacttctact 
agactctata 
agaagtacac 
aaataaagtt 
tgttcaagta 
tagcggacca 
gagtacagct 
ctctgggtgt 
acatatggga 
aatttttaga 
tggaggaaac 
tctcgaaaat 
caaggtaaca 
aagtgataaa 
tactttggat 
ttatgacaac 



atagaaagaa 
gtacagttat 
aaagtgaaag 
ttttgggggt 
tgttatttta 
gggttcttta 
tgcataggtg 
aaaggagaag 
aaacggatgc 
ggagtatctt 
ggggtgtggc 
ggccaatttc 
acgaggaata 
aattctgaac 
gataatacta 
ccacctccta 
caattgaata 
aaccaaaact 
gtgtttaagg 
ctaaaggatg 
tacagaggat 
gctaggatag 
gtgactagac 
ccaaataaac 
ttgttgactg 
ttgttctctg 
agaaagataa 
aaggaagata 
ataatgaatg 
attgtaaaga 
aaataa 



ttaaacacgt 
taaatttaag 
atacaaataa 
tatgctttaa 
atgagttgac 
gagaagaaag 
accgtaaaca 
aagggagaaa 
ctcaaatgcc 
ttgccctaga 
ttcattctta 
tggatcaggc 
tttgtactac 
cgatggaaat 
ctgctactgt 
caaagaaaca 
atctaccaac 
ctaaaaaggg 
agatgagaaa 
aagatgtgtg 
tatacagatg 
aaaaagtgaa 
aagaaggaat 
aatatgtgta 
gtgtgagtga 
tggacgagaa 
aggaactaca 
tagatagctg 
gtgtatgtaa 
ataattgtac 



cttcagggag 
aagtgtagca 
gaaacatagt 
atttgttact 
atatgccata 
tctattcctt 
gtttgccaag 
agaagaggct 
tgtatgggtt 
gagtagtatg 
tacaaagatg 
ttttaataca 
tacaggattt 
taaggaagaa 
tactgttagt 
aaaaacaact 
tttaaatatg 
tgtagctgct 
aagctttggt 
taaattggac 
ctatttcaaa 
atggggagaa 
tgggaaaatt 
tgataactat 
taccaacaca 
ttatgtgggc 
attgctactg 
tcttccatct 
tattggtaaa 
ttgtatttta 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2376 



<210> 9 
<211> 785 
<212> PRT 
<213> SHRIMP 



<400> 9 



Met 


Glu 


Glu 


Glu 


Ser 


Gin 


Arg 


Val 


Gin 


Arg 


Arg 


He 


Gly 


Val 


Leu 


Pro 


1 








5 










10 










15 




Glu 


Glu 


Ala 


Ala 


Ser 


Gin 


He 


Leu 


Lys 


Asp 


Thr 


Lys 


Leu 


Arg 


Val 


Ser 








20 










25 










30 






Tyr 


Leu 


Gly 


Val 


Gly 


His 


Trp 


Gly 


Tyr 


Ser 


val 


Ser 


Val 


He 


Lys 


Ser 






35 










40 










45 








Ala 


Leu 


Gin 


Lys 


Gly 


Cys 


Arg 


Arg 


Asn 


Asp 


Glu 


Asp 


He 


Thr 


Ala 


Trp 




50 










55 










60 










Ser 


lie 


Arg 


Glu 


Ala 


Tyr 


Leu 


Tyr 


Tyr 


His 


Leu 


Gin 


Tyr 


He 


Glu 


Asn 


65 










70 










75 










80 


Val 


Lys 


Pro 


Ala 


Ala 


Lys 


Ser 


Leu 


Asn 


Thr 


Asn 


Met 


Val 


Asn 


Arg 


He 










85 










90 










95 




Lys 


He 


He 


Ala 


Val 


Glu 


Asp 


Thr 


Ser 


Pro 


Arg 


Ser 


Met 


Val 


Asn 


Glu 








100 










105 










110 






Cys 


Val 


Arg 


Thr 


Leu 


Glu 


Lys 


Tyr 


Glu 


Lys 


Gly Asn 


Phe 


Arg 


Gin 


Pro 






115 










120 










125 








Ser 


Tyr 


Leu 


Met 


Asp 


Ala 


Ala 


Met 


Arg 


Leu 


Val 


His 


Ala 


Ser 


Ser 


Ser 




130 










135 










140 










Arg 


Val 


Cys 


Ser 


His 


Met 


Arg 


Ala 


Leu 


Cys 


Cys 


Lys 


Glu 


Glu 


Asp 


Ser 


145 










150 










155 










160 


Asp 


Lys 


Leu 


Gly 


Gly 


He 


Tyr 


Tyr 


Ala 


Asn 


Phe 


Asn 


Glu 


Leu 


Glu 


Thr 










165 










170 










175 




Gin 


Cys 


Val 


Ser 


Ala 


Val 


Asn 


Phe 


Ser 


Pro 


He 


Glu 


Arg 


He 


Lys 


His 
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180 



vol 




Arg 








JL -7 —J 




Leu 


Leu 


Asn 


Leu 




£. x u 






Gly 


Asp 


Lys 


Val 


225 








Arg 


Lys 




rile 


Gin 


His 


vai 


Lys 








o c n 


inr 


Tyr 


Ala 


i le 










Fne 


Arg 


Glu 


bill 










Ala 


Met 


Cys 


He 


305 








Asp 


Leu 


Lys 


Arg 


Ala 


Thr 


pne 


ASp 










Pro 


Val 


Trp 


val 






"3 C C 




oer 


fne 




Leu 




o /U 






Ser 


Pro 


Gly 


Val 


385 








Pro 


Pro 


Pro 


Pro 


Leu 


Lys 


Arg 


Glu 








420 


Thr 


Thr 


Gly 


Phe 






4 35 




Glu 


Pro 


Met 


Glu 




450 






Lys 


Asp 


Asp 


Asn 


465 








Ser 


Ser 


lie 


Thr 


Pro 


Ser 


Gly 


Ser 








500 


i nr 


Leu 


Asn 


Ma 4- 






etc 
515 




Asn 


Ser 


Lys 


Lys 




t "3 n 
5 JU 






Lys 


Val 


Val 


Phe 


545 








Gin 


Asn 


Phe 


Val 


Tyr 


Leu 


Leu 


Pro 








580 


Phe 


Lys 


He 


Val 






595 




Lys 


Val 


Lys 


Trp 




610 






Val 


Thr 


Arg 


Gin 


625 








Ser 


His 


Met 


Gly 


Leu 


lie 


His 


He 



660 



lie 


r l ii 
ulu 


JC1 


val 










Arg 




1 


Ala 










T a/c 

i-»y & 


nop 


X 11 JL 


Ad XI 










Glu 


Gin 


Phe 


Trp 


245 








TV* v 

inr 


Asp 


trro 


Glu 


Asn 


Trp 


TV m 

Arg 


Arg 








zoU 


Ser 


.Leu 


rne 


Leu 






£. y 5 




Giy 


Asp 


Arg 


Lys 




jiu 






Phe 


Asn 


Lys 


Gly 


325 








Trp 


lie 


GlU 


Giy 


Leu 


ASp 


Lys 


HIS 








"5 £ A 


GlU 


ser 


Ser 


Met 






6 /5 




Trp 


Leu 


nlS 


Ser 










Glu 


Val 


Gly 


Gin 


405 








Ala 


Ala 


Ser 


HIS 


He 


Lys 


Ala 


Ser 








A A f\ 

4 4 0 


lie 


Lys 


GlU 


GlU 






455 




Tnr 


Tnr 


Ala 


Tnr 




4 /□ 






Ser 


Thr 


Pro 


Pro 


485 








Asn 


Lys 


vai 


Asp 


GlU 


Asp 


Leu 


Asp 








con 


bly 


vai 


Hid 


Ala 










Lys 


GlU 


Met 


Arg 












val 


Leu 


Lys 


565 








Cys 


Pro 


Asp 


Ser 


Lys 


Asp 


Glu 


He 








600 


Gly 


Glu 


Asn 


Ala 






615 




Glu 


Gly 


He 


Gly 




630 






Pro 


Asn 


Lys 


Gin 


645 








Leu 


He 


Phe 


Arg 



185 

Lys Leu Gly Lys 



nid 


iyr 


His 


Val 








220 


Lys 


Lys 


His 


Ser 






235 




Gly 


Leu 


Cys 


Phe 




250 






Leu 


Arg 


Cys 


Tyr 


O £L C 








Asp 


r*ne 


Phe 


Cys 


inr 


Ser 


He 


Val 








300 


Gin 


irne 


Ala 


Lys 






315 




Glu 


Glu 


Gly Arg 




330 






nis 


vai 


Lys 


Arg 


345 








Thr 


Asn 


Lys 


Asn 


vai 


O A V 

ber 


Gly 


Gly 








380 


Tyr 


Thr 


Lys 


Met 






395 




Phe 


Leu 


Asp 


Gin 




410 






Cys 


Val 


Thr 


Arg 


425 








Ser 


Phe 


Thr 


Ala 


lie 


Lys 


Lys 


Arg 








460 


Val 


Thr 


Val 


Ser 






475 




Pro 


Thr 


Lys 


Lys 




4 90 






Ser 


He 


Gin 


Leu 


505 








Arg 


Val 


Leu 


Glu 


Thr 


IT- t 

val 


Leu 


Met 








540 


Lys 


Ser 


Phe 


Gly 






555 




Asp 


Glu 


Asp 


Val 




570 






Gly 


Pro 


Tyr 


Arg 


585 








Ser 


Ser 


Thr 


Ala 


Met 


Cys 


Tyr 


Phe 








620 


Lys 


He 


He 


Thr 






635 




Tyr 


Val 


Tyr 


Asp 




650 






Leu 


Leu 


Thr 


Gly 



665 





190 






Lys 


Ser 


Val 


Gin 


205 








Leu 


Arg 


Tyr 


Tyr 


Gxy 


lr AO 


lr Iltr 


i*y s 










Lys 




VOX 








^ JJ 




c*ne 


Asn 


Pin 
ulU 


Leu 










Cor 
OCX 


Lys 


uiy 




285 








Glu 


Leu 


He 


He 


lie 


Gin 


T tic 

iiys 


Arg 








u 


Lys 




ulU 


nld 






O «J -J 




net 


Pro 


Gin 


WCL 










inr 


HIS 


Giy 


vai 


365 








Asp 


Lys 


Arg 


Trp 


Arg 


Leu 


Asp 


Ser 








flUU 


Ala 




Asn 


xnr 






415 




Asn 


i le 


Cys 


inr 




4 JO 






Asn 


i le 


Asn 


C A V 


445 








Lys 


lie 


Glu 


He 


Ala 


xnr 


Tnr 


oer 








1 o u 


Gin 


Lys 


Tnr 


xnr 










Asn 


Asn 


Leu 


pro 




Jlu 






val 


HIS 


Asn 


Gin 


525 








Lys 


Asp 


Gly 


Asn 


Trp 


Giy 


oer 


Una 

nis 










cys 


Lys 


Leu 


Asp 






O / o 




Gly 


Arg 


Cys 


Tyr 




590 






Ala 


Arg 


He 


Glu 


605 








He 


Ser 


Gly 


Cys 


Asp 


Val 


Arg 


Leu 








640 


Asn 


Tyr 


Arg 


Gin 






655 




Val 


Ser 


Asp 


Thr 




670 
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Asn 


Thr 


o a w- 
675 


Asn 


I ie 


Leu 


v a jl 


uiy 
680 


Asp 


Glu 
690 


Asn 


Tyr 


val 


Gly 


Ala 
695 


Lys 


Arg 


Lys 


He 


Lys 


Glu 


Leu 


Gin 


Leu 


705 










710 






Asn 


Lys 


Val 


Thr 


Lys 
725 


Glu 


Asp 


He 


Phe 


Asp 


Thr 


Ser 
740 


Lys 


Ser 


Asp 


Lys 


Gly 


Lys 


Asn 
755 


Met 


Gly 


He 


Gly 


Pro 
760 


Asn 


Cys 
770 


Thr 


Cys 


He 


Leu 


Gly 
775 


Val 



Lys 
785 



Asp 


tjxy 


«xy 


Asn 


Leu 
685 




oer 


v a. J. 


Asp 


Pro 


Arg 


Thr 
700 


Ala 


Leu 


Glu 


Asn 


Leu 


Leu 


Lys 
715 


Thr 


Ser 


Phe 


Lys 


Val 
720 


Asp 


Ser 
730 


Cys 


Leu 


Pro 


Ser 


Trp 
735 


Leu 


He 


Met 


Asn 


Gly 


Val 


Cys 


Asn 


He 


745 










750 






Thr 


Thr 


Leu 


Asp 


He 
765 


Val 


Lys 


Asn 


Val 


Asn 


Asp 


Leu 


Leu 


Tyr 


Asp 


Asn 



780 



<210> 10 
<211> 1857 
<212> DNA 
<213> SHRIMP 



<400> 10 

atggctttaa 

gacatggctc 

ctttatagag 

aacacaaaca 

gacagaactg 

tctatactcg 

agtgggaaaa 

tcatacaaag 

aagagtacgt 

atggattcga 

gccttgtaca 

ctgactcata 

tggaggaaga 

ctgaagaaca 

atgtacaaga 

tcggacagtg 

gatgaagcat 

gtcacaacaa 

cacatccacc 

ttaatgaagg 

gtgaaccctg 

aaacctcctt 

ctttcttctg 

aaagaattgg 

aatctatcta 

attactgcca 

gctctgtggg 

gcggctgaca 

aaaactgtaa 

aatgtagttg 

tatatgtctc 



aagacgcatt 
cagttgttga 
ttctggtagt 
aaagacaatg 
tatctaattt 
attactggta 
gtgatacgtt 
acatgaaaac 
tggacccttt 
aatatgggag 
cgactgtata 
aacctgttac 
tcgtcaaggc 
taatgaagaa 
aaattggtaa 
aagatgatga 
cagaaaggga 
tcactggcga 
ctgaatggaa 
ccatggaaac 
tacagatttt 
ctgatgtcaa 
gtgcgggtgt 
acattatttc 
ccagaatttg 
gaggtgctgt 
ccaagaaaat 
aaaatagtag 
cgtttggatg 
taagcttcga 
agcaagtttt 



caccgagcgg 
agatgaccgc 
tgaacagaaa 
ccgccaggta 
accccaagct 
caagaacgtt 
tctgaggact 
ttgttcgctg 
attgaaggac 
taaaactaca 
taatcactgg 
gggtaagaaa 
ttacgtgtcc 
tgagagtgct 
cagtgtcaag 
tgatgatgat 
aaagaagcac 
ttacatcttt 
aatacctatg 
tgacatatca 
acccatcgct 
attttgtatg 
gtttggtcga 
aaactgtcta 
gagagaagga 
tttagggtat 
ccaggacgct 
gaactgtgat 
tgcaaatatg 
ggattctatt 
tactcactca 



ttagttgtga 
cagaaatctc 
aattctgcaa 
gaagaagaca 
gttaaagaaa 
atacctcttc 
gcagtcatct 
atatatgaat 
attctcgaca 
tcccctgagc 
acaaacgcct 
tatgcatcag 
aaacacaaca 
gataatgcta 
aacactaata 
gacgatgact 
caagaatctc 
aaagtgcttc 
tgtgattttg 
aatgctgtaa 
gctaacttct 
atgtttaatg 
tttattggag 
agaagtttat 
ggttctgttg 
ggtgaacaat 
gtaacagaat 
ttgttaggag 
ttgaaaaccc 
caggctaata 
tttccagctc 



acaaagttgg 
tcttccaaaa 
ttactctctc 
aagtgatttt 
ccatcgcagc 
ttgacacgaa 
gtcttgtgag 
tcgaacacaa 
acaaacaaga 
tggcaaagga 
tcaagctcta 
tcatacactt 
acgtggagag 
atgttttaac 
ataatagtgc 
gtgaaggcat 
tatacccaat 
tagagcttgt 
ttaataggaa 
tagaagttag 
gggatttttg 
aaccatcttc 
gacccttttc 
tgttgaataa 
tctgttttaa 
tgtccgagag 
ctgtcaagag 
acgagggtgt 
agaacggtat 
aggaaggggc 
tagatcaagg 



ctccggtaca 
agtggaaaat 
tggaaacaaa 
tgaaggagag 
caatgcagaa 
gaaagaaaga 
atgttgtgtg 
aattttaaat 
actactacat 
aaccatcgag 
ccaagcctcg 
tataaggaca 
ggatctttca 
tattgaaaaa 
ccaccaaatg 
ggatgtttgt 
caacactcca 
tctttcaccc 
tatccccaaa 
agcttcaaag 
taagagcggg 
taacgagacc 
ccacaagagt 
agaagcggac 
ttactgcccc 
atctattaaa 
gcaaaggaat 
agtatccatg 
ggggaaattt 
ggccagacag 
aaaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1857 
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<400> 11 




























Met 


Ala 


Leu 


Lys 


Asp 


Ala 


Phe 


Thr 


Glu 


Arg 


Leu 


Val 


Val 


Asn 


Lys 


Val 


1 








5 










10 










15 




Gly 


Ser 


Gly 


Thr 


Asp 


Met 


Ala 


Pro 


Val 


Val 


Glu 


Asp 


Asp 


Arg 


Gin 


Lys 








20 










25 










30 






Ser 


Leu 


Phe 


Gin 


Lys 


Val 


Glu 


Asn 


Leu 


Tyr 


Arg 


Val 


Leu 


Val 


Val 


Glu 






35 










40 










45 








Gin 


Lys 


Asn 


Ser 


Ala 


He 


Thr 


Leu 


Ser 


Gly 


Asn 


Lys 


Asn 


Thr 


Asn 


Lys 




50 










55 










60 










Arg 


Gin 


Cys 


Arg 


Gin 


Val 


Glu 


Glu 


Asp 


Lys 


Val 


He 


Phe 


Glu 


Gly 


Glu 


65 










70 










75 










80 


Asp 


Arg 


Thr 


Val 


Ser 


Asn 


Leu 


Pro 


Gin 


Ala 


Val 


Lys 


Glu 


Thr 


He 


Ala 






85 










90 










95 




Ala 


Asn 


Ala 


Glu 


Ser 


He 


Leu 


Asp 


Tyr 


Trp 


Tyr 


Lys 


Asn 


Val 


He 


Pro 








100 










105 










110 






Leu 


Leu 


Asp 


Thr 


Lys 


Lys 


Glu 


Arg 


Ser 


Gly 


Lys 


Ser 


Asp 


Thr 


Phe 


Leu 






115 










120 










125 








Arg 


Thr 


Ala 


Val 


He 


Cys 


Leu 


Val 


Arg 


Cys 


Cys 


Val 


Ser 


Tyr 


Lys 


Asp 




130 










135 










140 










Met 


Lys 


Thr 


Cys 


Ser 


Leu 


He 


Tyr 


Glu 


Phe 


Glu 


His 


Lys 


He 


Leu 


Asn 


145 








150 










155 










160 


Lys 


Ser 


Thr 


Leu 


Asp 


Pro 


Leu 


Leu 


Lys 


Asp 


He 


Leu 


Asp 


Asn 


Lys 


Gin 










165 










170 










175 




Glu 


Leu 


Leu 


His 


Met 


Asp 


Ser 


Lys 


Tyr 


Gly 


Ser 


Lys 


Thr 


Thr 


Ser 


Pro 








180 










185 










190 






Glu 


Leu 


Ala 


Lys 


Glu 


Thr 


He 


Glu 


Ala 


Leu 


Tyr 


Thr 


Thr 


Val 


Tyr 


Asn 






195 










200 










205 








His 


Trp 


Thr 


Asn 


Ala 


Phe 


Lys 


Leu 


Tyr 


Gin 


Ala 


Ser 


Leu 


Thr 


His 


Lys 




210 










215 










220 










Pro 


Val 


Thr 


Gly 


Lys 


Lys 


Tyr 


Ala 


Ser 


Val 


He 


His 


Phe 


He 


Arg 


Thr 


225 










230 










235 










240 


Trp 


Arg 


Lys 


He 


Val 


Lys 


Ala 


Tyr 


Val 


Ser 


Lys 


His 


Asn 


Asn 


Val 


Glu 










245 










250 










255 




Arg 


Asp 


Leu 


Ser 


Leu 


Lys 


Asn 


He 


Met 


Lys 


Asn 


Glu 


Ser 


Ala 


Asp 


Asn 








260 










265 










270 






Ala 


Asn 


Val 


Leu 


Thr 


He 


Glu 


Lys 


Met 


Tyr 


Lys 


Lys 


He 


Gly 


Asn 


Ser 






275 










280 










285 








Val 


Lys 


Asn 


Thr 


Asn 


Asn 


Asn 


Ser 


Ala 


His 


Gin 


Met 


Ser 


Asp 


Ser 


Glu 




290 










295 










300 










Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Cys 


Glu 


Gly 


Met 


Asp 


Val 


Cys 


305 










310 










315 










320 


Asp 


Glu 


Ala 


Ser 


Glu 


Lys 


Lys 


His 


Gin 


Glu 


Ser 


Leu 


Tyr 


Pro 


He 


Asn 








325 










330 










335 




Thr 


Pro 


Val 


Thr 


Thr 


lie 


Thr 


Gly Asp 


Tyr 


He 


Phe 


Lys 


Val 


Leu 


Leu 








340 










345 










350 






Glu 


Leu 


Val 


Leu 


Ser 


Pro 


His 


He 


His 


Pro 


Glu 


Trp 


Lys 


He 


Pro 


Met 






355 










360 










365 








Cys 


Asp 


Phe 


Val 


Asn 


Arg 


Asn 


He 


Pro 


Lys 


Leu 


Met 


Lys 


Ala 


Met 


Asp 




370 










375 










380 










He 


Ser 


Asn 


Ala 


Val 


He 


Glu 


Val 


Arg 


Ala 


Ser 


Lys 


Val 


Asn 


Pro 


Val 


385 










390 










395 










400 


Gin 


He 


Leu 


Pro 


He 


Ala 


Ala 


Asn 


Phe 


Trp 


Asp 


Phe 


Cys 


Lys 


Ser 


Gly 










405 










410 










415 




Lys 


Pro 


Pro 


Ser 


Asp 


Val 


Lys 


Phe 


Cys 


Met 


Met 


Phe 


Asn 


Glu 


Pro 


Ser 








420 










425 










430 






Ser 


Asn 


Glu 


Thr 


Leu 


Ser 


Ser 


Gly 


Ala 


Gly 


Val 


Phe 


Gly 


Arg 


Phe 


He 






435 










440 










445 








Gly 


Gly 


Pro 


Phe 


Ser 


His 


Lys 


Ser 


Lys 


Glu 


Leu 


Asp 


He 


He 


Ser 


Asn 




450 










455 










4 60 










Cys 


Leu 


Arg 


Ser 


Leu 


Leu 


Leu 


Asn 


Lys 


Glu 


Ala 


Asp 


Asn 


Leu 


Ser 


Thr 


465 










470 










475 










480 
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Arg 


T 1 a 

lie 


Trp 


Arg 


Glu 


uiy 


f* 1 w 




Ua 1 
v d _L 


vet J. 


cys 


tilt: 


Asn 


A Jr *■ 


Cys 


Pro 








485 










490 










4 95 




lie 


Tnr 


Arg 


Ala 


Val 


Leu 


biy 


Tyr 








Leu 


Cor- 


Ser 


lie 


Lys 






500 










V# J 










510 






Ala 


Leu 


Trp 
515 


Ala 


Lys 


LyS 


lie 


OJLI1 

520 


Asp 




v a. j. 


TK r 

X li X. 


Glu 
525 


Ser 


Val 


Lys 


Arg 


Gin 
530 


Arg 


Asn 


Ala 


Ala 


Asp 
535 


Lys 


Asn 


Ser 


Arg 


Asn 
540 


Cys 


Asp 


Leu 


Leu 


Gly 


Asp 


Glu 


Gly Val 


val 


Ser 


Met 


Lys 


Thr 


Val 


Thr 


Phe 


Gly 


Cys 


Ala 


545 










550 










555 










560 


Asn 


Met 


Leu 


Lys 


Thr 
565 


Gin 


Asn 


Gly 


Met 


Gly 
570 


Lys 


Phe 


Asn 


Val 


Val 
575 


Val 


Ser 


Phe 


Glu 


Asp 
580 


Ser 


lie 


Gin 


Ala 


Asn 
585 


Lys 


Glu 


Gly 


Ala 


Ala 
590 


Arg 


Gin 


Tyr 


Met 


Ser 


Gin 


Gin 


Val 


Phe 


Thr 


His 


Ser 


Phe 


Pro 


Ala 


Leu 


Asp 


Gin 




595 










600 










605 









Gly Lys 
610 



<210> 12 
<211> 5811 
<212> DNA 
<213> SHRIMP 



<400> 12 

atggggagta 

aacaacaaca 

gctaccagaa 

tcttcttctt 

gecagaaagg 

aactcgattt 

cctggagctg 

ttgattgatt 

agtctcatgt 

ttttctaccc 

ctagaaaaaa 

aatttaaggt 

attgatgetg 

gttatactac 

atccagaaga 

aacaagaaca 

gacgaaaacg 

gctgagacga 

aattttggcc 

atgttttgct 

cgccatgaag 

teaagtgeta 

ccaagttctt 

gaagatgacg 

gaaaaggaca 

gatgeaaacg 

gatatgatga 

gctcacatat 

gtatgcagta 

cgagaaaaat 

tccttaatgg 

atagatgeag 

aggaacgect 

tttaaagata 

gaactactag 



aacgtccgtg 
ataattcaaa 
cccaggtgct 
cctcctcctc 
tattgagcac 
ctgtcagtta 
ggaaaactac 
ctggatctgc 
gtgcatgcaa 
tcaacaaagg 
tccaatttgt 
tccttgttat 
ttttaegtat 
ttggagataa 
aaataaagaa 
agaagaagaa 
aagaagaaga 
agaaagagga 
aagaagataa 
ctagagactt 
caattttaat 
ttaaagttga 
ccccggccca 
atttctttaa 
gagcateggg 
aaaagataag 
aatctctccc 
ctgaactcaa 
ttgtgttcga 
tgtatattgt 
atgaagagat 
taccattcaa 
tctttagaga 
atttgegtae 
cgaaatctga 



ctccagcggc 
tccagttcca 
cagaagtgea 
ctctgcttct 
tgtttccttg 
tgctggagga 
aatggtcaag 
agacatgetc 
gaaaccagga 
atttgagatt 
ggccgataaa 
cgatgagtat 
agegaaacac 
caggcaaaat 
cccgagtgag 
gaaagagaag 
gggggaggag 
agaaaagagt 
tgcaaaacta 
ttttggcaac 
gaagagtaat 
agactgtggt 
atccacggca 
caatgaagca 
agctttgaat 
aagtggaaca 
cctttccgct 
gaaaatgagc 
catgatggee 
ttctagecta 
attgaaegtc 
ttctgeacat 
tggaaaagac 
agttgcttca 
aaatataaga 



caagagecag 
gttatcaaca 
gttgcagctg 
geggtcaage 
caacagtctc 
ggaatatctg 
gaactaattg 
ctgtgttgca 
ggttcttcct 
ccagttatat 
ctaaaatgga 
acaatggeca 
aggecagata 
tetgeagtag 
gaagaaaaac 
aaagagaaag 
gaggaagaag 
accttct tec 
tacactgaag 
ccaagtaacc 
aacgtcaagt 
aatgettcta 
gaagaaaatt 
tttttgaaac 
ggattctctt 
acttctgtat 
ctaatagaag 
aatgeaaate 
aaagccatga 
tcagagagat 
aagtatgtgc 
aaccgtgcgt 
tttgtggatg 
t ttttagaga 
tctattctct 



ttaccaaaaa 
tcaaatcata 
cagcggcatc 
tcccagatac 
tggcagtgag 
tgttccatct 
cagttcttaa 
aatctaattc 
tgatgtaccc 
ttaggaaaga 
aagtgataca 
gttgcaggga 
tacctttegg 
tggaagataa 
ctcagaagaa 
gtggtgaaga 
aagaagaaag 
aagggtcagt 
tttttataaa 
tgeggaatat 
cagttaacaa 
ataaaaagga 
gtgacgaatt 
tgatggaaag 
tacgetgeaa 
cagacaagaa 
aaggaatatg 
ragaaaagta 
gagaaattga 
ttaaggatac 
acggttctga 
ctgctgtggc 
aaactcctat 
atgaaacttt 
tgaagaaaga 



acaaaagaag 
cccctttctg 
cccttctggg 
gtgeaaggag 
gtatctgtgc 
gggtggtctt 
cgaccacggc 
tgcaaaggag 
agaaagtgtg 
tgaaataact 
agtgttggca 
attggtgttt 
aggagtgttt 
cactaaccat 
caacaagaac 
agaagaggga 
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Ala 


Met 


Gin 


Met 




1010 








1015 








1020 








Val 


Thr 


Ser 


He 


Ser 


Lys 


Gly 


Asn 


Glu 


Arg 


Arg 


Lys 


Lys 


He 


Glu 


Glu 


1025 








1030 








1035 








1040 


Phe 


lie 


Thr 


Lys 


Glu 


Gin 


Gly 


Gin 


Pro 


Lys 


Asp 


Met 


Cys 


Glu 


Arg 


Leu 










1045 








1050 








1055 


Met 


Ala 


Asn 


Ser 


Lys 


Ala 


Lys 


Gin 


Glu 


Lys 


Asp 


Ala 


He 


Ser 


Ser 


Lys 








1060 








1065 








1070 




Thr 


Asp 


Lys 


Met 


Met 


Gly Ala 


He 


Thr 


Leu 


Thr 


Lys 


Lys 


His 


Val 


Leu 






1075 








1080 








1085 






Lys 


Asn 


Ala 


Val 


Ser 


Asn 


Leu 


Val 


Asp 


Thr 


Ser 


He 


He 


Lys 


Glu 


Thr 




1090 








1095 








1100 








Lys 


Lys 


Asn 


Asn 


Asn 


Asn 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Thr 


Ser 


Leu 


Ala 


1105 








1110 








1115 








1120 


Ala 


Ala 


Ala 


Ala 


Val 


Glu 


Asn 


Ser 


Val 


Pro 


Ala 


Leu 


Arg 


Val 


Glu 


Val 










1125 








1130 








1135 


Lys 


Phe 


Val 


Val 


Leu 


Asn 


Met 


Asp 


Leu 


Ser 


Asp 


He 


Ser 


His 


Glu 


Lys 








1140 








1145 








1150 




Thr 


lie 


Ser 


His 


Lys 


Tyr 


Arg 


Gin 


Gin 


Leu 


He 


Asn 


Ala 


He 


Lys 


Thr 






1155 








1160 








1165 






Arg 


Ser 


Thr 


Pro 


Leu 


Phe 


Asp 


Lys 


Phe 


Thr 


Asp 


Arg 


Lys 


He 


Leu 


Arg 




1170 








1175 








1180 








Ala 


Ala 


Glu 


Ser 


Pro 


Arg 


Ala 


Leu 


Thr 


Thr 


He 


Leu 


Leu 


Asp 


Glu 


Lys 


1185 








1190 








1195 








1200 


Lys 


Lys 


Val 


Thr 


Arg 


Ala 


Lys 


Ser 


He 


Thr 


Leu 


Tyr 


Gin 


Gly 


Gin 


Asn 



1205 1210 1215 



Val He Phe Thr Thr Ser Asn Arg Met He His Gly Thr Gin Glu Arg 

1220 1225 1230 

Phe Val Thr Lys Asp Thr Gly Val Val Thr Asn Leu Met Tyr Lys Asn 

1235 1240 1245 

Gly Glu Leu Thr Val Phe Val Tyr Val Glu Arg Leu Gly Gin Lys Cys 

1250 1255 1260 

Leu Glu He Lys Glu Gly Arg Gin He He Gly Asn Pro Asn He Lys 
1265 1270 1275 1280 

Asn Gly Gly Phe Gly Asn Asn Val Tyr Val Gin Tyr Leu Pro Phe Glu 

1285 1290 1295 

Ser Ser Gin Ala Met Thr He Tyr Ser Cys Gin Gly His Thr Phe Phe 

1300 1305 1310 

Arg Asp Thr He Val Asp Leu Ser Gly Ala Ser Thr Gin Asp Ala Tyr 

1315 1320 1325 

Val Ala Val Thr Arg Asn Ser Asn Pro Gin Asn Leu Phe He He Gin 

1330 1335 1340 

Asn His Ser Val Glu Arg Gly Asn Leu Cys Asn He Lys Cys Ala Met 
1345 1350 1355 1360 

Ser Lys Asp Lys Ala Tyr Thr Met Pro He Gly Gly He Ala Asp Phe 

1365 1370 1375 

Asn Gly Ser Asp Phe He Asn His Asp Thr Val Ser Val Ser Arg Glu 

1380 1385 1390 

Val Ala Glu Ser Ser Ala Ala Met Asp Asp Asp Tyr Asn Gly Asp Gly 
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1395 1400 1405 

Gly Val Thr Met Tyr Ser Ala Tyr Asp Pro Ser Lys Asp Val Val Ala 

1410 1415 1420 

Ala Ala Glu Glu Phe lie Leu Ser Arg Ser Gly Lys Ser Leu Ser Phe 
1425 1430 1435 1440 

Asn Ala Ser Trp Met Ala Asn Thr Ala Lys Val lie Gin Gin His Gly 

1445 1450 1455 

Leu Glu Thr Glu Leu Lys Asn lie Arg Asp Phe Phe Phe Gly Val Asn 

1460 1465 1470 

Asn Gly Asp Val Ala Lys His Tyr Glu Lys Leu Cys Asn Lys Lys Met 

1475 1480 1485 

lie Glu Leu Tyr Thr Ala lie Val Arg Ser lie Thr His Tyr Ser lie 

1490 1495 1500 

Ala Ser Gly lie Val Lys Gin Pro Ser Ser Lys Leu Cys Glu Glu Tyr 
1505 1510 1515 1520 

Glu Thr Lys Gin Lys Asn Lys Lys Asp Tyr lie Lys He His Pro Val 

1525 1530 1535 

Phe Val Asn Arg Ala Pro Lys Glu Ser Thr He Glu Met Leu Leu Phe 

1540 1545 1550 

Asp He Ala Pro His Asn Lys Ala Thr He Val Phe Gin Phe Tyr Val 

1555 1560 1565 

His Tyr He Phe Leu Val Tyr Glu Lys Leu Asn Val Leu Asn Ser Ser 

1570 1575 1580 

Phe Ala Phe Leu Pro Ser Pro Asn Pro Cys Leu Asn Gin Tyr Val Arg 
1585 1590 1595 1600 

Pro Lys Ser He Thr Thr Asn Ser Thr His Val Pro Asn Leu Gly Tyr 

1605 1610 1615 

Glu Ser Lys Asp Phe Ala His Cys Lys Asp Gly Gly Glu Arg Asp Val 

1620 1625 1630 

Lys Leu Arg Leu Pro He Thr Ser Ala Asp Glu Phe Ser Asn Asn He 

1635 1640 1645 

Glu Gly He Leu Lys Lys Val Ser Asp Thr Ser Asn Gin Asn Lys Val 

1650 1655 1660 

Asn Lys Tyr Met Asp Val Val Cys Lys Ser Met Gin His Asn Leu Arg 
1665 1670 1675 1680 

Arg Thr Gly Lys Phe Cys Arg Pro Thr Glu Thr Cys Gly Leu Ser Lys 

1685 1690 1695 

His Gly Ser He Val Thr Ser Thr Cys Thr Ala Gin Glu Lys Gly Glu 

1700 1705 1710 

Asn He His Val Asp Ala Glu Lys Gly Trp Leu Cys Met Ser Asp Glu 

1715 1720 1725 

Ala Asn Val Tyr Cys Met Leu Met Phe Met Ser Lys He Ala Ala Ala 

1730 1735 1740 

Ser Gly Val Ser Glu Phe Pro He Lys Asp Lys Ser He Ser Asn Pro 
1745 1750 1755 1760 

Glu Thr Pro Ser Asp Thr He Ser Leu Leu Ala Pro Arg Lys Thr He 

1765 1770 1775 

Ser Pro Thr Asn Asn Leu His Phe Ser Met Ser Glu Asp Val Leu Phe 

1780 1785 1790 

Cys Gly Gin Val His Pro Met Lys Arg Val Gin Phe Ser Leu His Val 

1795 1800 1805 

Lys Arg Thr Gly Gly Ala Leu Lys Ser Thr Phe Glu Glu Glu Glu Gly 

1810 1815 1820 

Leu Pro Thr Lys He Phe Ser Pro Asn Phe Ala Thr Tyr Pro Leu Phe 
1825 1830 1835 1840 

Lys Lys Cys Lys Met Tyr Gly Ala He He He Ala Met Thr Glu Met 

1845 1850 1855 

Gin Gly His Glu Phe Ala Lys Tyr Ser Thr Leu Asp He Arg Lys Ser 

1860 1865 1870 

Met Phe Thr Gly Val Gly Thr Val Val Asp Leu Glu Lys He Ser Gly 
1875 1880 1885 
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Glu Gly Asn Glu Val Met Asp Lys Val Asp Lys Phe He Val Lys Asn 

1890 1895 1900 

Val Ser Asn He Leu Phe Lys Glu Gin Gly Lys Arg Val Ser Phe Phe 
1905 1910 1915 1920 

Val Ser Cys Ala He His 
1925 



<210> 14 
<211> 768 
<212> DNA 
<213> SHRIMP 



<400> 14 

atggctgggg 

aataaaatgc 

actttaccaa 

ttccctgatc 

gcatgggacg 

caattgattg 

gatgaccaag 

aatcacgggc 

ttttctctaa 

ttcaataatc 

aaggatatcg 

tccgatatag 

aaaccagt tg 



tagatttgta 
agggcaacag 
tgtctgaaga 
cgaaatcggc 
tactaaaaac 
aaaatcttca 
ggaggtatga 
taatcagaac 
caattgagaa 
aaacggataa 
caattaagcc 
ttacaaagaa 
atcacctttt 



tggaggacat 
agggaaaatt 
caacattagt 
attaactttt 
gccagctagc 
catgagtgat 
aaccagctac 
actggctaag 
ttttgaagta 
atcttgcttt 
tgaaaaaaga 
tacgcgtaat 
attgtactgc 



attaaaccct 
agagcactaa 
gcctgggtaa 
tttgtaccga 
gttgaaattg 
tttacagtgg 
aatttatctc 
gcccaagata 
gataatggga 
gtaaagatct 
ggttcatctg 
gcaaaatttt 
atggttacat 



acggagagac 
taaatgaaaa 
ctgaggtggc 
acaagagcct 
atatagggaa 
caatttttag 
cttctatggg 
ttgtagtctg 
aaaaacggct 
tccacgaaat 
ctgtatggga 
cattacgata 
atttttaa 



tgtattcaat 
agcagcagca 
tgcggatgtt 
caatgcgttt 
gagaattcct 
agtaaaatgt 
aggaaaaata 
gaagagggat 
cgattttttg 
ggaatctgaa 
tgaagtatat 
tagaaatgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

768 



<210> 15 
<211> 255 
<212> PRT 
<213> SHRIMP 



<400> 15 
Met Ala Gly Val 
1 

Thr Val Phe Asn 
20 

Leu He Asn Glu 
35 

He Ser Ala Trp 
50 

Lys Ser Ala Leu 
65 

Ala Trp Asp Val 

Lys Arg He Pro 
100 

Val Ala He Phe 
US 

Ser Tyr Asn Leu 
130 

He Arg Thr Leu 
145 

Phe Ser Leu Thr 

Leu Asp Phe Leu 
180 

lie Phe His Glu 
195 

Lys Arg Gly Ser 



Asp Leu Tyr Gly 
5 

Asn Lys Met Gin 

Lys Ala Ala Ala 
40 

Val Thr Glu Val 
55 

Thr Phe Phe Val 
70 

Leu Lys Thr Pro 
85 

Gin Leu He Glu 

Arg Val Lys Cys 
120 

Ser Pro Ser Met 
135 

Ala Lys Ala Gin 
150 

He Glu Asn Phe 
165 

Phe Asn Asn Gin 

Met Glu Ser Glu 
200 

Ser Ala Val Trp 



Gly 


His 


He 


Lys 




10 






Gly 


Asn 


Arg 


Gly 


25 








Thr 


Leu 


Pro 


Met 


Ala 


Ala 


Asp 


Val 








60 


Pro 


Asn 


Lys 


Ser 






75 




Ala 


Ser 


Val 


Glu 




90 






Asn 


Leu 


His 


Met 


105 








Asp 


Asp 


Gin 


Gly 


Gly 


Gly 


Lys 


He 








140 


Asp 


He 


Val 


Val 






155 




Glu 


Val 


Asp 


Asn 




170 






Thr 


Asp 


Lys 


Ser 


185 








Lys 


Asp 


He 


Ala 


Asp 


Glu 


Val 


Tyr 



Pro Tyr Gly Glu 
15 

Lys He Arg Ala 
30 

Ser Glu Asp Asn 
45 

Phe Pro Asp Pro 

Leu Asn Ala Phe 
80 

lie Asp He Gly 
95 

Ser Asp Phe Thr 
110 

Arg Tyr Glu Thr 

125 

Asn His Gly Leu 

Trp Lys Arg Asp 
160 

Gly Lys Lys Arg 
175 

Cys Phe Val Lys 
190 

He Lys Pro Glu 
205 

Ser Asp He Val 
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210 215 220 

Thr Lys Asn Thr Arg Asn Ala Lys Phe Ser Leu Arg Tyr Arg Asn Glu 
225 230 235 240 

Lys Pro Val Asp His Leu Leu Leu Tyr Cys Met Val Thr Tyr Phe 
245 250 255 



<210> 16 
<211> 348 
<212> DNA 
<213> SHRIMP 



<400> 16 

atggcttcgc 

ttaataacaa 

tctctcctgg 

ggagtgggcg 

acaccagatg 

cccacccgtc 



cattggtggc 
taattgtcat 
cgggcacagc 
gtgcattcaa 
gtggggccaa 
gttccatatt 



gtctttagga ggaggaaaaa atatactttt tggtttactt 60 
tgttattgct gtaataataa tcaaggctcc tctcttagca 120 
actggccggc acgatcgctt ctgcgctggg atcaatacca 180 
gaaagccttt ggaaaaggaa aggggaaagg aggaccaaaa 24 0 
gaaaactaac caaaaaccaa agaagggtaa gaaaaaaccc 300 
caagaggata cctaaaataa aattttag 34 8 



<210> 17 
<211> 111 
<212> PRT 
<213> SHRIMP 



<400> 17 



Met 


Asp 


Leu 


Val 


Ala 


Ser 


Leu 


Gly 


Gly 


Gly 


Lys 


Asn 


He 


Leu 


Phe 


Gly 


1 








5 










10 










15 




Leu 


Leu 


Leu 


He 
20 


Thr 


He 


He 


Val 


He 
25 


Val 


He 


Ala 


Val 


He 
30 


He 


He 


Lys 


Ala 


Pro 


Leu 


Leu 


Ala 


Ser 


Leu 


Leu 


Ala 


Gly 


Thr 


Ala 


Gly 


Thr 


He 




35 










40 










45 








Ala 


Ser 
50 


Ala 


Leu 


Gly 


Ser 


He 
55 


Pro 


Gly 


Val 


Gly 


Gly 
60 


Ala 


Phe 


Lys 


Lys 


Ala 


Phe 


Gly 


Lys 


Gly 


Lys 


Gly 


Lys 


Gly 


Gly 


Pro 


Lys 


Thr 


Pro 


Asp 


Gly 


65 










70 










75 










80 


Gly 


Ala 


Lys 


Lys 


Thr 
85 


Asn 


Gin 


Lys 


Pro 


Lys 
90 


Lys 


Gly 


Lys 


Lys 


Lys 
95 


Pro 


Pro 


Thr 


Arg 


Arg 
100 


Ser 


He 


Phe 


Lys 


Arg 
105 


He 


Pro 


Lys 


He 


Lys 
110 


Phe 





<210> 18 

<211> 687 

<212> DNA 

<213> SHRIMP 



<400> 18 

atggtctctt 

ctccccacca 

gaacaagaag 

aaggaggaag 

gatggtgaag 

gacgacgaca 

aacatcaaac 

gaatttgagg 

ctgatcaacc 

gacatggaga 

caggctgtag 

agtttgattg 



ctagaacatc 
agaggaagag 
aactggtaga 
aacacaaaga 
aagaggaata 
ctgcagagaa 
tgaaaaaccc 
tacaaaatac 
gtgaagccgt 
ttggatctct 
tacagaaagt 
atgatccttg 



aacaacatct 
ggagccagaa 
ggactcgtca 
aacacatcac 
tgaggaagag 
aatggaaaat 
cctaagggaa 
tgtcaatttt 
ttctaggtgg 
tacagttaac 
gaaggaagat 
ttggtaa 



tcatctgcag 
gaagtaaagg 
agtaacaagc 
ctctccctcc 
gaggatgagg 
cttttggtgc 
catgacatgg 
agttttggag 
ggtaatacac 
cagttgctcc 
ataacccctt 



tagcagccac 
tgaaagtgga 
gccccagaat 
catgtaaaga 
aagaatatga 
aactggacaa 
cagtttcaca 
tactatctga 
ccccaccaaa 
acaagtgtga 
ctattgaagt 



ctctactctt 
agtaaaaatg 
taaggaagag 
agaagaagac 
agacagagtg 
tactaccaaa 
ctatgagcat 
tattgggttc 
agagtttggc 
taattttgta 
tacaatagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

687 
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<210> 19 
<211> 228 
<212> PRT 
<213> SHRIMP 



<400> 19 



wet. 


vai 






Arg 


l nr 


Ser 


x nr 


i nr 


Car 


Ser 


Ser 


nla 


Val 


Ala 


nlo 


JL 


















1 fi 
X KJ 










1 S 




1 11 X 


Ser 


i nr 


Leu 


Leu 


Pro 


± nr 


Lys 


Arg 


Lys 


Arg 




Pro 


V7-LU 


UlU 


val 


















^£ D 
















Lys 


vai 


Lys 


VdX 




va x 


Lys 




VjlU 


fin 


oi.U 




Leu 


val 


ulU 


Asp 






JO 




























C A V 




oer 


Asn 


Lys 


Arg 


tro 


Arg 


Tin 

j. xe 


Lys 


<~ 1 
blu 




Lys 


blU 


u±U 


ulu 




sn 










55 










fin 










nis 


T W CT 




Thr 


His 


His 


Leu 


Ser 


Leu 


Pro 


Cys 


Lys 




CI ii 


SZ"\ ii 


Asp 


65 










70 










75 










80 


Asp 


Gly 


Glu 


Glu 


Glu 


Glu 


Tyr 


Glu 


Glu 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Tyr 










85 










90 










95 




Glu 


Asp 


Arg 


Val 


Asp 


Asp 


Asp 


Thr 


Ala 


Glu 


Lys 


Met 


Glu 


Asn 


Leu 


Leu 








100 










105 










110 






Val 


Gin 


Leu 


Asp 


Asn 


Thr 


Thr 


Lys 


Asn 


He 


Lys 


Leu 


Lys 


Asn 


Pro 


Leu 






115 










120 










125 








Arg 


Glu 


His 


Asp 


Met 


Ala 


Val 


Ser 


His 


Tyr 


Glu 


His 


Glu 


Phe 


Glu 


Val 




130 










135 










140 










Gin 


Asn 


Thr 


Val 


Asn 


Phe 


Ser 


Phe 


Gly 


Val 


Leu 


Ser 


Asp 


He 


Gly 


Phe 


145 










150 










155 










160 


Leu 


He 


Asn 


Arg 


Glu 


Ala 


Val 


Ser 


Arg 


Trp 


Gly 


Asn 


Thr 


Pro 


Pro 


Pro 










165 










170 










175 




Lys 


Glu 


Phe 


Gly Asp Met 


Glu 


He 


Gly 


Ser 


Leu 


Thr 


Val 


Asn 


Gin 


Leu 








180 










185 










190 






Leu 


His 


Lys 


Cys 


Asp 


Asn 


Phe 


Val 


Gin 


Ala 


Val 


Val 


Gin 


Lys 


Val 


Lys 






195 










200 










205 








Glu 


Asp 


He 


Thr 


Pro 


Ser 


He 


Glu 


Val 


Thr 


He 


Asp 


Ser 


Leu 


He 


Asp 



210 215 220 

Asp Pro Cys Trp 
225 



<210> 20 
<211> 1698 
<212> DNA 
<213> SHRIMP 

<400> 20 

atggcaaggt ctgtaggttt attatctgtg acacccgagt atgacacatt taagtatatc 60 

aaaatggaag aattcaagac tttaaaggta aaaaatggat tcactatttc tggtgaaaac 120 

cctgacaaat acgaacatat tctattatca ttcaaatctg ttgatagggt tacaaaaagt 180 

gaattaaggg aeggactata tatagtccgt ctaaaagata aggaagtact tcacataaaa 240 

aacggagtgc atagactgag acaattgaca ggegacaata ctcttcaagt eggattaaag 300 

tatactcaca atctccccag gttagggagt ttattgcaag atgatgggtg tgaggattat 360 

ggggaaaagt ggaacgaatc actccccatt gatatgeaaa atatcaacaa aatcgttaaa 420 

gaaaaggece ttcttagtga caaaaacttt aaattttctc ctctctacag gcttttacac 4 80 

gaaagacttt etaatgeage tgtgaaaaaa tgtgactata tgataataac cactgacttt 540 

ttagtagggt gtggctacac gecgagceat tgccctagaa cacttegtaa catggaacaa 600 

ttattggtgg aacaatgtgg cttttcttcc cgcatatcag tgtatgatat atgtgatagg 660 

ttaacataca aaggggecta cattgeaaac ccaatcacag gcagttactc caacatgtgc 720 

ctaattgttc caatggataa actaggtttg attttctaca acagtacaca cccatcagct 780 

aaaagtattg gaaattatat gtcatctctt ttcaatgcaa cegtcatata cgcaaatgaa 840 

agggataatt tacagatgga taatttcaga agagaaataa agtttgcaga gaatgaagta 900 

aatatgaagg aagaagaact gaaggaattg aggaaaegtt gtgccgtctc tgaagaacaa 960 

aggatttctt tgagggatgt gcacaaaaaa tcatcgattg cgacatcccg ctatgatggt 1020 
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ggagcctgct 
ggaaatggtt 
tataaaaaga 
gatgccccta 
cgcaccagtg 
atggtcagag 
caaaatggtg 
aaaagggaag 
attttcaata 
ggaggagaaa 
aggaaaagag 
gacataaaag 



tagtgtttgc 
ccttttattc 
gagatgtggg 
tatgtattag 
gcaatagtat 
aggtaacagg 
ccgaaatcaa 
aaaacttgcc 
cgtcaagggc 
cttcagaaga 
aagaaaatga 
tttgctaa 



tttttctgac 
tgcaacagaa 
agaacgtagg 
ggacagtgta 
aacctttgca 
aacagacatc 
tgtgattgat 
cgaagacgaa 
aatttccatg 
tggagatgga 
agggaatgaa 



cgagatttct 
gaaggaatca 
ccccgtttga 
cgaaaccact 
gtccctccag 
aaaattttca 
ccgacatcta 
cggaagcgtt 
tataatgaag 
aacggcagca 
tatgtagtcc 



ccttgctatg 
ggtatgtctc 
tcatgtccat 
tcaagacgag 
gagaaaggga 
tggataatgg 
aggaatacaa 
tgcggcgaga 
aacgtggaga 
ccagcagtaa 
ttttgaacaa 



cagaaccaac 
ttcacctgaa 
aactggatcg 
acttttctcc 
actaatggaa 
aaaagtatat 
agagttactc 
acgacgcatg 
tggaggaagt 
aggagagaaa 
ggcttgtaaa 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1698 



<210> 21 
<211> 563 
<212> PRT 
<213> SHRIMP 



<400> 21 



Met 


Ala 


Arg 


Ser 


1 








Phe 


Lys 


Tyr 


He 








20 


Gly 


Phe 


Thr 


He 






35 




Leu 


Ser 


Pne 


Lys 




50 






Gly 


lie 


Val 


Arg 


65 








vax 


HIS 


Arg 


L»eu 


Leu 


Lys 


Tyr 


Thr 








100 


Asp 


Gly 


Cys 


Glu 






115 




Asp 


Met 


Gin 


Asn 




130 






Asp 


Lys 


Asn 


Phe 


145 








Leu 


Ser 


Asn 


Ala 


Asp 


Phe 


Leu 


Val 








180 


Leu 


Arg 


Asn 


Met 






195 




Arg 


lie 


Ser 


Val 




210 






Tyr 


lie 


Ala 


Asn 


225 








Val 


Pro 


Met 


Asp 


Ser 


Ala 


Lys 


Ser 








260 


Val 


He 


Tyr 


Ala 






275 




Arg 


Glu 


He 


Lys 




290 






Leu 


Lys 


Glu 


Leu 


305 








Ser 


Leu 


Arg 


Asp 



Val 


Gly 


Leu 


Leu 


5 








Lys 


Met 


Glu 


Glu 


Ser 


Gly 


Glu 


Asn 








40 


Ser 


Val 


Asp 


Arg 






55 




Leu 


Lys 


Asp 


Lys 




70 






Arg 


uin 


Leu 


TVs v 

x nr 


85 








His 


Asn 


Leu 


Pro 


Asp 


Tyr 


Gly 


Glu 








120 


He 


Asn 


Lys 


He 






135 




Lys 


Phe 


Ser 


Pro 




150 






Ala 


Val 


Lys 


Lys 


165 








Gly 


Cys 


Gly 


Tyr 


Glu 


Gin 


Leu' 


Leu 








200 


Tyr 


Asp 


He 


Cys 






215 




Pro 


He 


Thr 


Gly 




230 






Lys 


Leu 


Gly 


Leu 


245 








He 


Gly 


Asn 


Tyr 


Asn 


Glu 


Arg 


Asp 








280 


Phe 


Ala 


Glu 


Asn 






295 




Arg 


Lys 


Arg 


Cys 




310 






Val 


His 


Lys 


Lys 



325 



Ser 


Val 


Thr 


Pro 




10 






Phe 


Lys 


Thr 


Leu 


25 








Pro 


Asp 


Lys 


Tyr 


Val 


Thr 


Lys 


Ser 








60 


Glu 


Val 


Leu 


His 






75 




Gly 


Asp 


Asn 


Thr 




90 






Arg 


Leu 


Gly 


Ser 


105 








Lys 


Trp 


Asn 


Glu 


Val 


Lys 


Glu 


Lys 








140 


Leu 


Tyr 


Arg 


Leu 






155 




Cys 


Asp 


Tyr 


Met 




170 






Thr 


Pro 


Ser 


His 


185 








Val 


Glu 


Gin 


Cys 


Asp 


Arg 


Leu 


Thr 








220 


Ser 


Tyr 


Ser 


Asn 






235 




He 


Phe 


Tyr 


Asn 




250 






Met 


Ser 


Ser 


Leu 


265 








Asn 


Leu 


Gin 


Met 


Glu 


Val 


Asn 


Met 








300 


Ala 


Val 


Ser 


Glu 






315 




Ser 


Ser 


He 


Ala 




330 







Glu 


Tyr 


Asp 


Thr 






15 




Lys 


Val 


Lys 


Asn 




30 






Glu 


His 


He 


Leu 


45 








Glu 


Leu 


Arg 


Asp 


He 


Lys 


Asn 


Gly 








80 


Leu 


Gin 


val 


Gly 






95 




Leu 


Leu 


Gin 


Asp 




110 






Ser 


Leu 


Pro 


He 


125 








Ala 


Leu 


Leu 


Ser 


Leu 


His 


Glu 


Arg 








160 


He 


He 


Thr 


Thr 






175 




Cys 


Pro 


Arg 


Thr 




190 






Gly 


Phe 


Ser 


Ser 


205 








Tyr 


Lys 


Gly 


Ala 


Met 


Cys 


Leu 


lie 








240 


Ser 


Thr 


His 


Pro 






255 




Phe 


Asn 


Ala 


Thr 




270 






Asp 


Asn 


Phe 


Arg 


285 








Lys 


Glu 


Glu 


Glu 


Glu 


Gin 


Arg 


He 








320 


Thr 


Ser 


Arg 


Tyr 






335 
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Asp 


Gly 


Gly 


Ala 








.34 U 


Leu 


Leu 


Cys 


Arg 






■jcc 
J J J 




Glu 


Gly 


He 


Arg 




J /U 






Gly 


Glu 


Arg 


Arg 


385 








Pro 


He 


Cys 


He 


Phe 


Ser 


Arg 


Thr 










Glu 


Arg 


Glu 


Leu 






435 




Lys 


He 


Phe 


Met 




4 50 






Asn 


Val 


He 


Asp 


I O J 








Glu 


Glu 


Asn 


Leu 


Arg 


Met 


He 


Phe 








500 


Arg 


Gly 


Asp 


Gly 






515 




Asn 


Gly 


Ser 


Thr 




530 






Glu 


Gly 


Asn 


Glu 


545 








Lys 


Val 


Cys 





Cys 


Leu 


Val 


Phe 


ml, _ 

Inr 


Asn 


Va-Ly 


Asn 








J ou 


Tyr 


val 


Ser 


C *n w 

£>er 






J /O 




Pro 


Axg 


Leu 


He 










Arg 


Asp 


Ser 


val 


405 








Ser 


Gly 


Asn 


Ser 


Met 


Glu 


Met 


Val 








A A f\ 
H 4U 


Asp 


Asn 


Gly 


Lys 






A C C 

455 




Pro 


Thr 


Ser 


Lys 




470 






Pro 


Glu 


Asp 


Glu 


485 








Asn 


Thr 


Ser 


Arg 


Gly 


Ser 


Gly 


Gly 








520 


Ser 


Ser 


Lys 


Gly 






535 




Tyr 


Val 


Val 


Leu 




550 







Al 3 


Pne 


oer 


Asp 


345 








Gly 


Ser 


Phe 


Tyr 


Pro 


Glu 


Tyr 


Lys 








Jo U 


Met 


Ser 


He 


Thr 






5 




Arg 


Asn 


His 


Phe 




410 






He 


Thr 


Phe 


Ala 


425 








Arg 


Glu 


Val 


Thr 


Val 


Tyr 


Gin 


Asn 








460 


Glu 


Tyr 


Lys 


Glu 






475 




Arg 


Lys 


Arg 


Leu 




490 






Ala 


He 


Ser 


Met 


505 








Glu 


Thr 


Ser 


Glu 


Glu 


Lys 


Arg 


Lys 








540 


Leu 


Asn 


Lys 


Ala 






555 





Arg 


Asp 


Phe 


Ser 




350 






Ser 


MJ.a 


Thr 


Glu 


365 








Lys 


Arg 


Asp 


Val 


Gly 


Ser 


Asp Ala 








400 


Lys 


Thr 


Arg 


Leu 






415 




Val 


Pro 


Pro 


Gly 




430 






Gly 


Thr 


Asp 


He 


445 








Gly Ala 


Glu 


He 


Leu 


Leu 


Lys 


Arg 








480 


Arg 


Arg 


Glu 


Arg 






495 




Tyr 


Asn 


Glu 


Glu 




510 






Asp 


Gly 


Asp 


Gly 


525 








Arg 


Glu 


Glu 


Asn 


Cys 


Lys 


Asp 


He 








560 



<210> 22 
<211> 630 
<212> DNA 
<213> SHRIMP 



<400> 22 

atgattgtat 

agaactgccg 

tatt taccca 

cgtcccaggc 

aggcacgaca 

tatggagaag 

gtgatggttt 

gatgcacata 

gaagttatcc 

aagaattacg 

tgtaagttta 



tcgtcgaagg 
gaaagggcaa 
tcttcccatg 
tcgtggatgg 
cagaacagtt 
aattcaaagc 
acagggaaga 
actggtttat 
ccagaaatgt 
tgtaccgtgt 
ttgctgagaa 



atctccccta 
acagtctttc 
gaccattcaa 
aatgtttgga 
tccagaatca 
atgtgtcgcc 
agatgtcaag 
tgacgttctc 
gtacaaaggg 
tggtaaatgc 
gattatttga 



actgggaaaa 
ctcaacttca 
gagcatttgc 
tcatcattga 
aaaattggat 
gagtgggtca 
aaaatggaac 
aaggaggaga 
aatatttgta 
acgaatagta 



cttcatgggt 
tgtacaccaa 
gtgcatccga 
actttttcac 
tgagggagta 
aatacaagcc 
caattattca 
gagcgttgtt 
gttcatgttt 
ttgtacactg 



ggataacatg 
ttatagggac 
ttaccaggaa 
aggcatgtgg 
tttggagatg 
agttttccat 
agaattgaat 
tgtaaagatt 
ctcgacgagt 
tgatatgaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



<210> 23 
<211> 209 
<212> PRT 
<213> SHRIMP 



<400> 23 

Met He Val Phe Val Glu Gly Ser Pro Leu Thr Gly Lys Thr Ser Trp 

1 5 10 15 

Val Asp Asn Met Arg Thr Ala Gly Lys Gly Lys Gin Ser Phe Leu Asn 

20 25 30 

Phe Met Tyr Thr Asn Tyr Arg Asp Tyr Leu Pro He Phe Pro Trp Thr 
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35 




Tip 


m n 

U11 1 


OJ.U 


Hie: 




50 






V ClJ- 


Asp 


J. y 


i ltr l. 


65 








Arg 


His 


Asp 


Thr 


Tyr 


Leu 


ulU 


net 








±\JV 


val 


Lys 


Tyr 


Lys 






115 




Val 


Lys 


Lys 


Met 




130 






Trp 


Phe 


lie 


Asp 


145 








Glu 


Val 


He 


Pro 


Phe 


Ser 


Thr 


Ser 








180 


Ser 


lie 


Val 


His 






195 





He 



40 

Leu Arg Ala Ser 
55 

Phe Gly Ser Ser 
70 

Glu Gin Phe Pro 
85 

Tyr Gly Glu Glu 

Pro Val Phe His 
120 

Glu Pro lie He 
135 

Val Leu Lys Glu 
150 

Arg Asn Val Tyr 
165 

Lys Asn Tyr Val 

Cys Asp Met Lys 
200 







m n 


V3 1U 








D U 


Lieu 


Asn 


Phe 


trim 






75 




Glu 


Ser 


Lys 


Tip 




90 






Phe 


Lys 


Ala 


Cys 


105 








Val 


Met 


Val 


Tyr 


Gin 


Glu 


Leu 


Asn 








140 


Glu 


Arg 


Ala 


Leu 






155 




Lys 


Gly Asn 


He 




170 






Tyr 


Arg 


Val 


Gly 


185 








Cys 


Lys 


Phe 


He 



45 

Arg Pro Arg Leu 

Thr Gly Met Trp 
80 

Gly Leu Arg Glu 
95 

Val Ala Glu Trp 
110 

Arg Glu Glu Asp 
125 

Asp Ala His Asn 

Phe Val Lys He 
160 

Cys Ser Ser Cys 
175 

Lys Cys Thr Asn 
190 

Ala Glu Lys He 
205 



<210> 24 
<211> 1347 
<212> DNA 
<213> SHRIMP 



<400> 24 

atgggaggtg 

ggagcaaaat 

gcggtaccca 

cctgtttaca 

tttagtttca 

gacgtgagaa 

gtategtett 

ctttgttctg 

tettatgeaa 

actcttcctg 

tccaatatct 

gagaaegcta 

geatgeatta 

aacatataca 

gatgeca tgg 

agatttgaac 

gatattaaca 

ttacaacctc 

agagaagtag 

gtaaaataca 

aatacagaat 

gatgttgtgg 

aaacttcccg 



aagattcgtt 
cgat tcaagt 
ctctagaaga 
gagtgectet 
atgtagatcc 
atccttcacg 
acactggaga 
aaatagcatc 
caaactttaa 
acgacaaaat 
ctatcggacc 
ccgcagtcgg 
tcccagaagg 
acatgactaa 
aagaagatga 
atga tttgaa 
gttcaagttt 
tegtttctte 
agtttcatga 
tgaaaatagt 
tgtataagaa 
gactagatac 
aagtgattgg 



cgacgacagg 
aaaggaaacg 
aaaaaataga 
cttctcttta 
acgacgattt 
aacggctgcc 
cgggatgttg 
cgatataaca 
tagtgttgaa 
gttccaacaa 
catgagtcat 
tgtcattaga 
ggttttcttt 
ccaaccaaaa 
tggtgaagaa 
aatgeaagee 
ccacaaatgt 
agaagttgtg 
caggctcccg 
atctctaaag 
aaaacaggct 
gagtgagcat 
tcaataa 



tacgattctg 
gatttggagg 
acagctttaa 
aaggatggaa 
ggaaaagtac 
attttcgtgc 
gtctgcccta 
atttataata 
gataattccc 
gtgtctattt 
gtgaaaatca 
tatggaggat 
gatgatgttg 
gaaattgtac 
gcagattata 
atgtcgtcag 
ttcctaatta 
cttaatgatc 
tctggtgcac 
gaaggcctaa 
ttaaaggtgc 
agttttggag 



atgcgctctg 
tgtaccgaat 
ggtactattc 
gtgatcccca 
cagtgaaggt 
caactggacc 
atcacaattt 
caagtagttc 
ctgtcggtat 
tttccgctac 
ageteggtta 
tattctacac 
tgggaaatca 
taaaagagee 
atttcttggg 
catatagctc 
aacctaaata 
tctcgctcaa 
aagacaactc 
aggttgtaaa 
atgttttaaa 
taatagtgtg 



ggaaaatgaa 
gcaccgcaga 
agat tggtct 
cgaaagagat 
gagacgagtt 
aggacttcat 
tataggagat 
agggeggcta 
tttattcgag 
tgaacegget 
ctatgatgaa 
ctctgtcggt 
tagctcaatg 
aagaggagag 
ttatgtagta 
agtttctatt 
taactctatt 
tacacgtggg 
atat tcgatt 
ccctataatc 
tatgacgegt 
tea tgetgea 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1347 



<210> 25 
<211> 444 
<212> PRT 
<213> SHRIMP 



<400> 25 
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Met 


Gly 


Gly 


Glu 


1 








Trn 


Glu 
uiu 


A QTt 


Glu 








20 


Glu 


Val 


xyx 


Arg 






35 




Asn 


Arg 


1 IIX 


nld 




50 






VOX 


Pro 

X; 1 U 


Leu 


C X1CS 


65 








Phe 


Asn 


Val 


Asp 


Arg 


vol 


Asp 


V ci X 








i nn 

lUv 


Th r- 
X 11 X. 


y 


Pro 


uxy 






115 




Val 


Cys 


Pro 


Asn 




130 






Ser 


Asp 


Tie 
lie 


X IIX. 


145 








Ala 


Thr 


Asn 


Phe 


I: lit: 


o i Li 


x nx 


Leu 








i an 

1 O w 






Thr 


O X Li 






l jj 




11c 


Lys 


Leu 


uxy 










Tie 
lie 


Ax g 


Tyr 


uxy 


225 








Pro 


Glu 


Gly 


Val 


Asn 


Tie 

j. ie 


Tyr 


Asn 








?fift 


Pro 


Arg 


oiy 


blU 






£ JO 




Tyr 


Asn 


pne 


T A1 1 

Leu 










bin 


a j. a 


Mec 


JCX 


305 








Ser 


Ser 


Phe 


His 


Leu 


bin 


Pro 


Leu 








^4 ft 


Asn 


i nr 


Arg 


Caly 






355 




Ala 


Gin 


Asp 


Asn 




370 






Leu 


Lys 


Glu 


Gly 


385 








Tyr 


Lys 


Lys 


Lys 


Asp 


Val 


Val 


Gly 








420 


Cys 


His 


Ala 


Ala 






435 





Asp 


Ser 


Phe 


Asp 


5 








Gly 


Ala 


Lys 


Ser 


Met 


His 


«x y 


Arg 








40 


Leu 


Arg 


x yi 


Tur 






55 




Ser 


Leu 


Lys 


Asp 




70 






Pro 




Arg 


Phe 


85 








Arg 


Asn 


Pro 


Ser 


IjSU 


His 


Val 


Car 
•JCi 








120 


Hie: 


Asn 


rue 


Tip 
1 Its 






135 




Tie 

XXC 


x yx 


Asn 


Thr 




150 






Asn 


Ser 


vol 


ul U 


165 








Pro 


Asp 


Asp 


Lys 


Pro 


Asn 


Tie 
lie 










200 


Tyr 


Tyr 


ASp 


k?l u 










« j. y 


Leu 


r I It? 


Tyr 




230 






DHe 

f ne 


rue 


Asp 


Asp 


245 








Met 


Thr 


Asn 


Gin 


Asp 


nld 


Me t- 


oXU 








ctJU 


bi y 


Tyr 


vdl 


W-a "l 
Vdl 










JCX 


Ala 


Tyr 


C2er> 
ocx 




^ 1 ft 
J1U 






Lys 


Cys 


rile 


Leu 


325 








Val 


Ser 


Ser 


Glu 


Arg 


bxu 


va i 


ri it 

uxu 








360 


Ser 


Tyr 


Ser 


He 






375 




Leu 


Lys 


Val 


Val 




390 






Gin 


Ala 


Leu 


Lys 


405 








Leu 


Asp 


Thr 


Ser 


Lys 


Leu 


Pro 


Glu 








440 



Asp 


Arg 


Tvr 
j 


Asp 




10 






He 


Gin 


Val 


Lys 


25 








Ala 


Val 


Pro 


Thr 


OCX 


Asp 




Ser 








60 


Gly 


Ser 


Asp 


Phe 






75 




oxy 


Lys 


Val 


Pro 










Arg 


Thr 


Ala 


Ala 


105 








Ser 


Tyr 


Thr 


Gly 


uxy 


ASp 


Leu 


i— y o 








14 ft 


ocx 


OCX 


OCX 


Gly 






155 




ASp 


Asn 


Ser 


Pro 




1 > u 






Met- 
ric? L. 


Phe 

XT1 Jfc; 


Gin 


Gin 


185 








He 


Gly 


Pro 


Met 


bXU 


Asn 


Ala 


Thr 








220 


Th v 

x nr 


Ser 


Val 


Gly 






235 




vai 


Val 


Gly Asn 




250 






Pro 


Lys 


Glu 


He 


265 








Glu 


Asp Asp 


Gly 


Arg 


Phe 


Glu 


His 








300 


Qe y* 


Val 


Ser 


He 






315 




Tie 

x ie 


Lys 


Pro 


Lys 




330 






vai 


Val 


Leu 


Asn 


345 








Phe 


His 


Asp Arg 


Val 


Lys 


Tyr 


Met 








380 


Asn 


Pro 


He 


He 






395 




Val 


His 


Val 


Leu 




410 






Glu 


His 


Ser 


Phe 


425 








Val 


He 


Gly 


Gin 



Ser 


ASD 


Ala 


Leu 






15 




Glu 


Thr 


ASD 


Leu 




30 






Leu 


Glu 


Glu 


Lys 


45 








Pro 


Val 


Tyr 


Arg 


Arg 


Asp 


Phe 


Ser 








80 


Val 


Lys 


Val 


Arg 










lie 


Phe 


Val 


Pro 




110 






ASp 


Gly 


Met 


Leu 


125 








Ser 


Glu 


He 


Ala 


Arg 


Leu 


Ser 


Tyr 

i yr 








160 


Val 


Gly 


He 


Leu 






i / j 




Val 


Ser 


He 


Phe 




i ^ v 






Ser 


His 


vai 


Xjys 


205 








Ala 


Val 


Gly 


Val 


AT a 
nXd 


Vb#y5 


Tie 

lie 


Tie 

1 16 








?4ft 


nxs 


•JCX 


Ser 


Met" 

net 










vai 


Leu 


Lys 


(^1 ii 




P7ft 






fZI n 

UlU 


fZt ii 
Ul IX 


Ala 
Ala 


Asp 


285 








Asp 


Leu 


Lys 


Met 


Asp 


Tie 
116 


Asn 


OCX 








320 


Tyr 


A cn 
nail 


ocx 


Tip 
1 XtS 






335 




Asp 


T 

Leu 


Ser 


Leu 




350 






Leu 


Pro 


Ser 


Gly 


365 








Lys 


He 


Val 


Ser 


Asn 


Thr 


Glu 


Leu 








400 


Asn 


Met 


Thr 


Arg 






415 




Gly 


Val 


He 


Val 




430 







<210> 26 
<211> 2352 
<212> DNA 
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<213> SHRIMP 



<400> 26 

atggcagcag 

caactcaccc 

aagatctttg 

aaccaaggtg 

aattgcatat 

aaaaagtata 

gccgtgatcc 

gagattattg 

ggaatgatga 

tcaactaaac 

gcagctgaag 

tttcctggtt 

ggagatgatg 

actcgcggtc 

tgtctcagtg 

ggagaatttg 

actaaagggt 

tgtttgagga 

tttgatgaga 

gcaggtgctg 

ttcagaaaat 

aatcttgata 

aagaaaaacg 

aacaaggagg 

agagaaaagc 

gtcgcagcca 

gacatggctg 

cgtgcagtcg 

aggatcaagc 

gaagacgcga 

aaggaggaag 

gaaaagaaca 

atgaggattc 

gaagaagaga 

gatatggtga 

aaggatgaag 

aaagaaattg 

aactattctg 

aacgatgcat 

ttcaagacat 



cagcagtctc 
cggacggaga 
gtgacaaaag 
agagttacca 
ggaatggaga 
gacttctcaa 
ctatctacgg 
acaaggaaac 
acgccaagca 
cgtttgaaaa 
ctatttatga 
tggaatctgc 
acaatatgtt 
gggtaaatag 
ttttaaagtt 
aagaaaaggc 
ctgaacaaga 
atatttgtat 
gtattttgag 
gaaatgaaga 
tgttgtcccc 
tgttgtgccc 
aagatggaaa 
aagaagacgc 
agaacaagga 
tcaacgaaaa 
tcgcagccat 
acatggcggt 
gtgcagtcga 
ggatcaagcg 
aagacgcgag 
agaaggaaga 
aacgtatagt 
aaaggacaaa 
acgaagtaca 
ccaagaatac 
taccttgttt 
ccaacaatac 
ctagattctc 
aa 



aggagagggg 
cgtgataaga 
tgtgatagag 
aatcgcatct 
gtcgggagga 
ctcacttttg 
gtcagtgtgt 
tcttcagacc 
tggcacattg 
tttccaatat 
tggattcaga 
tgatgtgtac 
gttgaacctc 
atccttgtta 
tgttgaaaaa 
cgaaacatgt 
atcttacaag 
agtatcacaa 
aaagtgtaca 
agaaatatgt 
tgtctcaatt 
ctacagtgac 
cagagtgagg 
gaggataaag 
ggaagaagac 
gaacaaggag 
caacgaaaat 
tgcagccatc 
catggctgtc 
tgcagtcgac 
gatcaagcgt 
agacgcgagg 
cgacatggca 
aagggaacaa 
gaagaaactt 
tagtaatgtt 
aggaaataat 
taagaataat 
caatattgta 



agaatctctg 
tacgactctg 
actattggac 
tctgtgttgg 
atggctctat 
gtccacaaga 
gacagagaag 
atatgtaaga 
ggaggtaatt 
gaagcaatgg 
gaccatggct 
ggaaacaatc 
atctgcaact 
gattttttaa 
cactttaaaa 
gttaattgtc 
ctttcttgtg 
cacctaagat 
cctaacttga 
ttcatgagaa 
cctcatttct 
cacacgataa 
gtcaaccaca 
cgtgtagccg 
gcgaggatca 
gaagaagacg 
aacaaggagg 
aacgaaaaga 
gcagccatca 
atggcggttg 
gcagtcgaca 
a tcaagcgta 
attgcagctg 
gagttaaggg 
gaagacatgg 
gttagtagca 
aataatgctg 
gtatttggtt 
gaaactccca 



cagatctact 
agcagtacac 
attttctcat 
aaaaattccc 
ggaaggcatt 
taaaaaattg 
aaaggcccat 
gtgatatacg 
ttttacactt 
gagctaatgc 
taaacccatc 
cagtggaaat 
atggtgtatc 
aaatgaacac 
ttgagtcaaa 
ttgatagaaa 
gacactttct 
gtgaaaaatg 
attggtggtt 
ataagaaact 
tcaaaaatag 
taccaaataa 
cagccatcag 
tcaggacatt 
agcgtgcagt 
cgaggatcaa 
aagaagacgc 
acaaggagga 
acgaaaataa 
cagccatcaa 
tggcggttgc 
taattgactt 
ccactaaaaa 
ctgatctgag 
aactagaaaa 
gtagtgttgt 
tcattggtat 
cacctcataa 
aaatgtcttt 



cctgttggaa 
caaacctagg 
ccacaaccac 
cgctctactc 
gtacagggct 
gccttcggtt 
catcatgagt 
ttctcttctt 
ctatgcccgc 
agtgctaatg 
agaatatact 
tgcaatatca 
ttatgaaaaa 
agcttcaaag 
tacaccaaaa 
taatgtgttg 
tcatgtaaaa 
tctaaaaaga 
gactatgccg 
ggttgatgat 
tagacagcga 
agaagatcca 
tgaaaagcag 
tacagccatc 
cgacatggct 
gcgtgcagtc 
gaggatcaag 
agaagacgcg 
caaggaggaa 
cgaaaataac 
agccaccaac 
gactgttgat 
ggacaagaaa 
aagggcaatg 
ggggtgtaat 
tgcctattct 
gactagcacc 
attttccttc 
caatttctcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2352 



<210> 27 
<211> 781 
<212> PRT 
<213> SHRIMP 



<400> 27 




























Met 


Ala 


Ala 


Ala 


Ala 


Val 


Ser 


Gly 


Glu 


Gly 


Arg 


He 


Ser 


Ala 


Asp 


Leu 


1 








5 










10 










15 




Leu 


Leu 


Leu 


Glu 
20 


Gin 


Leu 


Thr 


Pro 


Asp 

25 


Gly 


Asp 


Val 


He 


Arg 
30 


Tyr 


Asp 


Ser 


Glu 


Gin 
35 


Tyr 


Thr 


Lys 


Pro 


Arg 
40 


Lys 


He 


Phe 


Gly 


Asp 
45 


Lys 


Ser 


Val 


lie 


Glu 
50 


Thr 


He 


Gly 


His 


Phe 
55 


Leu 


He 


His 


Asn 


His 
60 


Asn 


Gin 


Gly 


Glu 


Ser 


Tyr 


Gin 


He 


Ala 


Ser 


Ser 


Val 


Leu 


Glu 


Lys 


Phe 


Pro 


Ala 


Leu 


Leu 


65 










70 










75 










80 


Asn 


Cys 


He 


Trp 


Asn 


Gly 


Glu 


Ser 


Gly 


Gly 


Met 


Ala 


Leu 


Trp 


Lys 


Ala 
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Leu 


Tyr 


Arg 


Ala 








100 


Lys 


i le 


Lys 


Asn 






115 




Val 


Cys 


Asp 


Arg 




130 






Lys 


Glu 


Thr 


Leu 


145 








Gly 


Met 


Met 


Asn 


Phe 


Tyr 


Ala 


Arg 








180 


Met 


Gly 


Ala 


Asn 






195 




Phe 


Arg 


Asp 


His 




210 






Ala 


Asp 


Val 


Tyr 


225 








Asp 


Asp 


Asn 


Met 


Glu 


Lys 


Thr 


Axg 








260 


Met 


Asn 


Thr 


AXa 






275 




His 


Phe 


Lys 


He 




290 






Ala 


Glu 


Thr 


Cys 


305 








Gly 


Ser 


Glu 


Gin 


Val 


Lys 


Cys 


Leu 








340 


Glu 


Lys 


Cys 


Leu 






355 




Pro 


Asn 


Leu 


Asn 




370 






Glu 


Glu 


He 


Cys 


385 








Lys 


Leu 


Leu 


Ser 


Gin 


Arg 


Asn 


Leu 








420 


Pro 


Asn 


Lys 


Glu 






435 




Val 


Asn 


His 


Thr 




450 






Ala 


Arg 


He 


Lys 


465 








Lys 


Gin 


Asn 


Lys 


Met 


Ala 


Val 


Ala 








500 


Arg 


He 


Lys 


Arg 






515 




Asn 


Lys 


Glu 


Glu 




530 






Val 


Ala 


Ala 


He 


54 5 








Lys 


Arg 


Ala 


val 



Lys 


Lys 


Tyr 


Arg 


Trp 


Pro 


Ser 


val 








l^O 


Glu 


Giu 


Arg 


Pro 






135 




Gin 


Thr 


He 


Cys 




150 






Ala 


Lys 


His 


Gly 


165 








Ser 


Thr 


Lys 


Pro 


Ala 


Val 


Leu 


Met 








200 


Gin 


Pro 


Ser 


Glu 






215 




Gly 


Asn 


Asn 


Pro 




230 






Leu 


Leu 


Asn 


Leu 


245 








Gly 


Arg 


Val 


Asn 


Ser 


Lys 


Cys 


Leu 








280 


Glu 


Ser 


Asn 


Thr 






295 




Val 


Asn 


Cys 


Leu 




310 






Glu 


Ser 


Tyr 


Lys 


325 








Arg 


Asn 


He 


Cys 


Lys 


Arg 


Phe 


Asp 








360 


Trp 


Trp 


Leu 


Thr 






375 




Phe 


Met 


Arg 


Asn 




390 






Pro 


Val 


Ser 


He 


405 








Asp 


Met 


Leu 


Cys 


Asp 


Pro 


Lys 


Lys 








440 


Ala 


He 


Ser 


Glu 






455 




Arg 


Val 


Ala 


Val 




470 






Glu 


Glu 


Glu 


Asp 


485 








Ala 


He 


Asn 


Glu 


Ala 


Val 


Asp 


Met 








520 


Glu 


Asp 


Ala 


Arg 






535 




Asn 


Glu 


Lys 


Asn 




550 






Asp 


Met 


Ala 


Val 


565 

















Leu 


Leu 


Asn 


Ser 


105 








Ala 


Val 


He 


Pro 


lie 


lie 


Met 


ber 








14 0 


Lys 


Ser 


Asp 


lie 






JL OO 




Tnr 


Leu 


Gxy 


taiy 




1 /o 






Phe 


Glu 


Asn 


T-\\_ « 

Pne 


185 








Ala 


Ala 


Glu 


Ala 


Tyr 


Thr 


Pne 


Pro 








O 1 A 
ZZU 


IF. 1 

Val 


GlU 


l ±e 


Ala 










He 


Cys 


Asn 


Tyr 




250 






Arg 


Ser 


Leu 


Leu 


265 








Ser 


Val 


Leu 


Lys 


Pro 


Lys 


Gly 


Glu 








~z r\ n 
JOU 


Asp 


Arg 


Asn 


Asn 






315 




Leu 


Ser 


Cys 


Gly 




330 






He 


Val 


Ser 


Gin 


345 








Glu 


Ser 


He 


Leu 


Met 


Pro 


Ala 


Gly 










Lys 


Lys 


Leu 


Val 






O ft c 




Pro 


His 


Phe 


Phe 




/lift 
410 






Pro 


Tyr 


Ser 


Asp 


425 








Asn 


Glu 


Asp 


Gly 


Lys 


Gin 


Asn 


Lys 








h bU 


Arg 


Thr 


Phe 


Thr 






475 




Ala 


Arg 


He 


Lys 




490 






Lys 


Asn 


Lys 


Glu 


505 








Ala 


Val 


Ala 


Ala 


He 


Lys 


Arg 


Ala 








540 


Lys 


Glu 


Glu 


Glu 






555 




Ala 


Ala 


He 


Asn 




570 







95 

Leu Leu Val His 
110 

He Tyr Gly Ser 
125 

Glu He He Asp 

Arg Ser Leu Leu 
160 

Asn Phe Leu His 
175 

Gin Tyr Glu Ala 
190 

He Tyr Asp Gly 
205 

Gly Leu Glu Ser 

He Ser Gly Asp 
240 

Gly Val Ser Tyr 
255 

Asp Phe Leu Lys 
270 

Phe Val Glu Lys 
285 

Phe Glu Glu Lys 

Val Leu Thr Lys 
320 

His Phe Leu His 
335 

His Leu Arg Cys 
350 

Arg Lys Cys Thr 
365 

Ala Gly Asn Glu 

Asp Asp Phe Arg 
400 

Lys Asn Ser Arg 
415 

His Thr He He 
430 

Asn Arg Val Arg 
445 

Glu Glu Glu Asp 

Ala He Arg Glu 
480 

Arg Ala Val Asp 
4 95 

Glu Glu Asp Ala 
510 

He Asn Glu Asn 
525 

Val Asp Met Ala 

Asp Ala Arg He 
560 

Glu Asn Asn Lys 
575 
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Glu Glu Glu Asp 
580 

Ala lie Asn Glu 
595 

Ala Val Asp Met 
610 

Glu Asp Ala Arg 
625 

lie Gin Arg He 

Lys Lys Glu Glu 
660 

Asp Leu Arg Arg 
675 

Glu Asp Met Glu 
690 

Thr Ser Asn Val 
705 

He Val Pro Cys 

Ser Thr Asn Tyr 
740 

Pro His Lys Phe 
755 

Glu Thr Pro Lys 
770 



Ala Arg He Lys 

Asn Asn Lys Glu 
600 

Ala Val Ala Ala 
615 

He Lys Arg He 
630 

Val Asp Met Ala 
645 

Glu Lys Arg Thr 

Ala Met Asp Met 
680 

Leu Glu Lys Gly 
695 

Val Ser Ser Ser 
710 

Leu Gly Asn Asn 
725 

Ser Ala Asn Asn 

Ser Phe Asn Asp 
760 

Met Ser Phe Asn 
775 



Arg Ala Val Asp 
585 

Glu Glu Asp Ala 

Thr Asn Glu Lys 
620 

He Asp Leu Thr 
635 

He Ala Ala Ala 
650 

Lys Arg Glu Gin 
665 

Val Asn Glu Val 

Cys Asn Lys Asp 
700 

Ser Val Val Ala 
715 

Asn Asn Ala Val 
730 

Thr Lys Asn Asn 
745 

Ala Ser Arg Phe 

Phe Ser Phe Lys 
780 



Met Ala Val Ala 
590 

Arg He Lys Arg 
605 

Asn Lys Lys Glu 

Val Asp Met Arg 
640 

Thr Lys Lys Asp 
655 

Glu Leu Arg Ala 
670 

Gin Lys Lys Leu 
685 

Glu Ala Lys Asn 

Tyr Ser Lys Glu 
720 

He Gly Met Thr 
735 

Val Phe Gly Ser 
750 

Ser Asn He Val 

765 

Thr 



<210> 28 
<211> 1506 
<212> DNA 
<213> SHRIMP 

<400> 28 

atggtatata aaggctttgt atctccatca 
agaccaacaa catcccctct ttcgttctct 
gatacgtatg aggatatctt acttgaacaa 
tctgctcgta aaagtgaaat agaggataaa 
aaattccatc ttgcaaaggg catagaagag 
acaattgaac ctattatttc tccaacgaca 
gagggagata tatcctcaag tcccctatat 
catgatattt atgaattgga ccttattgtt 
aatctacgca acgtttctaa actgatgaag 
gatgtgtgcc acagaaaatt tttcaacaat 
ttctcaaaaa atgtgtgtat tattcctcta 
ggggaatgca gggatttatt caacggtatt 
tattttttct accctgaaaa tactactact 
atggaaattg cagatgagga agaacaatcc 
agtagaaact ggtctggtgt ctgtttgatt 
attaatagag gagatagagg aggttctttt 
atcaaggaaa agagatgtaa aatcacagat 
gtgataactg ggtgttatac agaattgtac 
aacaggcgca aatttttgaa aatgaatggg 
gatctaatcg aattcgcgat gagtgtttct 
ttggggatgt tccagaaccg taaaatgata 
gttgtatgtg gtgagcatga tatacaaaaa 
attgcagact ataatgactt tgtaagtaag 
gaagaaaatg aagatgaaga tgcagacgct 
tcttctcctc ctgcacataa aaaatcacgt 
tgttaa 



tttatcacta ttcgtacgat gacgtccaac 60 
gaaggcttct cgctttcagg ggataagtat 120 
ttcaactgtt tcaagacatc ttctccttct 180 
actttaatct ttcaacttaa agaaggagaa 240 
ctccgtgaga ttctagacga taattctgca 300 
ttcaatgaca gaaacgaatt actaaaccac 360 
actcagataa tgaagcatat ttcaccagag 420 
ggcactgatt tgctttttgg tctaggtgtg 480 
aaaatatcgt atggtacttt aaatgtagtt 540 
aggattatag ttaatcccat ttcttcgtca 600 
ttttctgcag ctgaagaatt ttcgtctctg 660 
tgtgatgacg tagagagata tatcaactct 720 
actactacta ctgctccttc gtcgcccgaa 780 
ccaaaaacta taaagagaaa tgacaacgca 840 
tttgaagtat ttaaaaacac gtactacatt 900 
gaaaaggctg tgaagagtgc aatttcttct 960 
attaatggta ataaacctcg attggttatg 1020 
ttcaaagatg cacttaaaca gattggagaa 1080 
aattactttt ctctgattga tgaacaagca 114 0 
ggtgccgggg agaggatttt tgttaacggt 1200 
cctgtaattg atcctctcac atatgaaaat 1260 
gaagatgcta ttctttctgt aaggagagct 1320 
aacaagagag ggaagaaacg cagcgcagaa 1380 
agcagcagca gcagcagcag tcctcctcct 144 0 
cttccggatg aaggcgaaaa atgtacactc 1500 

1506 
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<210> 29 
<2X1> 499 
<212> PRT 
<213> SHRIMP 



<400> 29 



Met 


val 


Tyr 


Lys 


Giy 


t*ne 


vai 




Pro 


Cor 




lie 


Thr 

J. 111 


X xts 


i- y 


Thr 


1 






c 
D 










i n 










15 




Met 


Thr 


Ser 


Asn 


Arg 


Pro 


x nr 


mr 


ber 


Pro 


Leu 


Ser 




OCX 


Glu 


Gly 








20 


























Phe 


Ser 


Leu 


Ser 


Gly 


Asp 


Lys 


Tyr 


Asp 


inr 


Tyr 




Asp 


Tl e 


Leu 


Leu 






35 










a n 










*» .j 








Glu 


Gin 


Phe 


Asn 


Cys 


Phe 


Lys 


Tnr 


Ser 


ber 


rio 


oer 




nld 


Arg 


Lys 




50 




















sn 










Ser 


Glu 


He 


GlU 


Asp 


Lys 


l nr 


Leu 


i le 


rrne 


bin 


Leu 


Lys 




uiy 


UJ.U 


65 










/ U 










75 










fin 


Lys 


Phe 


His 


Leu 


Ala 


Lys 


Giy 


lie 


GlU 


GlU 


Leu 


Arg 




Tip 

lie 


Leu 


Asp 








Q 




















OS 




Asp 


Asn 


Ser 


Ala 


Thr 


He 


Glu 


Pro 


l le 


lie 


Ser 


Pro 


inr 


i nr 


irne 


Asn 






100 










1UO 










i i n 

J. X u 






Asp 


Arg 


Asn 


Glu 


Leu 


Leu 


Asn 


His 


Glu 


Gly 


Asp 


He 


ber 


ber 




Pro 


115 










lzu 


















Leu 


Tyr 


Thr 


Gin 


He 


Met 


Lys 


His 


Pro 


GlU 


His 


Asp 


lie 


Tyr 




Leu 




130 










1 Jo 










140 










Asp 


Leu 


He 


Val 


Gly 


Thr 


Asp 


Leu 


Leu 


Phe 


Gly 


Leu 


Giy 


Vdl 


Asn 


Leu 


145 




















155 










1 fin 


Arg 


Asn 


Val 


Ser 


Lys 


Leu 


Met 


Lys 


Lys 


He 


Ser 


Tyr 


Giy 


inr 


Leu 


Asn 








165 










1 /u 










X / O 




Val 


Val 


Asp 


Val 


Cys 


His 


Arg 


Lys 


Phe 


Phe 


Asn 


Asn 


Arg 


l le 


i ie 


vai 






180 










"IOC 

loo 
















Asn 


Pro 


He 


Ser 


Ser 


Ser 


Phe 


Ser 


Lys 


Asn 


Val 


Cys 


lie 


lie 


pro 


Leu 






195 










200 










one 








Phe 


Ser 


Ala 


Ala 


Glu 


Glu 


Phe 


Ser 


Ser 


Leu 


Gly Glu 


Cys 


Arg 


Asp 


Leu 




210 




















220 










Phe 


Asn 


Gly 


He 


Cys 


Asp 


Asp 


Val 


Glu 


Arg 


Tyr 


He 


Asn 


Ser 


Tyr 


rne 


225 










230 










235 












Phe 


Tyr 


Pro 


Glu 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Ala 


Pro 


oer 


ber 








245 










1 C f\ 














Pro 


Glu 


Met 


Glu 


He 


Ala 


Asp 


Glu 


GlU 


GlU 


Gin 


Ser 


irro 


Lys 


a nr 


lie 








260 










2oo 
















Lys 


Arg 


Asn 


Asp 


Asn 


Ala 


Ser 


Arg 


Asn 


Trp 


Ser 


Gly 


val 


Cys 


Leu 


lie 




275 










o o r\ 


















Phe 


Glu 


Val 


Phe 


Lys 


Asn 


Thr 


Tyr 


Tyr 


He 


He 


Asn 


Arg 


Giy 


Asp 


Arg 




o q n 




















300 










Gly 


Gly 


Ser 


Phe 


Glu 


Lys 


Ala 


val 


Lys 


Ser 


Ala 


He 


Ser 


Ser 


He 


Lys 


305 










310 










315 










320 


Glu 


Lys 


Arg 


Cys 


Lys 


He 


Thr 


Asp 


He 


Asn 


Gly Asn 


Lys 


Pro 


Arg 


Leu 










325 










330 










335 




Val 


Met 


Val 


He 


Thr 


Gly 


Cys 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Phe 


Lys 


Asp 


Ala 








340 










345 










350 






Leu 


Lys 


Gin 


He 


Gly 


Glu 


Asn 


Arg 


Arg 


Lys 


Phe 


Leu 


Lys 


Met 


Asn 


Gly 




355 










360 










365 








Asn 


Tyr 


Phe 


Ser 


Leu 


He 


Asp 


Glu 


Gin 


Ala 


Asp 


Leu 


lie 


Glu 


Phe 


Ala 




370 










375 










380 










Met 


Ser 


Val 


Ser 


Gly 


Ala 


Gly 


Glu 


Arg 


He 


Phe 


Val 


Asn 


Gly 


Leu 


Gly 


385 










390 










395 










400 


Met 


Phe 


Gin 


Asn 


Arg 


Lys 


Met 


He 


Pro 


Val 


He 


Asp 


Pro 


Leu 


Thr 


Tyr 










405 










410 










415 




Glu 


Asn 


Val 


Val 


Cys 


Gly 


Glu 


His 


Asp 


He 


Gin 


Lys 


Glu 


Asp 


Ala 


He 








420 










425 










430 






Leu 


Ser 


Val 


Arg 


Arg 


Ala 


He 


Ala 


Asp 


Tyr 


Asn 


Asp 


Phe 


Val 


Ser 


Lys 
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435 

Asn Lys Arg Gly 
450 

Asp Ala Asp Ala 
465 

Pro Pro Ala His 
Thr Leu Cys 



440 

Lys Lys Arg Ser 
455 

Ser Ser Ser Ser 
470 

Lys Lys Ser Arg 
485 



Ala Glu Glu Glu 
460 

Ser Ser Ser Pro 
475 

Leu Pro Asp Glu 
490 



445 

Asn Glu Asp Glu 

Pro Pro Ser Ser 
480 

Gly Glu Lys Cys 
495 



<210> 30 
<211> 846 
<212> DNA 
<213> SHRIMP 



<400> 30 

atggcggtaa 

aaactcgtat 

gcagggtttg 

aaagaaaagg 

cctttgacta 

actgaaaatt 

gtaagcgagg 

aaaactgcaa 

aatcttgcac 

tcggaattgg 

gacagtggca 

tgtttgtgta 

ggtgtcgctg 

ttaataccca 

acataa 



acttggataa 
ccgaggcaat 
acatgagacg 
ccataaaggc 
caatgcagta 
tattaggtga 
aagatattga 
cagtcatagt 
catcattcgg 
ataatgtcgc 
atactaaaaa 
gaatttttgc 
aaaacaacag 
aacaaccagc 



tgttcttgtg 
aaagcggcga 
tcaagttgaa 
attcgatgag 
tgaagagtgg 
tgttgagcac 
aggtttcgct 
taaggcagat 
cgtcactcaa 
agattctttc 
attgttgtat 
agaaattgaa 
agatgaaatt 
tggtgcaaaa 



aatatcaaca 
gctaaaactg 
gctgcattat 
ctcatacaag 
gtaaaccgta 
gccgattttt 
gcttctactt 
tgtgaaacgg 
gaaattaaaa 
catatttata 
gggttaagga 
tcagatggga 
gatgaaaacg 
ttgatcatct 



acaaggatga 
tatttgacac 
atgaagcaat 
aaagaggtga 
caataactcc 
tactggaccg 
ttaaggaggt 
gggatatcga 
tatataggtc 
aaatttctgc 
ataaaaaagc 
ttatggccaa 
aagaaggtaa 
acttcttttt 



agatcttaca 
taaaaatcaa 
atccaaaaag 
tgaaattaca 
ctcattgacg 
aatgacaccc 
atcagattca 
tgaagtgtat 
aaacaattcc 
aacagatagc 
aggttatacg 
tacaaatatc 
atatggtttt 
aaattgttgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

846 



<210> 31 
<211> 282 
<212> PRT 
<213> SHRIMP 



<220> 

<221> VARIANT 

<222> (1) . . . (282) 

<223> Xaa =* Any Amino Acid 



<400> 31 



Met 


Ala 


Val 


Asn 


Leu 


Asp 


Asn 


Val 


Leu 


Val 


Asn 


He 


Asn 


Asn 


Lys 


Asp 


1 








5 










10 










15 




Glu 


Asp 


Leu 


Thr 


Lys 


Leu 


Val 


Ser 


Glu 


Ala 


He 


Lys 


Arg 


Arg 


Ala 


Lys 








20 










25 










30 






Thr 


Val 


Phe 


Asp 


Thr 


Lys 


Asn 


Gin 


Ala 


Gly 


Phe 


Asp 


Met 


Arg 


Arg 


Gin 






35 










40 










45 








Val 


Glu 


Ala 


Ala 


Leu 


Tyr 


Glu 


Ala 


He 


Ser 


Lys 


Lys 


Lys 


Glu 


Lys 


Ala 




50 










55 










60 










He 


Lys 


Ala 


Phe 


Asp 


Glu 


Leu 


He 


Gin 


Glu 


Arg 


Gly 


Asp 


Glu 


He 


Thr 


65 










70 










75 










80 


Pro 


Leu 


Thr 


Thr 


Met 


Gin 


Tyr 


Glu 


Glu 


Trp 


Val 


Asn 


Arg 


Thr 


He 


Thr 










85 










90 










95 




Pro 


Ser 


Leu 


Thr 


Thr 


Glu 


Asn 


Leu 


Leu 


Gly 


Asp 


Val 


Glu 


His 


Ala 


Asp 








100 










105 










110 






Phe 


Leu 


Leu 


Asp 


Arg 


Met 


Thr 


Pro 


Val 


Ser 


Glu 


Glu 


Asp 


He 


Glu 


Gly 






115 










120 










125 








Phe 


Ala 


Ala 


Ser 


Thr 


Phe 


Lys 


Glu 


Val 


Ser 


Asp 


Ser 


Lys 


Thr 


Ala 


Thr 
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130 

Val He Val Lys 
145 

Asn Leu Ala Pro 

Ser Asn Asn Ser 
180 

Tyr Lys He Ser 
195 

Leu Tyr Gly Leu 
210 

He Phe Ala Glu 
225 

Gly Val Ala Glu 

Lys Tyr Gly Phe 
260 

He Tyr Phe Phe 
275 



135 

Ala Asp Cys Glu 
150 

Ser Phe Gly Val 
165 

Ser Glu Leu Asp 

Ala Thr Asp Ser 
200 

Arg Asn Lys Lys 
215 

He Glu Ser Asp 
230 

Asn Asn Arg Asp 
245 

Leu He Pro Lys 

Leu Asn Cys Trp 
280 



140 

Thr Gly Asp He 
155 

Thr Gin Glu He 
170 

Asn Val Ala Asp 
185 

Asp Ser Gly Asn 

Ala Gly Tyr Thr 
220 

Gly He Met Ala 
235 

Glu He Asp Glu 
250 

Gin Pro Ala Gly 
265 

Thr Xaa 



Asp Glu Val Tyr 
160 

Lys He Tyr Arg 
175 

Ser Phe His He 
190 

Thr Lys Lys Leu 
205 

Cys Leu Cys Arg 

Asn Thr Asn He 
240 

Asn Glu Glu Gly 
255 

Ala Lys Leu lie 
270 



<210> 32 
<211> 1089 
<212> DNA 
<213> SHRIMP 



<400> 32 

atgactgtat 

gaagatgaaa 

cttaacgctg 

gaagcagcat 

tctgtcgaaa 

ttggcaggcg 

gcagcattaa 

aacaaagata 

tcagatcttg 

aacaaaatat 

gatactggtg 

ttagtcgaag 

gttgataaaa 

ttcgttatta 

actgaccaag 

tttcaattca 

tcagaaagga 

atattggcct 

gacgattaa 



tagctgtata 
atgaggaaga 
tcaaggaaaa 
taaaaactgc 
tagatgtgga 
caacaagcga 
taacagaaga 
ataaagacgg 
tggggttgta 
tccctcatga 
ttgtgactga 
gggatgaacc 
taaaccacgt 
ttccacggtt 
tcaaacctaa 
caatggataa 
acttgaggaa 
catatgggaa 



cactgcacct 
acccgtaaag 
acctgcagag 
cgaagaaaca 
ggaagtactt 
aacgctaaca 
agatatcact 
ggaagcaact 
cttgaacgaa 
gatggttttt 
tcaagccatt 
tgaagtattc 
cttgaaaatt 
ctctcctgta 
aacattctta 
gcaacattct 
tttacccgat 
gcgtctagga 



caaataaaga 
actttggaag 
tactttgagt 
gcccttcgag 
gaagagaaac 
aacacaatca 
attaaaatgt 
cctgaagaaa 
gtagtagaaa 
gaaagatgtg 
cagataccct 
cctggtgact 
tctgcaaaga 
ggaagtgttt 
tttttagcca 
gtagggtgtg 
tcaaaaccta 
actggtgttt 



aatcgaagaa 
attttgtaaa 
tgctaatatc 
attttgttat 
caagagaata 
tcgcagaggt 
taaaacaatt 
aggaagattt 
aaacaacaaa 
ctattttaat 
ccaacaaata 
gcttggatct 
acggatgtga 
cttccatgat 
acaaaaatga 
agttggacat 
gacctctaag 
tcacaacaga 



gagaaaaatt 
gggtcggctc 
tgcagacact 
tgagaacgac 
tgtcttcaaa 
acaaaaaaag 
cagggctgcg 
taccaataat 
tattgtcatt 
tgaagatttt 
caaaatcaga 
tgcagtttca 
aaacaactgc 
attgggcagc 
cagtacacat 
gttaattttt 
tgatgcagac 
aaatttggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1089 



<210> 33 
<211> 356 
<212> PRT 
<213> SHRIMP 



<400> 33 
Met Thr Val Leu 
1 

Lys Arg Lys He 
20 

Glu Asp Phe Val 

35 

Ala Glu Tyr Phe 
50 



Ala Val Tyr Thr 
5 

Glu Asp Glu Asn 

Lys Gly Arg Leu 
40 

Glu Leu Leu He 
55 



Ala Pro Gin He 
10 

Glu Glu Glu Pro 
25 

Leu Asn Ala Val 

Ser Ala Asp Thr 
60 



Lys Lys Ser Lys 
15 

Val Lys Thr Leu 
30 

Lys Glu Lys Pro 
45 

Glu Ala Ala Leu 
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Lys 


Thr 


Ala 


Glu 


65 








Ser 


Val 


Glu 


He 


Tyr 


Val 


Phe 


Lys 








100 


lie 


He 


Ala 


Glu 






115 




He 


Thr 


He 


Lys 




130 






Lys 


Asp 


Gly 


Glu 


145 








Ser 


Asp 


Leu 


Val 


Val 


He 


Asn 


Lys 








180 


He 


Glu 


Asp 


Phe 






195 




Pro 


Ser 


Asn 


Lys 




210 






Val 


Phe 


Pro 


Gly 


225 








Asn 


His 


Val 


Leu 


Phe 


Val 


He 


He 








260 


He 


Leu 


Gly 


Ser 






275 




Ala 


Asn 


Lys 


Asn 




290 






His 


Ser 


Val 


Gly 


305 








Asn 


Leu 


Pro 


Asp 


Ala 


Ser 


Tyr 


Gly 








340 


Leu 


Val 


Asp 


Asp 






355 





Glu 


Thr 


TV 1 -* 

Ala 


Leu 




fu 






Asp 


Val 


Glu 


Glu 


85 








Leu 


Ala 


Gly 


Ala 


Val 


Gin 


Lys 


Lys 








i on 
Iz U 


Met 


Leu 


Lys 


Gin 






135 




Ala 


Thr 


Pro 


Glu 




150 






Gly 


Leu 


Asn 


Glu 


165 








He 


Phe 


Phe 


Met 


Asp 


Thr 


Gly 


Val 








200 


Tyr 


Lys 


He 


Arg 






215 




Asp 


Cys 


Leu 


Asp 




230 






Lys 


He 


Ser 


Ala 










Pro 


Arg 


Phe 


Ser 


Thr 


Asp 


Gin 


Val 








280 


Asp 


Ser 


Thr 


His 






295 




Cys 


Glu 


Leu 


Asp 




310 






Ser 


Lys 


Pro 


Arg 


325 








Lys 


Arg 


Leu 


Gly 



Arg 


Asp 


Pne 


vai 






/ o 




Val 


Leu 


Glu 


GlU 










Thr 


Ser 


GlU 


Thr 


105 








Ala 


Ala 


Leu 


He 


Phe 


Arg 


Ala 


Ala 








140 


Glu 


Lys 


Glu 


Asp 






loo 




Val 


Val 


Glu 


Lys 




170 






Val 


Phe 


Glu 


Arg 


185 








Val 


Thr 


Asp 


Gin 


Leu 


Val 


Glu 


Gly 








220 


Leu 


Ala 


Val 


Ser 






235 




Lys 


Asn 


Gly 


Cys 




250 






Pro 


Val 


Gly 


Ser 


265 








Lys 


Pro 


Lys 


Thr 


Phe 


Gin 


Phe 


Thr 








300 


Met 


Leu 


He 


Phe 






315 




Pro 


Leu 


Ser 


Asp 




330 






Thr 


Gly 


Val 


Phe 


345 









He Glu Asn Asp 
80 

Lys Pro Arg Glu 
95 

Leu Thr Asn Thr 
110 

Thr Glu Glu Asp 
125 

Asn Lys Asp Asn 

Phe Thr Asn Asn 
160 

Thr Thr Asn He 
175 

Cys Ala He Leu 
190 

Ala He Gin He 
205 

Asp Glu Pro Glu 

Val Asp Lys He 
240 

Glu Asn Asn Cys 
255 

Val Ser Ser Met 
270 

Phe Leu Phe Leu 
285 

Met Asp Lys Gin 

Ser Asn Leu Arg 
320 

Ala Asp He Leu 
335 

Thr Thr Glu Asn 
350 



<210> 34 
<211> 930 
<212> DNA 
<213> SHRIMP 



<400> 34 

atgtcttctt 

aaagacattg 

attatcaata 

gttgatgaac 

tttacaagtg 

ggagccaacg 

aataaaatga 

aaacgatttt 

atgatgcgtg 

atggacggtt 

aacaaagtag 

agagtagacg 

cttgataatt 

caaagagtca 

tctgatatgt 



cgtcttctga 
taaatgctct 
atcacatctc 
ggtacgatag 
ctcagactat 
ggactggacc 
ggggaaatat 
ccagttctat 
gttctgacgg 
caagaaagaa 
gggatgagtg 
actatgaaag 
tcgaacaagc 
tgttcttacc 
tttctataat 



aactcctaag 
agataataat 
acgaaaaacg 
aaaaatagcc 
gtatgaccaa 
agtcatgggg 
gctcgaatac 
gacaacgctt 
gtattttgca 
agaagaccca 
ggaagagggt 
agatatccat 
attaggagaa 
caagaaatct 
attcccacct 



acttccaccg 
ggcgagtggt 
gcagaaactg 
gacaaaatca 
tatgcaatag 
ccagtgaaca 
gctgaagata 
gaatttgatc 
ttccctttcc 
attaattgta 
aaagaacgtg 
gtttcaattt 
aatgcaaact 
aaactgttca 
gcatctgacc 



atactgggga 
tgtcttccta 
tccaaaaaat 
acgaaa taaa 
acacatttca 
cggttatcga 
tgtgggacgg 
taagttactc 
gtggaacaaa 
tcatttcagt 
aagtgaattt 
tgtgcatg tt 
ctttttattt 
atagacctac 
aagat tttgc 



agaaaggatt 
tattgatccg 
caaccaagaa 
atcatccatc 
agaaggaaaa 
tacaacttta 
agatgactgg 
tgatttaact 
aaagataaag 
aacatatcca 
taacctagaa 
aca tgcacaa 
taaaaagggg 
tgtagaagat 
agatgatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tattatcgaa taattgtaac atgttcataa 930 

<210> 35 
<211> 307 
<212> PRT 
<213> SHRIMP 



<400> 35 



Met 


Ser 


Ser 


Ser 


Ser 


Ser 


GlU 


Tnr 


Pro 


Lys 


Thr 


Ser 


i nr 


Asp 


i nr 


o±y 


i 

l 








c 










i f\ 

X \J 










± D 




Glu 


Glu 


Arg 


He 


Lys 


Asp 


He 


Val 


Asn 


Ala 


Leu 


Asp 


Asn 


Asn 




r* 1 n 
blU 








zo 


























Trp 


Leu 


Ser 


Ser 


Tyr 


He 


Asp 


Pro 


lie 


He 


Asn 


Asn 


his 


Arg 


Lys 


u nr 






35 










A f\ 
H U 


















Ala 


Glu 


Thr 


Val 


Gin 


Lys 


He 


Asn 


Gin 


Glu 


Val 


Asp 


Glu 


Arg 


Tyr 


Asp 




50 










c c 










60 










Arg 


Lys 


He 


Ala 


Asp 


Lys 


He 


Asn 


Glu 


He 


Lys 


Ser 


Ser 


He 


Phe 


Thr 


65 










70 










75 










o U 


Ser 


Ala 


Gin 


Thr 


Met 


Tyr 


Asp 


Gin 


Tyr 


Ala 


He 


Asp 


Thr 


Phe 


Gin 


Glu 










85 










90 










95 




Gly 


Lys 


Gly 


Ala 


Asn 


Gly 


Thr 


Gly 


Pro 


Val 


Met 


Gly 


Pro 


Val 


Asn 


Thr 








100 










105 










1 1U 






Val 


He 


Asp 


Thr 


Thr 


Leu 


Asn 


Lys 


Met 


Arg 


Gly Asn 


Met 


Leu 


Glu 


Tyr 






115 










120 










125 








Ala 


Glu 


Asp 


Met 


Trp 


Asp 


Gly 


Asp 


Asp 


Trp 


Lys 


Arg 


Phe 


Ser 


Ser 


Ser 




130 










135 










140 










Met 


Thr 


Thr 


Leu 


Glu 


Phe 


Asp 


Leu 


Ser 


Tyr 


Ser 


Asp 


Leu 


Thr 


Met 


Met 


145 










150 










155 










1 bU 


Arg 


Gly 


Ser 


Asp 


Gly 


Tyr 


Phe 


Ala 


Phe 


Pro 


Phe 


Arg 


Gly 


Thr 


Lys 


Lys 










165 










170 










175 




He 


Lys 


Met 


Asp 


Gly 


Ser 


Arg 


Lys 


Lys 


Glu 


Asp 


Pro 


He 


Asn 


Cys 


He 








180 










185 










190 






He 


Ser 


Val 


Thr 


Tyr 


Pro 


Asn 


Lys 


Val 


Gly Asp Glu 


Trp 


Glu 


Glu 


Gly 






195 










200 










205 








Lys 


Glu 


Arg 


Glu 


Val 


Asn 


Phe 


Asn 


Leu 


Glu Arg 


Val 


Asp 


Asp 


Tyr 


Glu 




210 










215 










220 










Arg 


Asp 


He 


His 


Val 


Ser 


He 


Leu 


Cys 


Met 


Leu 


His 


Ala 


Gin 


Leu 


Asp 


225 










230 










235 










240 


Asn 


Phe 


Glu 


Gin 


Ala 


Leu 


Gly 


Glu 


Asn 


Ala 


Asn 


Ser 


Phe 


Tyr 


Phe 


Lys 










245 










250 










255 




Lys 


Gly 


Gin 


Arg 


Val 


Met 


Phe 


Leu 


Pro 


Lys 


Lys 


Ser 


Lys 


Leu 


Phe 


Asn 








2 60 










265 










270 






Arg 


Pro 


Thr 


Val 


Glu 


Asp 


Ser 


Asp 


Met 


Phe 


Ser 


He 


He 


Phe 


Pro 


Pro 




275 










280 










285 








Ala 


Ser 


Asp 


Gin 


Asp 


Phe 


Ala 


Asp 


Asp 


He 


Tyr 


Tyr 


Arg 


He 


He 


Val 




290 










295 










300 










Thr 


Cys 


Ser 




























305 

































<210> 36 . 
<211> 3657 
<212> DNA 
<213> SHRIMP 

<400> 36 

atggagacaa ctatggataa tgtcgttcag 
gttgctactg ttacaacagc aactgagaaa 
cttaaggccg aatgtcctca agtactgaga 
aattttggaa aatccatgtc ggctattttt 
aaacacttta aggacccaaa gacaaagaag 



aataacgacg taacaaaacc aacaccagat 60 

cgtcagtcat gcaaagagaa aaaggatcaa 120 

gcactaaaat tgtccaatac tttaaaggca 180 

gctcaacatt tagtggacat gacaaacgct 240 

attttagaac tggatggaag tagtagcagt 300 
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gacagtgaag 
agaagtgctt 
gctcaaggtg 
cttagagacg 
caggacatgg 
tcagaaatag 
aggtcggctg 
aacagtaata 
catcatgtca 
agttctgtaa 
agtgatgaat 
tcgtccgata 
tcgtctttac 
gatacggcca 
aagcccgtta 
cgaaatgttt 
acgaatgcga 
actcgccata 
agtaaaaaga 
gattcagaaa 
tcatccgaag 
cgcaaaacaa 
ctaaaaacag 
atagacaagg 
tctacgccaa 
gaacaagtaa 
gctttagata 
gctgaattgg 
cgccatgaag 
aaaactgttg 
aatgaagaaa 
caacatgaac 
aaatcaatag 
aagcaaaact 
atatctatcg 
gggaataata 
gcccaagcta 
caaaactcag 
tttcccggaa 
gcacgaggat 
gatgaaggag 
gttaattttg 
aatatgaacc 
atgtttgaaa 
gatgtatcaa 
aaaaatagag 
cattcccagc 
atggaagcag 
tctacaaatg 
tgtacgacca 
atgagtgtat 
ctgatgttgc 
ggaggaggag 
agagattgga 
agtaatgccg 
caagacgaca 



aagaggaaga 
cttcaaagaa 
tattccgcca 
aagtaaagga 
aaaaggtggt 
ccaaaggttt 
ttctagactc 
tttttgacac 
aatatttatt 
agtatgtaaa 
ttaagtccct 
ataatgttgc 
aaaaagtacg 
gagatattgt 
cattattgca 
tcaccgacgg 
acctattatc 
atcttctgag 
aggagaaaaa 
cagattcttc 
acgaagaaga 
aaaagaaacc 
gaggttactt 
atgattttgc 
tgaatcaaag 
aatatagtgt 
gtgctaaatt 
caaggctagc 
cattcattct 
tagagtcatt 
tgatgtcggt 
aacaaccgga 
tgaactcatt 
actcgcctat 
gatctaatat 
ctgtaactgg 
ataatataa t 
cagccatgtc 
acgatccttc 
tttattccat 
gagttaaagc 
ccgctcagtc 
cttctaatgt 
aaaatggcag 
atcgatactt 
cagaaagagc 
tagcaaacgc 
aagcagcgga 
ctattaatat 
tcgccgtgag 
taaacactgc 
agcaacaaca 
tattaggatc 
tattaaacat 
gccggacatt 
agagtatgat 



aactagttct 
agaaaaatgc 
gtttgcagat 
tgaacaaact 
ggaagaagtt 
ggcccgtttc 
gtccatatcc 
actaaacaca 
gcaaaatctt 
gaaatcatat 
tctcactgga 
tactccttac 
tgaaactaaa 
actacttcca 
gttggtgtcc 
cttttttaat 
taacgacttt 
tatgaaaatg 
gaaggataaa 
ctcatcatca 
agaaaaagga 
atcaaaggac 
ccacgacacg 
gggcgtagcc 
attagtatct 
tgatactgaa 
gacatctcaa 
tgccttcttt 
aacatcaaaa 
ctttaaaaat 
gatgccgttc 
tttgaaaaga 
cccttccaat 
gggaagacgc 
ctcgcccaat 
tctaaggcta 
caaaaatgcc 
catactactc 
atctataaaa 
cgcagaagga 
gtacacttta 
tctggaaaaa 
attagaagga 
tgactgtgaa 
gaaaaaattc 
actcgtcgaa 
catgggggta 
atctgaaatg 
aacaaataca 
tgcagccgcc 
aaataacagt 
accaacacat 
aggaactgaa 
gatatctcca 
aggatatgga 
ggataaactt 



tcatccaaac 
cccaatacta 
atcatcatta 
gagctaaaga 
ttagggcgcc 
gatacccacg 
aaagagttgg 
ctcaaggaag 
acaccggagg 
caatactaca 
gtcaacatta 
aaacacatca 
cctgtttcaa 
gagacgggtg 
tacatcaacg 
gcagcgtgcg 
cctaaaccca 
cttcaagaag 
aagaagggcg 
tcatcttctt 
gaagcagtag 
gacgatttag 
agtgaactcg 
caatatgcag 
agtcttttag 
agtacttcct 
caagtggtta 
ttcgtggtgg 
cttttaccct 
ttaacaatta 
gaagacgaac 
gtggtgggag 
aagagtgtac 
ataaatttgg 
atactatttt 
accaacctat 
aacacgctcg 
tttcctccaa 
ttacaggata 
tgtatcgggg 
ctagtggact 
tcaatgtctg 
ggatcttttg 
cccacccctc 
aacaatgata 
caggttacaa 
gctgttatag 
agagcagcaa 
attggaatga 
gacatgtcaa 
catagcagca 
tcagcatttc 
cagacaaagg 
gaagacaaaa 
tctaatataa 
tctgagatgt 



ggaaaagagg 
tcaaaaattg 
atcttccctc 
ccatatatga 
aagacctgtt 
tctcgttgct 
aaaaaaatag 
aaatcaaaga 
atgcaaattt 
tacaaacatc 
aaatattgga 
ctaatcccag 
aggattatcc 
gcatttctga 
ctctcttttc 
tcctaatttc 
ttgaactggc 
gttcatccag 
gtggtggtgg 
cttcttcatc 
aaaaaggcaa 
atacaattag 
gcaataaaat 
taacaatcac 
atttgataat 
ccaccgccaa 
caatgatggt 
tggataatac 
cgaatgaaaa 
gtaacaaagt 
aacaacaaca 
aagtatttct 
aattaacggc 
cggccaagat 
ctaacctccc 
tgaaaaacat 
taaacaatac 
catcaaaaga 
tgactacaat 
tcgtacgctc 
caaatactat 
aagccttaac 
tagacggtgc 
tagcaaaata 
aaaatacaca 
ataagcctac 
gcgcagcatc 
attatcaagc 
tacgtaatac 
agctcgccaa 
gacgaggaga 
tggaacagac 
atcatgtgga 
atactactac 
ctggcataaa 
ctagcttcag 



tagtggtgct 
gctcaatgat 
ttttgatgat 
cttgtataga 
tgatcacaag 
cccttcggat 
caagggcccg 
acttttatgt 
tgtgttcaat 
agagatggaa 
gaagataatc 
gaacattatt 
gttcagagtg 
tctccctatc 
cctcgagcgc 
ccagtgccta 
agctaatgtt 
tgaaaagaag 
tgacgattct 
ctcctcctcc 
gaaaactaaa 
taaactgatt 
tagaaattta 
tgagatgcaa 
gagactaaaa 
atctaataat 
ggattctgga 
tgtagttaat 
tagaggactt 
ctctacctcg 
acaatgccct 
agaaatggga 
tgacgcgttc 
aaaaacggct 
agaatctgta 
atcccaaagt 
gatggaccaa 
aacttctatc 
gtcaaatctg 
gagggaattt 
ggacatggca 
aaataatgcg 
tctttcttac 
tacaatgaaa 
agatttatat 
aagtgttgtg 
aatcaagctc 
aacatcaaaa 
tactcacctg 
caaccatttt 
cagatcgtcc 
aagaggaaga 
acgtatgaag 
tactacaccc 
cacgataaaa 
aacttaa 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3657 



<210> 37 
<211> 1210 
<212> PRT 
<213> SHRIMP 
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<400> 37 



Met 




ric l. 




1 








Pro 


Asp 


Val 


Ala 








20 


Lys 


Glu 


Lys 


Lys 






35 




Ala 


Leu 




Leu 




50 






Ser 


Ala 


He 


Phe 


65 








Phe 


Lys 


ASp 


Mr X w 


Asp 


Ser 


OX U 


VZ X U 








X Vj VJ 


Gly Ser 


vjxy 


Al « 






i i t 




Tnr 


lie 


Lys 


Asn 




1 JO 






A±a 


ASp 


He 


He 


145 








vai 


Lys 


Asp 


oXU 


Gin 


Asp 


wet 












Pne 


Asp 


HIS 


Lys 






i yo 




HIS 


vai 


oer 


Leu 




210 






lie 


Ser 


Lys 


Glu 


225 








IT lie 


Asp 


i nr 


Leu 


His 


His 


XT-, i 

vai 


Lys 








zou 


Phe 


Val 


rxie 


Asn 






on ^. 




Tyr 


He 




inr 




290 






Thr 


Gly 


Val 


Asn 


305 








Ala 


Thr 


Pro 


Tyr 


Leu 


Gin 


Lys 


vdl 










Arg 


Val 


Asp 


inr 






355 




He 


Ser 


Asp 


Leu 




370 






Tyr 


He 


Asn 


Ala 


385 








Gly 


Phe 


Phe 


Asn 


Ala 


Asn 


Leu 


Leu 








420 


Asn 


Val 


Thr 


Arg 






435 




Ser 


Ser 


Ser 


Glu 




450 






Lys 


Lys 


Gly 


Gly 



Asn Val Val Gin 
5 

Thr Val Thr Thr 

Asp Gin Leu Lys 
40 

Ser Asn Thr Leu 
55 

Ala Gin His Leu 
70 

Lys Thr Lys Lys 
85 

Glu Glu Glu Thr 

Arg Ser Ala Ser 
120 

Trp Leu Asn Asp 
135 

He Asn Leu Pro 
150 

Gin Thr Glu Leu 
165 

Lys Val Val Glu 

Ser Glu He Ala 
200 

Leu Pro Ser Asp 
215 

Leu Glu Lys Asn 
230 

Asn Thr Leu Lys 
245 

Tyr Leu Leu Gin 

Ser Ser Val Lys 
280 

Ser Glu Met Glu 
295 

He Lys He Lys 
310 

Lys His He Thr 
325 

Arg Glu Thr Lys 

Ala Arg Asp He 
360 

Pro He Lys Pro 
375 

Leu Phe Ser Leu 
390 

Ala Ala Cys Val 
405 

Ser Asn Asp Phe 

His Asn Leu Leu 
440 

Lys Lys Ser Lys 
455 

Gly Gly Gly Asp 



Asn 


Asn 


Asp 


Val 




10 






Ala 


Thr 


Glu 


Lys 


25 








AT a 
nla 










AT a 
rtX d 


A <rt 


Pha 








60 


v a. x 


ASp 


Met 


X I1X 






75 




Tip 


Leu 


Asp 






90 






Ser 


Ser 


Ser 


Ser 


ins 








OfciX 




Lys 


O J. U 


r\±. ct 




uiy 


VOX 








i a n 




IT llC 


ASp 


Asp 






1 




Lys 


1 tlx 


x xts 


Tyr 




170 






Glu 


Val 


Leu 


Gly 










Lys 


oi y 


Leu 


hid 


Arg 




ax a 


Val 








00 f\ 




Lys 


l» J- y 


rxO 






*5 <; 
zoo 






bxll 


l ie 


Lys 




250 






Asn 


Leu 


Thr 


Pro 










Tyr 


vax 


Lys 


Lys 




Asp 


r* i ii 

uiu 


T>Vi 

rne 








i nn 


Tin 

i .Le 


x le 


Car 


oer 










Asn 


Pro 


Arg 


Asn 




330 






Pro 


Val 


Ser 


Lys 


j i -j 








V d JL 


Leu 


Leu 


rx o 


Val 


Thr 


Leu 


Leu 








380 


Glu 


Arg 


Arg 


Asn 






395 




Leu 


He 


Ser 


Gin 




410 






Pro 


Lys 


Pro 


He 


425 








Ser 


Met 


Lys 


Met 


Lys 


Lys 


Glu 


Lys 








460 


Asp 


Ser 


Asp 


Ser 



Thr 


Lys 


Pro 


Thr 






15 




Arg 


Gin 


Ser 


Cys 




30 






Gin 


Val 


Leu 


Arg 


45 








Gly 


Lys 


OCX 


Ma r- 
L 1C t, 


Asn 


Ala 




Hi c: 








80 


OCX 


OCX 


OCX 


oer 






95 




Lys 


- 

Arg 


Lys 


Arg 




110 






Lys 


Cys 


Pro 


Asn 


1 0 s 

X J 








pho 

it lie 


Arg 


oin 


tr lie 


Leu 


Arg 


Asp 


uXU 








i fin 

1DU 


ASp 


Leu 


Xyr 


Arg 






X ' .J 




Arg 


uin 


Asp 


Leu 




190 






Arg 


Phe 


Asp 


Thr 










Leu 


Asp 


ber 


oer 


Asn 


oer 


Asn 


lie 








9 4 n 


ulU 


Leu 


Leu 


Cys 










IjX U 


Asp 


ax a 


Asn 




270 






Ser 


Tyr 


Gin 


Tyr 










Lys 


OCX 


Leu 


Leu 


Asp 


Asn 


Asn 


v ax 










x xe 


x xe 


oer 


oer 










Asp 


Tyr 


Pro 


rfle 




350 






Glu 


Thr 


Gly 


Gly 


365 








Gin 


Leu 


Val 


Ser 


Val 


Phe 


Thr 


Asp 








400 


Cys 


Leu 


Thr 


Asn 






415 




Glu 


Leu 


Ala 


Ala 




430 






Leu 


Gin 


Glu 


Gly 


445 








Lys 


Lys 


Asp 


Lys 


Glu 


Thr 


Asp 


Ser 
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ber 


OCX 


OCI 


oer 


oer 




















GlU 


Asp 


Glu 


blU 


GlU 


GlU 


T « we* 

Lys 


\j±y 


















Thr 


Lys 


Arg 


Lys 


inr 


Lys 


Lys 


Lys 






515 












Thr 


lie 


Ser 


Lys 


Leu 


lie 


Leu 


Lys 




C *i f\ 

530 










D JO 




Ser 


Glu 


Leu 


Gly 


Asn 


Lys 


lie 


Arg 


545 










550 






Ala 


Gly 


Val 


Ala 


Gin 


Tyr 


Ala 


Val 










565 








Pro 


Met 


Asn 


Gin 


Arg 


Leu 


val 


Ser 








tz o n 
DO 0 










Leu 


Lys 


Glu 


Gin 


Val 


Lys 


Tyr 


Ser 






C ft c 

595 










C f\ f\ 


Thr 


Ala 


Lys 


Ser 


Asn 


Asn 


Ala 


Leu 




610 










OlD 




Gin 


Val 


Val 


Thr 


Met 


Met 


val 


Asp 


c o c 










con 






Ala 


Ala 


Phe 


Phe 


Phe 


val 


val 


val 










645 








Glu 


Ala 


Phe 


He 


Leu 


Thr 


Ser 


Lys 








660 










Gly 


Leu 


Lys 


Thr 


Val 


Val 


Glu 


Ser 






675 










CO f\ 


Asn 


Lys 


Val 


Ser 


Thr 


Ser 


Asn 


Glu 




690 










695 




Glu 


Asp 


Glu 


Gin 


Gin 


Gin 


Gin 


Gin 


705 










710 






Asp 


Leu 


Lys 


Arg 


Val 


Val 


Gly 


Glu 










725 








lie 


Val 


Asn 


Ser 


Phe 


Pro 


Ser 


Asn 








740 










Ala 


Phe 


Lys 


Gin 


Asn 


Tyr 


Ser 


Pro 






755 










760 


Ala 


Lys 


He 


Lys 


Thr 


Ala 


He 


Ser 




770 










775 




He 


Leu 


Phe 


Ser 


Asn 


Leu 


Pro 


Glu 


785 










/ yo 






Gly 


Leu 


Arg 


Leu 


Thr 


Asn 


Leu 


Leu 










o /\c 








Ala 


Asn 


Asn 


He 


He 


Lys 


Asn 


Ala 








O *"> A 

820 










Asp 


Gin 


Gin 


Asn 


Ser 


Ala 


Ala 


Met 






835 










840 


Ser 


Lys 


Glu 


Thr 


Ser 


He 


Phe 


Pro 




850 










Odd 

ODD 




Leu 


Gin 


Asp 


Met 


Thr 


Thr 


Met 


Ser 


865 










870 






Glu 


Gly 


Cys 


He 


Gly 


val 


Val 


Arg 










885 








Val 


Lys 


Ala 


Tyr 


Thr 


Leu 


Leu 


Val 








900 










Val 


Asn 


Phe 


Ala 


Ala 


Gin 


Ser 


Leu 






915 










920 


Thr 


Asn 


Asn 


Ala 


Asn 


Met 


Asn 


Pro 




930 










935 




Phe 


Val 


Asp 


Gly Ala 


Leu 


Ser 


Tyr 


945 










950 











47S 










4 80 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 




*1 «7 W 










4 95 




m ii 

\J JL U 




VOX 




Lys 


Glv 


Lys 


Lys 












510 






Pro 


ocx 


Lys 


Asp 


Asp 


Asp 


Leu 


Asp 










525 








Thr 

X I IX. 




\siy 


i yr 


Phe 


His 


Asp 


Thr 


















Asn 


Leu 


ile 


Asp 


Lys 


sp 


sp 


Phe 






JJJ 










560 


IJli 


Tip 

lie 


X 11 XT 




Met 


Gin 




Thr 














S75 




Ser 


Leu 


Leu 


Asp 


Leu 


Tip 
lie 




Arg 


















vai 


Asp 


TK -r- 

i nr 




c Dr 


X 1XX7 


0€x 


OCX 










DUQ 








Asp 


oer 




Lys 


Leu 


Th r 
i in 


Del 


V3111 








ton 










bsr 




Ala 




Leu 


Ala 
nlo 


Arg 


Leu 


















Asp 


Asn 


inr 


vax 


Vol 


Asn 


Arg 


nio 














\j -j *j 




Leu 


Leu 


Pro 


oer 


Asn 


blu 


Asn 


Arg 


ODJ 










670 






r*ne 


pne 


Lys 


Asn 


Leu 


i nr 


l ie 


oer 










OOJ 








GlU 


net 


mi a 4- 

wer. 


oer 


Vol 


Mat- 

net 


Pro 


pho 

IT X1C5 








■7AA 

/ uu 










Cys 


Pro 


Gin 


HIS 


r* l »i 


(— i — 


uin 


Pro 






Tic 

/ 13 












val 


Phe 


Leu 


/■"111 
GlU 


nei. 


biy 


Lys 


oer 




"7 "3 f\ 
/jU 










/ oD 




Lys 


Ser 


vai 


Gin 


Leu 


inr 


nld 


Asp 


/4 5 
















Met 


Gly 


Arg 


Arg 


i ie 


Asn 


Leu 


nld 










1 o D 








lie 


Giy 


oer 


Asn 


Tift 

x j_e 


oer 


fro 


Asn 








7Qft 

/ o u 










Ser 


vai 


\a ±y 


Asn 


Asn 


xnr 


vai 


x nr 






/ 27 J 










Ron 


Lys 


Asn 


x ie 


Cor 

oer 


uin 




Aid 


uin 




Q1 A 
O X U 










OlJ 




Asn 


i ni 


Leu 


Vol 


Asn 


Asn 


i eh 


Mot 
lit; t- 












D Jv* 






C A ^ 

oer 


Tl « 

± ie 


Leu 


Leu 


rne 


Pro 


Pro 


TK t~ 

inr 










Q/R 
Ol J 








Giy 


Asn 


Asp 


Pro 


oci 


oer 


11c 


Lys 








UDU 










Asn 


Leu 


Arg 


Pne 


Tyr 


O A V 

ser 


lie 


Ala 






875 










880 


Ser 


Arg 


Glu 


Phe 


Asp 


Glu 


Gly 


Gly 




890 










895 




Asp 


Ser 


Asn 


Thr 


Met 


Asp 


Met 


Ala 


905 










910 






Glu 


Lys 


Ser 


Met 


Ser 


Glu 


Ala 


Leu 










925 








Ser 


Asn 


Val 


Leu 


Glu 


Gly 


Gly 


Ser 








940 










Met 


Phe 


Glu 


Lys 


Asn 


Gly 


Ser 


Asp 






955 










960 
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Cys 


Glu 


Pro 


1111. 


Pro 
965 


Leu 


Ala 


Lys 


Tyr 


Thr 
970 


Met 


Lys 


Asp 


Val 


Ser 
975 


Asn 


Arg 


Tyr 


Leu 


Lys 
980 


Lys 


Phe 


Asn 


Asn 


Asp 
985 


Lys 


Asn 


Thr 


Gin 


Asp 
990 


Leu 


Tyr 


Lys 


Asn 


Arg 


Ala 


Glu 


Arg 


Ala 


Leu 


Val 


Glu 


Gin 


Val 


Thr 


Asn 


Lys 


Pro 




995 










1000 








1005 






Thr 


Ser 


Val 


Val 


His 


Ser 


Gin 


Leu 


Ala 


Asn 


Ala 


Met 


Gly 


Val 


Ala 


Val 




1010 








1015 








1020 








He 


Gly 


Ala 


Ala 


Ser 


He 


Lys 


Leu 


Met 


Glu 


Ala 


Glu 


Ala 


Ala 


Glu 


Ser 


1025 








1030 








1035 








1040 


Glu 


Met 


Arg 


Ala 


Ala 


Asn 


Tyr 


Gin 


Ala 


Thr 


Ser 


Lys 


Ser 


Thr 


Asn 


Ala 










1045 








1050 








1055 


He 


Asn 


He 


Thr 


Asn 


Thr 


He 


Gly 


Met 


He 


Arg 


Asn 


Thr 


Thr 


His 


Leu 








1060 








1065 








1070 




Cys 


Thr 


Thr 


He 


Ala 


Val 


Ser 


Ala 


Ala 


Ala 


Asp 


Met 


Ser 


Lys 


Leu 


Ala 






1075 








1080 








1085 






Asn 


Asn 


His 


Phe 


Met 


Ser 


Val 


Leu 


Asn 


Thr 


Ala 


Asn 


Asn 


Ser 


His 


Ser 




1090 








1095 








1100 








Ser 


Arg 


Arg 


Gly 


Asp 


Arg 


Ser 


Ser 


Leu 


Met 


Leu 


Gin 


Gin 


Gin 


Gin 


Pro 


1105 








1110 








1115 








1120 


Thr 


His 


Ser 


Ala 


Phe 


Leu 


Glu 


Gin 


Thr 


Arg 


Gly Arg 


Gly 


Gly 


Gly 


Val 










1125 








1130 








1135 


Leu 


Gly 


Ser 


Gly 


Thr 


Glu 


Gin 


Thr 


Lys 


Asp 


His 


Val 


Glu 


Arg 


Met 


Lys 








1140 








1145 








1150 




Arg Asp 


Trp 


He 


Leu 


Asn 


Met 


He 


Ser 


Pro 


Glu 


Asp 


Lys 


Asn 


Thr 


Thr 






1155 








1160 








1165 






Thr 


Thr 


Thr 


Pro 


Ser 


Asn 


Ala 


Gly Arg 


Thr 


Leu 


Gly 


Tyr 


Gly 


Ser 


Asn 



1170 1175 1180 

He Thr Gly He Asn Thr He Lys Gin Asp Asp Lys Ser Met Met Asp 
1185 1190 1195 1200 

Lys Leu Ser Glu Met Ser Ser Phe Arg Thr 
1205 1210 



<210> 38 
<211> 2376 
<212> DNA 
<213> SHRIMP 



<400> 38 

atgtctcttg 

gttgtggagg 

tcttcttctt 

cccccaaaga 

ccagcatctc 

cctctaatcg 

aaaacgctac 

tcaaccttta 

ggaaataaca 

acacaagctg 

ctccctgaag 

aaaaatattg 

ttgttccaca 

gt taagaaag 

atcggggatt 

gttaagaggc 

actacttcca 

tccatttcta 

caagaagaat 

catcctactt 

tactgcatct 



ttgagaataa 
gagcagaagt 
cagct tcgga 
agaaacgtaa 
cagaaatgct 
ataatagttt 
tcactgaaac 
a tcaactcga 
actacagata 
ttttggacgc 
gggagggaat 
tccaggctgc 
agatggagaa 
taaggaggaa 
ctcttcaaga 
cattctccaa 
ctggtgttgt 
agggatgttt 
tcgtttctga 
tcaagaagct 
gcccctacta 



tacccaagaa 
tgcccctaga 
tagtgaagat 
tatcaatagc 
cagtgctgta 
tggcgttcag 
tattgccgtg 
aaggaccgga 
ctggcaaatt 
aatcgttgaa 
cgggcttcaa 
cgatattgaa 
ggccgatgaa 
caagtcccag 
gcgtattgat 
ttctgctgct 
taatccacga 
cactgtgagg 
gttgtacact 
tattcatgac 
caccacagga 



gaaatgattc 
ggtgtcaaga 
gatgaaggag 
ggaaagtact 
aatgatattg 
tttaaaaaga 
gaatatggaa 
gagcctctta 
cgaattgaag 
ggaaatgaca 
tttaacaaga 
tttggacaag 
tcttctaatt 
cctacaaatt 
aacgcaatca 
gctgctgaag 
gggattaagg 
aatattgttg 
aaccttctaa 
cgtacaatga 
ggcgtccccc 



tggaaactac 
gaccccttcc 
gagagcaacc 
ggaaaatcga 
ataacgtgtc 
gtgtatctga 
caatcacaaa 
agaagaagcg 
ccgctgcagc 
ccgtgatcaa 
gtgttagctc 
ttgcgcacat 
cttctggtga 
cttactacac 
aagttatcga 
aagacaccac 
atattcactt 
cagcaaatgg 
aggttgaaga 
atcgacacat 
ctgcagctga 



tgttgaaggt 
ttcttcctca 
acaaacgaaa 
aacaatcgaa 
caaaactatt 
agaacagatc 
tgttaaatat 
ctcaaacaat 
agagaatgtt 
ggcaatcctc 
tcaacaagct 
gaagtgtaat 
atcaccaaag 
tttcactatg 
aatgtctcca 
tactacgacc 
tttcgattca 
tgaagttcca 
gaagg tggat 
taaggcatgg 
taaggtttcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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gctaagggaa 
ggggcacata 
tctatgctct 
gaaggtttgg 
gatagcaggg 
gctgatgttt 
gatactatgg 
attaagaagg 
actgcaatca 
cgagcacaca 
tcaacaacag 
tttgcaaagc 
aaggaaatgg 
aacgaagaga 
gtaaagatca 
ttccctgctg 
tttgatgttg 
tctgaaaata 
ggagagaagc 



ttgctaccta 
catctactac 
tccagagtcc 
agcctatttc 
caacacgctg 
tgtctattgt 
aatatggtgt 
gaggagatac 
cttcggagga 
agaaacgaaa 
caaatactcc 
ttactttaga 
tgcagagaat 
agatggaaat 
agacgaatgc 
ataaggtgtg 
taagggagga 
aggcaagcgg 
tgttgcacga 



cagaatatac 
cccagcacag 
tggaactgat 
atcgggcgaa 
cttcaaattt 
tcatactgat 
tgcaaagagc 
tttccatagc 
acttaatagg 
gagcaactct 
aaagaagacg 
ttatgttgat 
tttggctcag 
tgtagaggga 
aaaggcatac 
tctaaagcac 
taatgttaat 
aggagatgat 
attgttgaat 



gaggacagga 
gcagccgagg 
attcaaaagt 
attgtgtacc 
tattcctctt 
ggacttttca 
aagagtaaaa 
gaggaggata 
gaatgtaaca 
tctactacta 
aagaagagcg 
agtacatcat 
gagagagtaa 
gaagaagcac 
aatcttgcca 
acattggaag 
aagacggtcg 
gaagaaactc 
gaataa 



ctggagtgtt 
caactggtgc 
tcctcgatgc 
gttcaaagtg 
cagacgagaa 
gtagtgtaca 
ttatccccaa 
ttgaagtgcc 
ccaagggaat 
ctacttccac 
cttctgctgc 
ttgtatttta 
agacgctaaa 
aagaaactta 
acaaaacatc 
atttggggga 
cttctactac 
caatggaatt 



ccaattcgat 
tattcacaag 
taagaaggca 
gagccccaat 
gatgaacatt 
ctttagaaag 
aactatcaag 
tgtaaaattc 
gaacagcctc 
cacttccact 
cagtgaccca 
caatatcagt 
ggcagtcaag 
tagaggaatc 
tggagtactt 
tgtacttgat 
tactacatca 
tgaaactgac 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2376 
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<400> 39 



Met 


Ser 


Leu 


Val 


Glu 


Asn 


Asn 


Thr 


Gin 


Glu 


Glu 


Met 


He 


Thr 


Thr 


Val 


1 








5 










10 










15 




Glu 


Gly 


Val 


Val 


Glu 


Gly 


Ala 


Glu 


Val 


Ala 


Pro 


Arg 


Gly 


Val 


Lys 


Arg 








20 










25 










30 






Pro 


Leu 


Pro 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ala 


Ser 


Asp 


Ser 


Glu 


Asp 






35 










40 










45 








Asp 


Glu 


Gly 


Gly 


Glu 


Gin 


Pro 


Gin 


Thr 


Lys 


Pro 


Pro 


Lys 


Lys 


Lys 


Arg 




50 










55 










60 










Asn 


lie 


Asn 


Ser 


Gly 


Lys 


Tyr 


Trp 


Lys 


He 


Glu 


Thr 


He 


Glu 


Pro 


Asp 


65 










70 










75 










80 


Glu 


Met 


Leu 


Ser 


Ala 


Val 


. Asn 


Asp 


He 


Asp 


Asn 


Val 


Ser 


Lys 


Thr 


lie 










85 










90 










95 




Pro 


Leu 


lie 


Asp 


Asn 


Ser 


Phe 


Gly 


Val 


Gin 


Phe 


Lys 


Lys 


Ser 


Val 


Ser 








100 










105 










110 






Glu 


Glu 


Gin 


He 


Lys 


Thr 


Leu 


Leu 


Thr 


Glu 


Thr 


He 


Ala 


Val 


Glu 


Tyr 






115 










120 










125 






Gly 


Thr 


He 


Thr 


Asn 


Val 


Lys 


Tyr 


Ser 


Thr 


Phe 


Asn 


Gin 


Leu 


Glu 


Arg 




130 










135 










140 








Thr 


Gly 


Glu 


Pro 


Leu 


Lys 


Lys 


Lys 


Arg 


Ser 


Asn 


Asn 


Gly 


Asn 


Asn 


Asn 


145 










150 










155 










160 


Tyr 


Arg 


Tyr 


Trp 


Gin 


He 


Arg 


He 


Glu 


Ala 


Ala 


Ala 


Ala 


Glu 


Asn 


Val 










165 










170 










175 




Thr 


Gin 


Ala 


Val 


Leu 


Asp 


Ala 


He 


Val 


Glu 


Gly Asn 


Asp 


Thr 


Val 


He 








180 










185 










190 






Lys 


Ala 


He 


Leu 


Leu 


Pro 


Glu 


Gly 


Glu 


Gly 


He 


Gly 


Leu 


Gin 


Phe 


Asn 






195 










200 










205 








Lys 


Ser 


Val 


Ser 


Ser 


Gin 


Gin 


Ala 


Lys 


Asn 


He 


Val 


Gin 


Ala 


Ala 


Asp 




210 










215 










220 










lie 


Glu 


Phe 


Gly 


Gin 


Val 


Ala 


His 


Met 


Lys 


Cys 


Asn 


Leu 


Phe 


His 


Lys 


225 










230 










235 










240 


Met 


Glu 


Lys 


Ala 


Asp 


Glu 


Ser 


Ser 


Asn 


Ser 


Ser 


Gly 


Glu 


Ser 


Pro 


Lys 










245 










250 










255 




Val 


Lys 


Lys 


Val 


Arg 


Arg 


Asn 


Lys 


Ser 


Gin 


Pro 


Thr 


Asn 


Ser 


Tyr 


Tyr 








260 










265 










270 






Thr 


Phe 


Thr 


Met 


He 


Gly 


Asp 


Ser 


Leu 


Gin 


Glu 


Arg 


He 


Asp 


Asn 


Ala 
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275 




lie 


Lys 


Val 


lie 




290 






Ala 


Ala 


Ala 


Ala 


305 








Gly 


Val 


Val 


Asn 


Ser 


lie 


Ser 


Lys 








340 


Gly 


Glu 


Val 


Pro 






355 




Leu 


Lys 


Val 


Glu 




370 






His 


Asp 


Arg 


Thr 


385 








Pro 


Tyr 


Tyr 


Thr 


Ala 


Lys 


Gly 


lie 








420 


Phe 


Asp 


Gly Ala 






435 




Thr 


Gly 


Ala 


lie 




450 






lie 


Gin 


Lys 


Phe 


4 65 








Ser 


Ser 


Gly 


Glu 


Arg 


Ala 


Thr 


Arg 








500 


Asn 


lie 


Ala 


Asp 






515 




Ser 


Val 


His 


Phe 




530 






Lys 


Ser 


Lys 


He 


545 








Thr 


Phe 


His 


Ser 


lie 


Thr 


Ser 


Glu 








580 


Ser 


Leu 


Arg 


Ala 






595 




Thr 


Ser 


Thr 


Thr 




610 






Lys 


Lys 


Ser 


Ala 


625 








Asp 


Tyr 


Val 


Asp 


Met 


Val 


Gin 


Arg 








660 


Val 


Lys 


Asn 


Glu 






675 




Glu 


Tyr 


Gly 


He 




690 






Ala 


Asn 


Lys 


Thr 


705 








Lys 


His 


Thr 


Leu 


Arg 


Glu 


Asp 


Asn 








740 


Ser 


Glu 


Asn 


Lys 






755 





280 

Glu Met Ser Pro 
295 

Glu Glu Asp Thr 
310 

Pro Arg Gly lie 
325 

Gly Cys Phe Thr 

Gin Glu Glu Phe 
360 

Glu Lys Val Asp 
375 

Met Asn Arg His 
390 

Thr Gly Gly Val 
405 

Ala Tyr He Tyr 

His Thr Ser Thr 
440 

His Lys Ser Met 
455 

Leu Asp Ala Lys 
470 

He Val Tyr Arg 
485 

Cys Phe Lys Phe 

Val Leu Ser He 
520 

Arg Lys Asp Thr 
535 

He Pro Lys Thr 
550 

Glu Glu Asp He 
565 

Glu Leu Asn Arg 

His Lys Lys Arg 
600 

Ser Thr Ser Thr 
615 

Ser Ala Ala Ser 
630 

Ser Thr Ser Phe 
645 

He Leu Ala Gin 

Glu Lys Met Glu 
680 

Val Lys He Lys 
695 

Ser Gly Val Leu 
710 

Glu Asp Leu Gly 
725 

Val Asn Lys Thr 

Ala Ser Gly Gly 
760 



vai 


Lys 


Arg 


fro 








o n a 


Thr 


Thr 


Thr 


Thr 






315 




Lys 


Asp 


lie 


riXS 




330 






Val 


Arg 


Asn 


He 


345 








Val 


Ser 


Glu 


Leu 


His 


Pro 


Thr 


Pne 








380 


He 


Lys 


Ala 


Trp 






395 




Pro 


Pro 


Ala 


Ala 




410 






Glu 


Asp 


Arg 


Thr 


425 








Thr 


Pro 


Ala 


Gin 


Leu 


Phe 


Gin 


Ser 








460 


Lys 


Ala 


Glu 


Gly 






475 




Ser 


Lys 


Trp 


Ser 




490 






Tyr 


Ser 


Ser 


Ser 


505 








Val 


His 


Thr 


Asp 


Met 


Glu 


Tyr 


Gly 








540 


He 


Lys 


He 


Lys 






555 




Glu 


Val 


Pro 


Val 




570 






Glu 


Cys 


Asn 


Thr 


585 








Lys 


Ser 


Asn 


Ser 


Thr 


Ala 


Asn 


Thr 








620 


Asp 


Pro 


Phe 


Ala 






635 




Val 


Phe 


Tyr 


Asn 




650 






GlU 


Arg 


IT- 1 

vai 


Lys 


665 








He 


Val 


Glu 


Gly 


Thr 


Asn 


Ala 


Lys 








700 


Phe 


Pro 


Ala 


Asp 






715 




Asp 


Val 


Leu 


Asp 




730 






Val 


Ala 


Ser 


Thr 


745 








Asp 


Asp 


Glu 


Glu 



285 



rne 


Oci. 


Asn 




xnr 


inr 




i nr 








•J O A 


pne 


Fne 


Asp 








J JO 




Val 


Ala 


Ala 


Asn 




JDU 






Tyr 


Thr 


Asn 


Leu 


365 








Lys 


Lys 


Leu 


He 


Tyr 


Cys 


lie 


Cys 








400 


Asp 


Lys 


vai 


ber 






415 




Gly 


Val 


Phe 


Gin 




430 






Ala 


Ala 


Glu 


Ala 


445 








Pro 


Gly 


Thr 


Asp 


Leu 


Glu 


Pro 


He 








480 


Pro 


Asn 


Asp 


Ser 






495 




Asp 


Glu 


Lys 


Met 




510 






Gly 


Leu 


Phe 


Ser 


525 








Val 


Ala 


Lys 


Ser 


Lys 


Gly 


Gly 


Asp 








560 


Lys 


Phe 


Thr 


Ala 






575 




Lys 


Gly 


Met 


Asn 




590 






Ser 


Thr 


Thr 


Thr 


605 








Pro 


Lys 


Lys 


Thr 


Lys 


Leu 


Thr 


Leu 








64 0 


He 


Ser 


Lys 


Glu 






655 




Thr 


Leu 


Lys 


Ala 




670 






Glu 


Glu 


Ala 


Gin 


685 








Ala 


Tyr 


Asn 


Leu 


Lys 


Val 


Cys 


Leu 








720 


Phe 


Asp 


Val 


Val 






735 




Thr 


Thr 


Thr 


Ser 




750 






Thr 


Pro 


Met 


Glu 



7 65 
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Phe Glu Thr Asp Gly Glu Lys Leu Leu His Glu Leu Leu Asn Glu 
770 775 780 



<210> 40 
<211> 1878 
<212> DNA 
<213> SHRIMP 



<400> 40 

atgagctgtt 

gtagaaggag 

cgaaagagat 

gattctgact 

acaaggaaac 

ctctacatgt 

ggtgaagaag 

cttggttgta 

gagtatgaag 

gattgtgtta 

ttccagttct 

atgaattcag 

ttgggtgaca 

tcttcatcat 

gcaatgggaa 

aatttccgcc 

aacaagaaac 

tttagagaaa 

atgaataatg 

aatattattg 

agagatgcac 

aagagactcc 

aatgatgttc 

tttgcttctg 

ttcaatagta 

gctaatttaa 

agaacattaa 

tctagtggtt 

caattaatgt 

ggaaacttztg 

gtgaagactg 

aacgtccctt 



cttcttcttc 
gaggaggcag 
cttctgttaa 
ctgatagaga 
cgcccaaagt 
ctagttcatc 
aaagtgacga 
agaaggaaaa 
aagaattcaa 
tacctgatcg 
ctcagccagt 
cattctcttc 
catctaaaat 
cgacgttcgt 
gagtagcaga 
ctgtagtttc 
tcaatagtaa 
tacacagtgt 
acaacacaaa 
gtaaaattcc 
tatatttgta 
gtatccctca 
ccattacctc 
ctatggtgga 
gttattccgt 
tcctatctat 
ctcagagcgt 
tgaccagtga 
atgagggtgt 
aggatgaaga 
ttggactatc 
tctactga 



atcgtgctca 
aatcggacca 
aagtacatca 
agaaaaagaa 
aagaaaactg 
ttcttcttcc 
tgatgatgat 
atctccccaa 
gagaatggca 
gattttaacc 
ttctttcctc 
catgttatcc 
ttcctccttt 
caataattgt 
attgctgtca 
cttaatgcga 
caggcaaact 
tatcgcgcta 
ttctagcgga 
ttataatgaa 
ccagaatgtg 
acaacaagct 
agatttactt 
tcctgcatat 
atataaagtc 
tttatcagct 
gtatttgttc 
agagagttct 
gactcgtcaa 
cggtgtaaca 
tgcattgcta 



agttcatctg 
acagaggcca 
tcttcttctt 
ggaagaaaac 
gatactcctt 
tcctcatctt 
gacgattatg 
gacatagaag 
ttaccttcac 
ctcttttcta 
cggttggtga 
agttctggta 
ataacacctc 
acagatgaag 
aatattgcag 
ggaccaactt 
attccccaag 
tatttatcgt 
tacgctgaag 
atgagtagag 
atcacggata 
gatttttgtt 
ttaactggaa 
cgcctggctg 
ctagatcttg 
agaaataagt 
ctaaatcatc 
ctaggaacag 
accatcgagg 
ctgaaatgtt 
tctgataggc 



aagaggagaa 
agaaaaagat 
cctcatcatc 
tatacgtgga 
cacaaacttt 
ccgactcatc 
acccagataa 
ctgaaaagga 
ggataaacac 
cacttttgaa 
tgaagcaagt 
tgaggttggt 
aaacggatac 
atatcaagaa 
ccagttctaa 
gtggaggttc 
tattaaacaa 
ccgtctgtgt 
ggatggttac 
aaaaattcat 
tgactggccc 
acattatagc 
aggcgacaaa 
tccataaaat 
accataaaat 
gtctcagtga 
ttttgcggaa 
ccgtaaaatt 
acggatgtag 
tgggagccga 
ttagaaaaaa 



tgaggtagga 
cctccgtaaa 
atcatcagac 
tattgcagat 
agagaacgac 
atcgtcttct 
tgtccatgtt 
aaaggaggaa 
atctgtagat 
gaaaaatagt 
aaacgaggcc 
ggaggactct 
tagtaattca 
gcgtaacatt 
tgaggagaat 
taatgcatcc 
ggttatattt 
gcagcgagca 
taaaatcttg 
atccgttgga 
caaacataac 
catgttggtt 
tttagtacaa 
ggcgtctgtt 
gttattaagg 
aagaaaacct 
caaattgagg 
ggtgtcacag 
catgattagc 
tgttaaggac 
cattagaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1878 



<210> 41 
<211> 621 
<212> PRT 
<213> SHRIMP 



<400> 41 



Met 


Ser 


Cys 


Ser 


Ser 


Ser 


Ser 


Ser 


1 








5 








Asn 


Glu 


Val 


Gly 


Val 


Glu 


Gly 


Gly 








20 










Ala 


Lys 


Lys 


Lys 


He 


Leu 


Arg 


Lys 






35 










40 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 




50 










55 




Asp 


Arg 


Glu 


Glu 


Lys 


Glu 


Gly Arg 


65 










70 






Thr 


Arg 


Lys 


Pro 


Pro 


Lys 


Val 


Arg 



85 



Cys 


Ser 
10 


Ser 


Ser 


Ser 


Glu 


Glu 
15 


Glu 


Gly 


Gly 


Arg 


He 


Gly 


Pro 


Thr 


Glu 


25 










30 






Arg 


Lys 


Arg 


Ser 


Ser 
45 


val 


Lys 


Ser 


Ser 


Ser 


Ser 


Asp 
60 


Asp 


Ser 


Asp 


Ser 


Lys 


Leu 


Tyr 
75 


Val 


Asp 


He 


Ala 


Asp 
80 


Lys 


Leu 
90 


Asp 


Thr 


Pro 


Ser 


Gin 
95 


Thr 
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Leu 


Glu 


Asn 


Asp 








T Art 

1UU 


Ser 


Ser 


Asp 


Ser 






lib 




Asp 


Asp 


Asp 


Asp 




130 






Lys 


Glu 


Lys 


Ser 


145 








Glu 


Tyr 


Glu 


Glu 


Thr 


Ser 


Val 


Asp 








i on 


Ser 


Thr 


Leu 


Leu 






195 




Phe 


Leu 


Arg 


Leu 




210 






Phe 


Ser 


Ser 


Met 


225 








Leu 


Gly 


Asp 


Thr 


Thr 


Ser 


Asn 


Ser 








260 


Glu 


Asp 


lie 


Lys 






275 




Leu 


Ser 


Asn 


lie 




290 






Val 


Val 


Ser 


Leu 


305 








Lys 


Leu 


Asn 


Ser 


lie 


Phe 


Phe 


Arg 








340 


Val 


Cys 


Val 


Gin 






355 




Tyr 


Ala 


Glu 


Gly 




370 






Pro 


Tyr 


Asn 


Glu 


385 








Ala 


Leu 


Tyr 


Leu 


His 


Asn 


Lys 


Arg 








420 


lie 


lie 


Ala 


Met 






435 




Leu 


Thr 


Gly 


Lys 




4 50 






Asp 


Pro 


Ala 


Tyr 


4 65 








Ser 


Ser 


Tyr 


Ser 


Leu 


Arg 


Ala 


Asn 








500 


Leu 


Ser 


Lys 


Pro 






515 




His 


Leu 


Leu 


Arg 




530 






Ser 


Ser 


Leu 


Gly 


545 








Glu 


Gly 


Val 


Thr 


Gly 


Asn 


Phe 


Glu 



Leu Tyr Met Ser 

Ser Ser Ser Ser 
120 

Tyr Asp Pro Asp 
135 

Pro Gin Asp lie 
150 

Glu Phe Lys Arg 
165 

Asp Cys Val lie 

Lys Lys Asn Ser 
200 

Val Met Lys Gin 
215 

Leu Ser Ser Ser 
230 

Ser Lys lie Ser 
245 

Ser Ser Ser Ser 

Lys Arg Asn lie 
280 

Ala Ala Ser Ser 
295 

Met Arg Gly Pro 
310 

Asn Arg Gin Thr 
325 

Glu lie His Ser 

Arg Ala Met Asn 
360 

Met Val Thr Lys 
375 

Met Ser Arg Glu 
390 

Tyr Gin Asn Val 
405 

Leu Arg lie Pro 

Leu Val Asn Asp 
440 

Ala Thr Asn Leu 
455 

Arg Leu Ala Val 
470 

Val Tyr Lys Val 
485 

Leu lie Leu Ser 

Arg Thr Leu Thr 
520 

Asn Lys Leu Arg 
535 

Thr Ala Val Lys 
550 

Arg Gin Thr lie 
565 

Asp Glu Asp Gly 



Ser Ser Ser Ser 
105 

Gly Glu Glu Glu 

Asn Val His Val 
140 

Glu Ala Glu Lys 
155 

Met Ala Leu Pro 
170 

Pro Asp Arg lie 
185 

Phe Gin Phe Ser 

Val Asn Glu Ala 
220 

Gly Met Arg Leu 
235 

Ser Phe lie Thr 
250 

Thr Phe Val Asn 
265 

Ala Met Gly Arg 

Asn Glu Glu Asn 
300 

Thr Cys Gly Gly 
315 

lie Pro Gin Val 
330 

Val lie Ala Leu 
345 

Asn Asp Asn Thr 

lie Leu Asn lie 
380 

Lys Phe lie Ser 
395 

lie Thr Asp Met 
410 

Gin Gin Gin Ala 
425 

Val Pro lie Thr 

Val Gin Phe Ala 
460 

His Lys Met Ala 
475 

Leu Asp Leu Asp 
490 

lie Leu Ser Ala 
505 

Gin Ser Val Tyr 

Ser Ser Gly Leu 
540 

Leu Val Ser Gin 
555 

Glu Asp Gly Cys 
570 

Val Thr Leu Lys 



Ser Ser Ser Ser 
110 

Ser Asp Asp Asp 
125 

Leu Gly Cys Lys 

Glu Lys Glu Glu 
160 

Ser Arg lie Asn 
175 

Leu Thr Leu Phe 
190 

Gin Pro Val Ser 
205 

Met Asn Ser Ala 

Val Glu Asp Ser 
240 

Pro Gin Thr Asp 
255 

Asn Cys Thr Asp 
270 

Val Ala Glu Leu 
285 

Asn Phe Arg Pro 

Ser Asn Asn Lys 
320 

Leu Asn Lys Val 
335 

Tyr Leu Ser Ser 
350 

Asn Ser Ser Gly 
365 

lie Gly Lys lie 

Val Gly Arg Asp 
400 

Thr Gly Pro Lys 
415 

Asp Phe Cys Tyr 
430 

Ser Asp Leu Leu 
445 

Ser Ala Met Val 

Ser Val Phe Asn 
480 

His Lys Met Leu 
495 

Arg Asn Lys Cys 
510 

Leu Phe Leu Asn 
525 

Thr Ser Glu Glu 

Gin Leu Met Tyr 
560 

Ser Met lie Ser 
575 

Cys Leu Gly Ala 
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580 585 590 

Asp Val Lys Asp Val Lys Thr Val Gly Leu Ser Ala Leu Leu Ser Asp 

595 600 605 

Arg Leu Arg Lys Asn lie Arg Arg Asn Val Pro Phe Tyr 
610 615 620 



<210> 42 
<211> 3303 
<212> DNA 
<213> SHRIMP 

<400> 42 

atgtttggaa gttctgctaa taactttaat ggtgacaaga aatcttcctc atcatcatca 60 
gctgccgcat catctgacga tcagcagtta ggtccccttg gactgtctac tgctgatttc 120 
aagaaagttg ctgccattct tgccaacaga acagagtccc tatatctgtt gccagattct 180 
cccaacttta agaatgtgat taacaaccct aaccagatat ctattgtgcc gtttctgggt 240 
tcatcgaagg cagctgaaag tggtagtgca aacaagaatg aaaaccaggc tgaaaattct 300 
tctaaaggag gaagtgatgg aaagaaatca tcgcagcaga acaagtttaa cctattgaac 360 
aaggtagagg ctgaagaaat ggcctttaaa cgtgtggctg aactcattgc ggatactcct 420 
ccctctaaag ataatccatt gagggatgac cctgatgcta ttccatcacg caacccatgg 480 
gtcaaattga ctcagaagaa tctggaatat cttttctggg aagcagtaac aattgaagtc 540 
tccaatgata ggagtatccg tagtggaaga tatcttcaag ccagtgaagt gggggagaat 600 
ccattcctaa tgaccatcag tgttgacatt agaatcctgc aaagaatggc acttaatgtc 660 
gtgtggttct tcaacagatt cttccgcatg gtttctggac ttggagtaga aaacagggcc 720 
aattccacct atgtagcaac tagcgatgct attgcccaga tctgggtaga gatgctcctc 7 80 
aagaacttta tttctggaga aaatgtgccc caggcactga agtatttgaa ggaacattat 840 
gagcatgttt ataacaagat cagtaaatgt ggacgtcagc catcctattt tgttgttgaa 900 
tttgaaagag tggacaacac gattggattt gtcaattctg atactgaaca taatggatca 960 
tcatacatgg aatacaggtg ctttgacaca atcaggaaga atgcatcatc tgggccaagt 1020 
ggaggcggaa agagtggtgt cctgtcttct ggaacattct tcattgataa cgaaatgggg 1080 
aataacaata gtagtgcagc tgcagcttct gcccctgctg tttctgctgg agtttctcca 1140 
tcactttctc catttagcag tgatggagat gatgatgacg acgattgtag tggtgatgac 1200 
gtgtggggga agaagatgat attcaacaca tcaggagacg gatcaggaga atcttctggg 1260 
cagaacggtg gtggtgcatc aacttacaag agatttaggt gtggagaaaa tactgcttct 1320 
ctttctcaga aggaaaatgt tcgtctcatg gcgatgccaa agggaaatga agataaacaa 1380 
ctactcaaga acattatcaa ctttctgaat agtgcactta actctgttga aaaccatgta 1440 
atgtgtacag atgaaaatat cttcgatgag gatcaagccg agcattatac gtcaaacaag 1500 
gaactataca aggctattgt ttgctccaac ccagccaatg tgtacagagt aatggttgaa 1560 
ctgtttgtca accttattct tccccgtctt aggaacccaa ttgtgagtga cattgaaact 1620 
gtacaaaatc ttccttcaaa caatggatcg gtgagaacaa agaagatggt tgaacatgga 1680 
tgtacagata tgcgctatga catccctcca tatgcaaagg gaaagattcg tcttagtgcc 1740 
aagagagctt gtgagtgcag aaaactgtgc aaggacgtga ggtgctttga caagtctaga 1800 
gaggcaaatc tcacccctag ccagaaggca ggaagggagg tggaggaacc ttttccccgc 1860 
aaccacaatt ctcacaggag taacgctcac gactttactt tctatgacaa gtacagggca 1920 
aggatgaaca agctcaagaa ggattcaaag aagaaggtaa agaagattga cacctttaca 1980 
acaacagacg attttctcct gcaagatagg aacgcttttg atctacttag aaagtgcttc 2040 
ctttctgcct ctcttcatca cattttctgt cctgatgtgc ttatggtgca tagaggagac 2100 
agcttcaata ttaactttgc aaacaacaaa ctcgagtgct ataatgaacg taacggaatt 2160 
gaagaagtta cttcctcaca aacggtcaac gccaaggaag cacttgagga tattacaaaa 2220 
attaaaatga agagagggga tgatattata gatgttgtga agagtaaagg actttctttg 2280 
agggaattct ctaagaaggt tagtaagatt gtgagaaggt ttaatgagat cacaaaccaa 2340 
ctctgcaaca actgcaacgt taactcttct aatggagatg tggatttcca cgtctttact 24 00 
tctgtgtgtg tctacatcca caacattatt cctgtgctcg aagatatctc catttttgca 24 60 
gaattgggtg aa'gaattgac caagcttgtt aaggagtgta gagacgtggc tggagaggac 2520 
aagacatatg atgatattat ccgcaattac gaaattactg taaagtactt taagctcttt 2580 
aatgcactcg ttaaattctg tcacaggaat tataatgtgg cagtaacctc tgccattaac 264 0 
aggagagggt acatgtgcat ggtgagcaac cttgtcgggt attattgtaa gctgtctgat 2700 
aacgctatcc agtatcacga atcactatgc tctttgcact ctagcatctc ttatgcagac 2760 
tattatacgt ctcgcaataa caattctgaa gatggaggag gaaactcttc ttcagaaaag 2820 
agcaatgcag atgtagccaa gactatggcc tctttctatg accagttcga taagagtgaa 2880 



WO 01/38351 



123 



PCT/DS00/28888 



gacagcaaga 
gatagggttt 
gagaatgaag 
gtggaagatg 
aagcaggaag 
aaagaggaac 
gacgagaaag 
taa 



aaaataagaa 
tggatggcat 
aggaggagga 
ttgatagcaa 
aagatgtgga 
aggaggcgaa 
cagtagcatc 



caaaacttca 
ggatgatgat 
agaggaggaa 
taagaagaca 
gatggaggaa 
ggaggaagac 
tgatgaagat 



aatgagatcc 
gatgatgaag 
attgtaaaga 
ctgccaaagg 
gtgaaggaag 
gctactgagt 
gaagatgatg 



ttataaaaat 
atagtgatag 
aaccagcaaa 
aacctgccgt 
cagcagcaga 
atgacgacga 
aagattctaa 



gttccaaatg 
tagtagcagt 
gaagaggaaa 
taagaaggtg 
agaagaaaag 
tacagaagag 
agctattttc 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3303 



<210> 43 
<211> 1100 
<212> PRT 
<213> SHRIMP 



<400> 43 












Ma4~ 

Met 


irne 


Gxy 


Ser 


Ser 


Ala 


Asn 


Asn 


1 








c 
D 








Ser 


Ser 


Ser 


Ser 


Ala 


Ala 


Ala 


Ser 








20 










Leu 


Gly 


Leu 


Ser 


Thr 


Ala 


Asp 


Phe 






35 










40 


Asn 


Arg 


Thr 


Glu 


Ser 


Leu 


Tyr 


Leu 




50 










55 




Asn 


val 


He 


Asn 


Asn 


Pro 


Asn 


Gin 


65 










70 






ber 


Ser 


Lys 


Ala 


Ala 


Glu 


Ser 


Gly 










85 








Ala 


Glu 


Asn 


Ser 


Ser 


Lys 


Gly 


Gly 








100 
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<210> 44 
<211> 618 
<212> DNA 
<213> SHRIMP 



<400> 44 

atggcaggaa 

aaggaaatac 

tttaacatgc 

cctttccctc 

gagcttctgg 

cccagaacct 

actaccggtc 

gttaggcacg 

gttggtaagc 

gttgtgatta 

aagttctcaa 



tcggtagaag 
cctataacgt 
agaagtgctc 
cataccctgg 
ttgaccaatc 
accctctcga 
acagcgatat 
gttaccacta 
tcactgaaaa 
atctgtcaag 
aaaactag 



agataataga 
tccaccaacc 
agactgtgct 
tactgcagta 
tttccctccc 
atttcccgag 
ccccttcatc 
cttaatcaac 
gcaagaaaac 
cactttgaag 



cccgtattac 
cctattatct 
cctttccctc 
gaagaggagg 
ccattccctg 
aagaaggaga 
gatctggaga 
cccaacaagg 
ctgaacaaat 
gaacttgaga 



<210> 45 
<211> 205 
<212> PRT 
<213> SHRIMP 



atctggacat 
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cctaccccgg 
agaagcaaaa 
gaaataagct 
aggatttccc 
aaaccccacc 
ttggggagct 
tggtgttgga 
agctgcgagc 



tgatccaaat 
ccccttcgtg 
cactgagaag 
ggaaattgag 
gagagatatc 
ttgcgttgac 
ccgtagtgac 
taaccatatc 
tgttgatgac 
tggcctgtgc 
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15 10 15 

He Asp Pro Asn Lys Glu He Pro Tyr Asn Val Pro Pro Thr Pro He 
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<210> 46 
<211> 630 
<212> DNA 
<213> SHRIMP 



<400> 46 

atggcctcat 

gaaagagatg 

gttagcgccg 

cttgaagatt 

agcgatgacg 

ctgtgctctt 

gtgttccttc 

actgaagaag 

ggagatagtg 

ggatctcttt 

gatggtggag 



catcatcatc 
aagaaaatac 
cctgggtgcc 
tttcttcaga 
atgacgatga 
ttgtgagggg 
ttacaagcgc 
gcgcagcaac 
gtattgctgc 
tggaatctat 
aagaaaccac 



ccctgttgcc 
actgtccctc 
tgttgatgaa 
tgaagaagac 
cgaggaggat 
tttctttggc 
agccgttgct 
catggaagaa 
tgatgttgtc 
tgcaacaaca 
agaattgtaa 



ctctcctctg 
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gaggatgaag 

gatgataata 
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<210> 47 
<211> 207 
<212> PRT 
<213> SHRIMP 
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100 

Ser Leu Thr Lys 
115 

Ala He Phe Lys 
130 

Thr Met Glu Glu 
145 

Ser Gly lie Ala 

Glu Asn Gly Ser 
180 

Thr He Glu Asn 
195 



Lys Gin Val Phe 
120 

Thr Arg Asp Val 
135 

Asn Ser Thr Asp 
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Ala Asp Val Val 
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Leu Val Asp Gly 
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<210> 48 
<211> 2685 
<212> DNA 
<213> SHRIMP 



<400> 48 

atgacgagac 

gtcgactaca 

gacaatatca 

aagaaaataa 

gccaccaagg 

gaaaacgcca 

ctatcagaaa 

cgagacacgt 

ttgatatttg 

ttgtgtgtga 

gctaacgtgt 

cgactcggga 

cccggtatgt 

ggggaatact 

tctgactggt 

ggagaaggag 

ttccattact 

gaaaaatacg 

tacccgtacg 

atttctgaag 

gcaaatttag 

caactttttg 

ctttctttcg 

aaagagagat 

agtatgatga 

aaggataaac 

tcaaagggtt 

ccttatgttt 

aataatagta 

aaacaccgtg 

aattacatct 

agtttattac 

gaaagggccc 

actctatttg 

aaacgtaaat 

gtcaatttga 

aatagtacca 

cgttttaaag 

tcctcttcat 

cctagtgatt 



atggtgtgct 
ctttctgtac 
cagaccaaaa 
agaaggaaga 
aattcgaaca 
tcgcggccgc 
gtgaaggaaa 
cccagaaaca 
aaactggatc 
ttaaagccac 
ctatagtgaa 
aattcttcga 
ttttcaagct 
acacaaaggt 
tgaggataaa 
agagtgggtg 
ttatggacaa 
gcatcccgca 
tcaaaccatc 
tggataggac 
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gtgggcttag tgtttaaaat gtgtgatatt cttgcatcct tagcaagcag gagagggatg 24 60 

gaagatcgtc ccacagccaa ctatagaacc tccttacatt cagctactca aaataaaacc 2520 

aatttgaata aactattagt ttctgctatc aaggaaacag gagccactga aactgaagca 2580 

cagatattca acaagattat tggtagtgaa aagggactat caattctctg tcaacttgtg 2640 

gaaaggagga acaaagacaa taatgtcttc gactgatttg tctaa 2685 
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sp 


Jjys 


±rp 




Arg 


X Hi 


cxy 


Arg 


Pro 


305 










310 










315 










320 


Glu 


Phe 


Leu 


Thr 


Asp 


Pro 


Val 


Leu 


His 


Phe 


Lys 


Lys 


Ala 


Glu 


Ala 


Val 










325 










330 










335 




Cys 


Lys 


Ser 


Thr 


Asn 


Pro 


Asn 


Leu 


Arg 


Ala 


He 


Tyr 


Ser 


Pro 


Asp 


Asn 








340 










345 










350 






Lys 


Gly 


Phe 


Leu 


Cys 


Ala 


Pro 


Val 


Ala 


Glu 


Leu 


Val 


Lys 


Thr 


Ala 


Leu 






355 










360 










365 








Thr 


Phe 


Arg 


Gly 


Ser 


His 


Glu 


Pro 


Ser 


Leu 


He 


Val 


Glu 


Arg 


Asp 


He 




370 










375 










380 










Asn 


Gin 


Ala 


Glu 


Asn 


Leu 


Pro 


Ser 


Asn 


Ser 


Phe 


Gly 


Val 


Asn 


Trp 


Pro 


385 










390 










395 










400 


Tyr 


Val 


Asn 


Leu 


Leu 


Asn 


Arg 


He 


Gin 


Asp 


Gin 


Tyr 


Thr 









405 410 



<210> 52 
<211> 1401 
<212> DNA 
<213> SHRIMP 



<400> 52 

atgtctgcat 

gtagctgact 

ctttctgttg 

atcactggag 

atgtctgttg 

gaaattagaa 

agcccgctcg 

attgtttgtg 

gattttcaaa 

aaggtcgact 

atggatataa 

gattctacac 

aaaacagcag 

cagctaacct 

gcactcatgt 

aaggaagatg 

tctggaaacc 

gggctctgtt 

cctaccattc 

cttagaaaat 

gaaagageca 

gcagtagcaa 

agtgcgcaga 



ctttaatatt 
catttgagaa 
ctctctatgg 
aaagaacatc 
ctcgcgttgg 
ccatcatgga 
aggaagaaaa 
gatattatac 
accccactgt 
ctgctaggtc 
agtcatcaat 
ctaacccttg 
ttaatgatat 
ttatgagega 
ccgatactag 
catcaaatac 
ccagacagcc 
tcatgtttaa 
cccattcaac 
aeggttctte 
atgcagtgaa 
atctcaccgg 
gatcgtgttc 



ggacgaatac 
aatcaaggga 
agcacctcca 
tttaaacgac 
acaagaaaat 
agatattaca 
taaggtgcat 
aatggacaca 
cattgecaat 
tacaagattc 
agattgggca 
ttcagttgac 
ataegcttet 
gattgaaaaa 
ggagagggca 
caa ttcaaaa 
cataaaattg 
ggtaatgect 
acaataccat 
attcgaaaag 
aaaatttggt 
acctgtgtat 
aactaaattt 



ctcaagaaga 
gaaatccaat 
aaaccttcag 
aaatatctat 
aategcaaga 
gggagtttgt 
ataggcatca 
tctatttctt 
gtgactaagc 
gaegcttttg 
aatatggtag 
actattgtgt 
gttggaaaat 
ctgegaaagg 
ttcgtagagt 
ttgtttgttc 
gtacgccgtt 
ccagaattca 
ggtctatatg 
tcctgggctc 
gtaaacgata 
gttctcattt 
ctccgtgaaa 



ctgcttcagc 
cacctgagga 
etteggctgt 
eggataatgt 
gagccgacca 
ccggtgcgta 
tgaataacaa 
ccgaacctct 
ggatggagag 
ttaatggtgt 
aaaatgtgat 
ccagagacgc 
cttattgtcg 
ctgcagttgt 
tcctatttta 
agaataagct 
ctgctgagga 
tgaactgtat 
gtacatgttt 
attttgagga 
cgaggataga 
tagatcttgt 
ttaaggaaaa 



cgttctggat 
ggctgegget 
ggcctctatc 
cctattgaaa 
ggcagctgat 
caggcaatat 
aacgcctagc 
ttctctaaca 
cattttttca 
tgcgaataat 
caaattacca 
aagtgtagtt 
tcctgcaaca 
atgttttgag 
cgttagcttt 
atcttccatg 
aacactattc 
atttaacttc 
aacccctcta 
aattttaagc 
ttgtctagat 
aegtactcta 
ctatcttttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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tggaataggt ttgtgtcata a 

<210> 53 
<211> 459 
<212> PRT 
<213> SHRIMP 

<220> 

<221> VARIANT 

<222> (1) . . . (459) 

<223> Xaa = Any Amino Acid 



<400> 53 



Met 


Ser 


Ala 


Ser 


Leu 


He 


Leu 


Asp 


1 








5 








Ala 


Val 


Leu 


Asp 


Val 


Ala 


Asp 


Ser 








20 










Gin 


Ser 


Pro 


Glu 


Glu 


Ala 


Ala 


Ala 






35 










40 


Lys 


Pro 


Ser 


Ala 


Ser 


Ala 


Val 


Ala 




50 










55 




Ser 


Leu 


Asn 


Asp 


Lys 


Tyr 


Leu 


Ser 


65 










70 






Val 


Ala 


Arg 


Val 


Gly 


Gin 


Glu 


Asn 










85 








Ala 


Asp 


Glu 


He 


Arg 


Thr 


He 


Met 








100 










Gly 


Ala 


Tyr Arg 


Gin 


Tyr 


Ser 


Pro 






115 










120 


lie 


Gly 


lie 


Met 


Asn 


Asn 


Lys 


Thr 




130 










135 




Thr 


Met 


Asp 


Thr 


Ser 


He 


Ser 


Ser 


145 










150 






Gin 


Asn 


Pro 


Thr 


Val 


He 


Ala 


Asn 










165 








Phe 


Ser 


Lys 


Val 


Asp 


Ser 


Ala 


Arg 








180 










Asn 


Gly 


Val 


Ala 


Asn 


Asn 


Met 


Asp 






195 










200 


Asn 


Met 


Val 


Glu 


Asn 


Val 


He 


Lys 




210 










215 




Cys 


Ser 


Val 


Asp 


Thr 


He 


Val 


Ser 


225 










230 






Ala 


Val 


Asn 


Asp 


He 


Tyr 


Ala 


Ser 










245 








Ala 


Thr 


Gin 


Leu 


Thr 


Phe 


Met 


Ser 








260 










Ala 


Val 


Val 


Cys 


Phe 


Glu 


Ala 


Leu 






275 










280 


Phe 


Val 


Glu 


Phe 


Leu 


Phe 


Tyr 


Val 




290 










295 




Ser 


Lys 


Leu 


Phe 


Val 


Gin 


Asn 


Lys 


305 










310 






Arg 


Gin 


Pro 


He 


Lys 


Leu 


Val 


Arg 










325 








Gly 


Leu 


Cys 


Phe 


Met 


Phe 


Lys 


Val 








340 










lie 


Phe 


Asn 


Phe 


Pro 


Thr 


He 


Pro 






355 










360 


Thr 


Cys 


Leu 


Thr 


Pro 


Leu 


Leu 


Arg 




370 










375 





1401 



Glu 


Tyr 


Leu 


Lys 


Lys 


Thr 


Ala 


Ser 




10 










15 




Phe 


Glu 


Lys 


He 


Lys 


Gly 


Glu 


He 


25 










30 






Leu 


Ser 


Val 


Tyr 


Gly Ala 


Pro 


Pro 










45 








Ser 


He 


He 


Thr 


Gly 


Glu 


Arg 


Thr 








60 










Asp 


Asn 


Val 


Leu 


Leu 


Lys 


Met 


Ser 






75 










80 


Asn 


Arg 


Lys 


Arg 


Ala 


Asp 


Gin 


Ala 




90 










95 




Glu 


Asp 


He 


Thr 


Gly 


Ser 


Leu 


Ser 


105 










110 






Leu 


Glu 


Glu 


Glu 


Asn 


Lys 


Val 


His 










125 








Pro 


Ser 


He 


Val 


Cys 


Gly 


Tyr 


Tyr 








140 










Glu 


Pro 


Leu 


Ser 


Leu 


Thr 


Asp 


Phe 






155 










160 


Val 


Thr 


Lys 


Arg 


Met 


Glu 


Ser 


He 




170 










175 




Ser 


Thr 


Arg 


Phe 


Asp 


Ala 


Phe 


Val 


185 










190 






He 


Lys 


Ser 


Ser 


He 


Asp 


Trp 


Ala 










205 








Leu 


Pro 


Asp 


Ser 


Thr 


Pro 


Asn 


Pro 








220 










Arg 


Asp 


Ala 


Ser 


Val 


Val 


Lys 


Thr 






235 










240 


Val 


Gly 


Lys 


Ser 


Tyr 


Cys 


Arg 


Pro 




250 










255 




Glu 


He 


Glu 


Lys 


Leu 


Arg 


Lys 


Ala 


265 










270 






Met 


Ser 


Asp 


Thr 


Arg 


Glu 


Arg 


Ala 










285 








Ser 


Phe 


Lys 


Glu 


Asp 


Asn 


Thr 


Asn 








300 










Leu 


Ser 


Ser 


Met 


Ser 


Gly 


Asn 


Pro 






315 










320 


Arg 


Ser 


Ala 


Glu 


Glu 


Thr 


Leu 


Phe 




330 










335 




Met 


Pro 


Pro 


Glu 


Phe 


Met 


Asn 


Cys 


345 










350 






His 


Ser 


Thr 


Gin 


Tyr 


His 


Gly 


Gly 










365 








Lys 


Tyr 


Gly 


Ser 


Ser 


Phe 


Glu 


Lys 



380 
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Ser Trp Ala His Phe Glu Glu lie Leu Ser Ala Asn Ala Val Lys Lys 
385 390 395 400 

Phe Gly Val Asn Asp Thr Arg lie Asp Cys Leu Asp Ala Val Ala Asn 

405 410 415 

Leu Thr Gly Pro Val Tyr Val Leu He Leu Asp Leu Val Arg Thr Leu 

420 425 430 

Ser Ala Gin Arg Ser Cys Ser Thr Lys Phe Leu Arg Glu He Lys Glu 

435 440 445 

Asn Tyr Leu Leu Trp Asn Arg Phe Val Ser Xaa 
450 455 



<210> 54 
<211> 819 
<212> DNA 
<213> SHRIMP 

<400> 54 

atggctcaga catcaaagat gggaactaac aagaggtgtt ttgaggagga agtggaggaa 60 

gaaaggcaac aacctttcac aaagaaatct aaatcggaac cacccagttt tgaagacaag 120 

agttcatcca catcttctaa gaagaagagc aaatccaata aacacaccaa gaccaaggaa 180 

gaacaacttc tagaattcgt gaaggatctg gagcggagcg accccactgt tcctgatgag 240 

aaggtcaagc aagaagttga agaaaagt.cc cctgaagcta ttgctgaaat tttttcaatg 300 

tttgggatcg ctcaagacag caagttcaag agccttcttc ccattgaacg cataaagagc 360 

atcactacta aaattgttat cgatgcaatt aatcagcctg tgcgcaagat gttggttgac 420 

cacctctatc attttaagga gatgcagaat gttgtggaga aatataagga cgatagcgac 4 80 

gaaaaactga gcgtcattct taagagtaag aaatccccca aagaatttga cctctccttt 540 

tccgattacg ttgatcgcct taacaggatt ctggttggtg taattaagag ggtggccgga 600 

gctattgaaa gtaaggaatt gttgcagagt aacagcatga tcatgaacag tgttctgggt 660 

actgttgtgt ccaacattcc ttacaacatg aagattaata tttgtgtgtt tttgactaac 720 

tttatttgta catttgctaa tgatgatttg tacacattct ttagggatga tgagaaattt 7 80 
gtaatgagtc aggtaacaag atacatttca aaggattag 819 

<210> 55 
<211> 272 
<212> PRT 
<213> SHRIMP 



<400> 55 



Met 


Ala 


Gin 


Thr 


Ser 


Lys 


Met 


Gly 


Thr 


Asn 


Lys 


Arg 


Cys 


Phe 


Glu 


Glu 


1 








5 










10 










15 




Glu 


Val 


Glu 


Glu 
20 


Glu 


Arg 


Gin 


Gin 


Pro 
25 


Phe 


Thr 


Lys 


Lys 


Ser 
30 


Lys 


Ser 


Glu 


Pro 


Pro 
35 


Ser 


Phe 


Glu 


Asp 


L y s 

40 


Ser 


Ser 


Ser 


Thr 


Ser 
45 


Ser 


Lys 


Lys 


Lys 


Ser 
50 


Lys 


Ser 


Asn 


Lys 


His 
55 


Thr 


Lys 


Thr 


Lys 


Glu 
60 


Glu 


Gin 


Leu 


Leu 


Glu 


Phe 


Val 


Lys 


Asp 


Leu 


Glu 


Arg 


Ser 


Asp 


Pro 


Thr 


Val 


Pro 


Asp 


Glu 


65 










70 










75 










80 


Lys 


Val 


Lys 


Gin 


Glu 
85 


Val 


Glu 


Glu 


Lys 


Ser 
90 


Pro 


Glu 


Ala 


He 


Ala 
95 


Glu 


He 


Phe 


Ser 


Met 
100 


Phe 


Gly 


He 


Ala 


Gin 
105 


Asp 


Ser 


Lys 


Phe 


Lys 
110 


Ser 


Leu 


Leu 


Pro 


He 
115 


Glu 


Arg 


He 


Lys 


Ser 
120 


He 


Thr 


Thr 


Lys 


He 
125 


Val 


He 


Asp 


Ala 


He 
130 


Asn 


Gin 


Pro 


Val 


Arg 
135 


Lys 


Met 


Leu 


Val 


Asp 
140 


His 


Leu 


Tyr 


His 


Phe 


Lys 


Glu 


Met 


Gin 


Asn 


Val 


Val 


Glu 


Lys 


Tyr 


Lys 


Asp 


Asp 


Ser 


Asp 


145 










150 










155 










160 


Glu 


Lys 


Leu 


Ser 


Val 

165 


He 


Leu 


Lys 


Ser 


Lys 
170 


Lys 


Ser 


Pro 


Lys 


Glu 
175 


Phe 
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Asp Leu Ser Phe Ser Asp Tyr Val 
180 

Gly Val lie Lys Arg Val Ala Gly 
195 200 
Gin Ser Asn Ser Met lie Met Asn 

210 215 
Asn lie Pro Tyr Asn Met Lys lie 
225 230 
Phe lie Cys Thr Phe Ala Asn Asp 
245 

Asp Glu Lys Phe Val Met Ser Gin 
260 



Asp Arg Leu Asn Arg lie Leu Val 
185 190 
Ala lie Glu Ser Lys Glu Leu Leu 
205 

Ser Val Leu Gly Thr Val Val Ser 
220 

Asn lie Cys Val Phe Leu Thr Asn 
235 240 
Asp Leu Tyr Thr Phe Phe Arg Asp 

250 255 
Val Thr Arg Tyr lie Ser Lys Asp 
265 270 



<210> 56 
<211> 1398 
<212> DNA 
<213> SHRIMP 



<400> 56 

atggctgtag 

atgattttaa 

gacgtttata 

gtggtttctt 

caatttgtta 

cctaagaacg 

gattataaaa 

aacaagagga 

tatgtcttta 

tacgttagaa 

agtaagctcg 

tacgtgatca 

tcttcttctg 

acggacccaa 

gcaaaagaag 

cttaaggctc 

gctgcaaaac 

acatctgctg 

gtgaagctat 

tatggtacat 

tgcaccgaca 

gatatggtgg 

tcttcttcgt 

ataaagaaaa 



gggattatct 
attttataaa 
gaagatgcaa 
cgactttaca 
gaggtttagt 
tgggggaata 
agggtaagaa 
ttatagggcc 
agagattgag 
aattttcccc 
tgtatcctca 
acaaagcata 
tgctcgtccc 
cacatagaat 
aaggttatgt 
ctctaattac 
gtgttttatt 
ctcatcaact 
cagatacggg 
ctttagtaaa 
aggctgatgg 
ggtatcttaa 
ggttgacaac 
ttgtgtaa 



ttctatgtcc 
ctttgtcaca 
gcggccgtct 
agaagctaat 
ccccagaaaa 
tattttacct 
ggttggtctt 
aagggatctg 
caaagatccg 
tttcagagca 
ccctatacgg 
cggaatggat 
agactctaat 
cggcatctac 
agaaatgttg 
agagtattct 
ccctgcccct 
agacggagct 
tgactatgca 
tcttccagaa 
aatatatcaa 
cgccaaccct 
tggtaacaaa 



tctgtagggg 
atattatctt 
accaatggat 
ctcgttacca 
atgatggaaa 
tcagaaaaag 
ttaactgccc 
attagtagag 
ctcgtctact 
aaaaaattca 
tatggtactg 
aatgaggatc 
aatgatagat 
ggctttggag 
ttatgtaatt 
acaaatccaa 
ggttccgagc 
actggcgagc 
gttggatcac 
acaggatctc 
gtcaatcaaa 
gtggaagctg 
atatcttctg 



aagcaacgct 
taataatata 
attctggttg 
ctgaaaaaga 
aatatagatc 
aaacagacaa 
tgagtaatgg 
atgatgtgaa 
actcttctgc 
tgacatcaac 
cttttgtact 
tacacacttg 
taacagtaga 
ggtctgatga 
gtgacaacca 
ctgaaattca 
ctgtaaaatc 
acgatatttc 
ccattgtatt 
ctctggcatt 
agggagggat 
catcactttc 
ttacatgtga 



cgtgggtttt 
tgccgtgaca 
tacaaccaat 
taaacctgtt 
ggacttgtct 
attgaaaagt 
tcatgacagt 
ggacaaaagt 
aacctctaaa 
acagttgggg 
acccacggga 
gaacccaccc 
atgtgctaaa 
taatagacgt 
caaggacttg 
agtagatgtt 
ttcccaagtg 
ccatgagccc 
caagccagtt 
gaactgcccc 
attatataga 
ttcctcggac 
aggagaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1398 



<210> 57 
<211> 463 
<212> PRT 
<213> SHRIMP 



<400> 57 

Met Ala Val Gly Asp Tyr Leu Ser Met Ser Ser Val Gly Glu Ala Thr 

15 10 15 

Leu Val Gly Phe Met lie Leu Asn Phe lie Asn Phe Val Thr lie Leu 

20 25 30 

Ser Leu He He Tyr Ala Val Thr Asp Val Tyr Arg Arg Cys Lys Arg 

35 40 45 

Pro Ser Thr Asn Gly Tyr Ser Gly Cys Thr Thr Asn Val Val Ser Ser 

50 55 60 

Thr Leu Gin Glu Ala Asn Leu Val Thr Thr Glu Lys Asp Lys Pro Val 
65 70 75 80 
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Gin 


Phe 


Val 


Aro 


Glv 


Leu 


Val 


Pro Arg 


Lys 


Met 


Met 


Glu 


Lvs 


Tvr 


Arcr 










85 










90 










95 




Ser 


Asp 


Leu 


Ser 


Pro 


Lys 


Asn 


Val 


Gly 


Glu 


Tyr 


He 


Leu 


Pro 


Ser 


Glu 








100 










105 










110 






Lys 


Glu 


Thr 


Asp 


Lys 


Leu 


Lvs 


Ser 


Asp 


Tyr 


Lys 


Lys 


Glv 


Lys 


Lys 


Val 






115 










120 










125 








Glv 


Leu 


Leu 


Thr 


Ala 


Leu 


Ser 


Asn 


Gly 


His 


Asp 


Ser 


Asn 


Lys 




He 




130 










135 










140 










He 


Glv 


Pro 


Arg 


Asp 


Leu 


He 


Ser Arg Asp Asp 


Val 


Lys 


Asp 


Lys 


Ser 


145 










150 










i 

loo 










160 


Tvr 


Val 


Phe 


Lys 


Arg 


Leu 


Ser 


Lys 


Asp 


riO 


Leu 


Val 


Tvr 


Tvr 


Ser 


Ser 










165 










1 *7 fi 
x f U 










175 




Ala 


Thr 


Ser 


Lys 


Tyr 


Val 


Arg 


Lys 


trne 


C a -w 

oer 


Pro 


Phe 


Arg 


Ala 


Lys 


Lys 








180 










103 










190 






Phe 


Met 


Thr 


Ser. 


Thr 


Gin 


Leu 


oiy 


Ser 


Lys 


Leu 


Val 


xyx 


Pro 


His 


Pro 






195 




















205 








He 


Arg 




Gly Thr 


Ala 


Phe 


vax 


Leu 


fro 


lnr 


Gly 




Val 


He 


Asn 




210 










215 










220 










Lys 


Ala 


Tvr 




Met 


Asp 


Asn 


(jJLU 


Asp 


lieu 


HIS 


Thr 


Trp 


Asn 


Pro 


Pro 


225 










230 




















240 


Ser 


Ser 


Ser 


Val 


Leu 


Val 


Pro 


Asp 


Ser 


Asn 


Asn. 


Asp 


Arg 


Leu 


Thr 


Val 










245 










OCA 










255 




Glu 


Cys 


Ala 


Lys 


Thr 


Asp 


Pro 


l nr 


ixe 


oxy 


lie 




Gly 


Phe 


Glv 
vjxy 


Glv 








260 










£ DO 










270 






Ser 


Asp 


Asp 


Asn 


Arg 


Arg 


Ala 


Lys 


CjlU 




uxy 




Val 


Glu 


Met 


Leu 






275 




















285 








Leu Cys 


Asn 


Cys 


Asp 


Asn 


His 


Lys 


Asp 


Leu 


Leu 


Lys 


Ala 


Pro 


Leu 


He 




290 










-7 — > 










300 










Thr 


Glu 


lyr 


Ser 


Thr 


Asn 




Thr 


Glu 


He 


Gin 


VOX 


Asp Val 


Ala 


Ala 


305 










310 










315 










320 


Lys 


Arg 


val 


Leu 


Phe 


prn 


mo 


Pro 


Gly 


Ser 


Glu 


Pro 


Val 


Lys 




OCX 










325 










330 










335 




Gin 


Val 


X HI 


Ser 


Ala 


ai a 

nick 


Hi cr 


Gin 


Leu 


Asp 


Gly 




Thr 


Gly 




His 








340 










345 










350 






Asp 


He 




His 


Glu 


r XU 


VaJL 


Lys 


Leu 


Ser 


Asp 


*phr 
x US. 


Gly Asp 


Tyr 


Ala 
nld 






355 










360 










365 








Val 


Gly 




Pro 


He 


Vdl 




Lys 


Pro 


Val 


Tyr 


tjxy 


Thr 


Ser 


Leu 






370 










375 










380 










Asn 


Leu 


Pro 


Glu 


Thr 


Gly 


Ser 


Pro 


Leu 


Ala 


Leu 


Asn 


Cys 


Pro 


Cys 


Thr 


385 










390 










395 










400 


Asp 


Lys 


Ala 


Asp 


Gly 


He 


Tyr 


Gin 


Val 


Asn 


Gin 


Lys 


Gly 


Gly 


He 


Leu 










405 










410 










415 




Tyr Arg 


Asp 


Met 


Val 


Gly 


Tyr 


Leu 


Asn 


Ala 


Asn 


Pro 


Val 


Glu 


Ala 


Ala 








420 










425 










430 






Ser 


Leu 


Ser 


Ser 


Ser 


Asp 


Ser 


Ser 


Ser 


Trp 


Leu 


Thr 


Thr 


Gly 


Asn 


Lys 






435 










440 










445 








He 


Ser 


Ser 


Val 


Thr 


Cys 


Glu 


Gly 


Glu 


Lys 


He 


Lys 


Lys 


He 


Val 






450 










455 










460 











<210> 58 
<211> 2361 
<212> DNA 
<213> SHRIMP 

<400> 58 

atggagtaca tggaagaagg agacatcgct 
ctggacgaaa actctgcttg tgtagttaat 
atggacgccg aggaggcaca gtacgcacag 
atccgtctgg cccactgctc cgaatccaac 
ggtcttttca tcaacaacat ctttgacaac 



gaaaggcgct cagaaggtgt cgactatatt 60 
gtgaagagta tccgtaacag gctcggtgcc 120 
gacatttccg cccaacttgt cacccatatt 180 
aagattaagg ataccattgc cagtattgcg 240 
aattcaacaa agaacaaact taaaacgtat 300 
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aatcaattca 
gatcctctga 
aatttggaca 
attgacgata 
aaggaagcca 
aaccataata 
atgagatact 
tgcctcgaca 
gctgattcag 
gtagacagag 
gtatctggag 
gcgattgaaa 
tttttgagct 
ctaaaagcct 
atttccggag 
gacatctctc 
aaggctagca 
tatatcccgt 
cacaacatca 
ttacacattc 
gatcaaactg 
aagcaggggt 
actacttcat 
tacgtcatct 
ccccattcta 
cccgacactc 
aatctttctg 
gacaaactca 
gctatcatta 
tcttcctcag 
ctccaaagta 
agtgcagtaa 
tctgaagaac 
agcctgatcg 
atcagagcta 



aggcagagtc 
gtatgctttc 
aatctttggg 
ttgtccttct 
tccgccagga 
ggttcaattt 
ctcaggacga 
gggatgatcc 
ataacgcttg 
ccgttattgc 
acagcttcaa 
ttttgttctg 
atgttaatgc 
accgttcatg 
gatggaagaa 
aagtcatcaa 
acgttacatc 
ttggaatact 
gccgtccagc 
taccttcaat 
tcaatgtttt 
cggggaagat 
tccctagtag 
tctcttcaac 
atgttctgaa 
gattccactt 
ctctaactac 
ccaccagagc 
agaacgtttc 
cgcctaactt 
ttcgcaaatt 
acaatgggaa 
tgttcgagct 
cctcatctga 
atgcttctta 



acaaaacaag 
tagcttcatg 
tgtgctcttt 
ggaaatgtgc 
acaacccatg 
tggaagcgac 
acgcgcagtc 
agctcatact 
ggtttcttcc 
tcacttttac 
gcagtttgtc 
tgatactgaa 
catggttaac 
ggtggtggat 
gaactacccc 
tcatcttgca 
tggcctggat 
tgaaaataaa 
aagagaacaa 
taagggctgt 
tgataatttt 
tcttggactg 
tgaagcggaa 
acccatcaac 
ctcccctatc 
tctgctcatg 
aagtcagcta 
gttcttcaat 
tggaagcggc 
tttccaaatc 
cattggagaa 
cagaggagta 
gctctacaaa 
attcttgagc 

g 



tctagcgttc 
ggttctgatc 
gaggtgcttc 
ccatccaagt 
gaagcaatgt 
ataaagtcag 
gttgtgcctc 
ctttcctctt 
ctgtttgcag 
gtgtacacaa 
tataccgtat 
aattcgtctg 
gtatccgtgc 
caagcatccg 
tcacctgacc 
tcaccttcta 
agtatcaggt 
gcagggtatg 
tccaacggaa 
gaggcacttg 
gtcgcatctc 
ctcactagca 
tacaagaaga 
gacgaactag 
agcttgagaa 
atgtggcagc 
gaattgttgc 
atcgacagga 
ttcctagatg 
ttcagtggtg 
tctatgcagc 
gaaaattatg 
ttgatcatcg 
aactacgtca 



tcaatatctt 
cagcaaagag 
aaaattacaa 
gcgccgcctg 
tgttgttttt 
catacgcctc 
tgaggagtat 
tcggggatac 
ccgtctctag 
tgctcagccg 
ttgttcgtat 
tagaatgtga 
tgggttctac 
tcgcaccggt 
acatcaagag 
gaatggttaa 
ctgttcgtca 
gtgtaataaa 
ggaactttaa 
gggcacaaaa 
atatggatat 
tgattgacag 
gaatccatga 
tcaattctcg 
atattgaccc 
ggccaaatat 
ttagcaagaa 
tcaatttcca 
ggagtaaaac 
ctgaatgcac 
atgtacaaaa 
acggactcaa 
aggaggatat 
acgccatgga 



tggctctcta 
tggaggggaa 
cccttgcaag 
caccggtctc 
caaatgtatc 
tgaaacatgc 
cctcctcggc 
tatcgagtat 
aatgcctatg 
acataggcga 
gatttactct 
tggaaagcac 
gtttaacgta 
tctagacatt 
ggtggcgtac 
aggtaacaac 
agcagaaaaa 
cattgccaag 
ttgcaacgct 
ggggagcgca 
tgccatgaaa 
gcaaggtctg 
tttcacaaga 
ctgtattctt 
agaatcagtc 
cgatgaacct 
ccaaaaatgg 
gatggcagac 
tgcctcttct 
tgcaaagcag 
ggaatggagt 
tgctcagttc 
gcggccatcc 
tgaacttctt 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2361 



<210> 59 
<211> 778 
<212> PRT 
<213> SHRIMP 



<400> 59 



Met 


Glu 


Tyr 


Met 


Glu 


Glu 


Gly 


Asp 


He 


Ala 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


1 








5 










10 










15 




Val 


Asp 


Xyr 


He 


Leu 


Asp 


Glu 


Asn 


Ser 


Ala 


Cys 


Val 


Val 


Asn 


Val 


Lys 








20 










25 










30 






Ser 


He 


Arg 


Asn 


Arg 


Leu 


Gly 


Ala 


Met 


Asp 


Ala 


Glu 


Glu 


Ala 


Gin 


Tyr 






35 










40 










45 








Ala 


Gin 


Asp 


He 


Ser 


Ala 


Gin 


Leu 


Val 


Thr 


His 


He 


He 


Arg 


Leu 


Ala 




50 










55 










60 










His 


Cys 


Ser 


Glu 


Ser 


Asn 


Lys 


He 


Lys 


Asp 


Thr 


He 


Ala 


Ser 


He 


Ala 


65 










70 










75 










80 


Gly 


Leu 


Phe 


He 


Asn 


Asn 


He 


Phe 


Asp 


Asn 


Asn 


Ser 


Thr 


Lys 


Asn 


Lys 










85 










90 










95 




Leu 


Lys 


Thr 


Tyr 


Asn 


Gin 


Phe 


Lys 


Ala 


Glu 


Ser 


Gin 


Asn 


Lys 


Ser 


Ser 








100 










105 










110 






Val 


Leu 


Asn 


He 


Phe 


Gly 


Ser 


Leu 


Asp 


Pro 


Leu 


Ser 


Met 


Leu 


Ser 


Ser 






115 










120 










125 








Phe 


Met 


Gly 


Ser 


Asp 


Pro 


Ala 


Lys 


Ser 


Gly 


Gly 


Glu 


Asn 


Leu 


Asp 


Lys 




130 










135 










140 










Ser 


Leu 


Gly 


Val 


Leu 


Phe 


Glu 


Val 


Leu 


Gin 


Asn 


Tyr 


Asn 


Pro 


Cys 


Lys 
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145 150 155 160 



lie 


Asp 


Asp 


He 


Val 


Leu 




Glu 


Met 


Cys 


Pro 




Lys 


Cys 


Ala 


Ala 










165 










1 / V 










175 




Cys 


Thr 


Gly 


Leu 


Lys 


V3l Li 


Ala 
nia 


Tip 
1 lc 


Arg 


V31 11 


OX Li 


r» 1 n 

V?1U 


riu 


Met 


Glu 


Ala 








180 










1 O «J 










1 7\S 






Met 


Leu 


Leu 


Phe 


Phe 


Lys 


Cys 


He 


Asn 


His 


Asn 


Arg 


Phe 


Asn 


Phe 


Glv 


























one 

^ U J 








Cor 


Asp 


Tip 

■Lit; 


Lys 


Ser 


nlo 


Tyr 


Ala 


OCX 


o i u 


i nr 


Cys 


Met- 
ric L. 


Arg 


Tyr 






91 ft 










£lJ 










*?:?n 












Asp 


ft! n 

UlU 


Arg Ala 


Val 


V al 


V oil 


Pro 


Leu 


Arg 


oer 


lie 


Leu 


Leu 


uiy 


<l ^ o 






























Z. fi u 




L6U 


Asp 


Arg Asp 


Asp 


Pro 


A 1 si 
aid 


Hie 


i nr 


Leu 


OCi 


del 


ir jie 


uiy 


Asp 










245 










^ jU 














Jl XIX. 


Tip 

lie 


olU 


Tyr Ala 


Asp 




Asp 


Asn 


Ala 


Trp 


W a 1 
Vdl 


oer 


ocl 


Leu 


rile 








260 










ZOO 










£. / U 






Ala 
nld 


Ala 


v ct J. 


Ser 


Arg 


Mo-t- 


irro 


nec 


v ai 


Asp 


Arg 


Ala 

Aia 


Val 


i ie 


Aid 


HIS 






^ / «j 




























Phe 


iyi 


vox 


Tyr 


Thr 


nc I— 


Leu 


Ser 


Arg 


n i o 


Arg 


Arg 


Va 1 
vdl 


iJCl 


uiy 


Asp 
























mn 










Ser 


Phe 


Lys 


Gin 


Pne 


Va 1 
vol 


Tyr 


X lit. 


Va 1 
vox 




Va 1 
vai 


Arg 


fit; u 


l it 


Tyr 


OCX 












OIL* 










jl j 










•ion 


Ala 


Tip 
J. ic 


it 

UiU 


He 


Leu 


IT He 


Cys 


Asp 


i nr 


bill 


Asn 


JCl 


Car 

oer 


vai 


^i ii 

ulU 


Cys 










325 










jjU 
















oiy 


Lys 


His 


Phe 


Leu 


O t5 1 


iyr 


Va 1 


Asn 


Aid 


nc l. 


Va 1 
val 


Asn 


Va 1 
v dl 


O trx 








340 




















JjU 






vai 


Leu 




Ser 


Thr 


rile 


Asn 


vai 


Leu 


lays 


Ala 


Tyr 


Arg 


Ser 


Trp 


vai 
















JDU 










"3 fit: 
ODD 








Vai 


Asp 


Gin 


Ala 


Ser 


Vdl 


Ala 


Pro 


Vdl 


Leu 


Asp 


i ie 


T 1 a 

i ie 


Ser 


Giy 


Giy 




O /V 






























Trp 


Lys 


Lys 


Asn 


Tyr 


rxo 


Otrl 


Pro 


Asp 


HIS 


i ie 


i»ys 


Arg 


vai 


aia 
Aia 


Tyr 


ODD 










jyu 




















400 


Asp 


lie 


Ser 


Gin 


Val 


lie 


Asn 


His 


Leu 


Asp 


Ser 


Arg 


Met 


Val 


Lys 


Gly 










405 










410 














Asn 


Asn 


Lys 


Asn 


Val 


i nr 


O A V 


Gly 


Leu 


Asp 


Ser 


T 1 A 

lie 


Arg 


Ser 


vai 


Axg 








420 










4 
















Gin 


Ala 


Glu 


Lys 


Tyr 


lie 


Pro 


Phe 


Gly 


He 


Asn 


Lys 


TV 1 

Ala 


Gly 


Tyx 


Gly 






435 










4 4 U 










/IXC 

4 43 










Tl 0 

lie 


Asn 


He 


Ala 


Lys 


fllS 


Asn 


T 1 a 

i le 


oer 


Arg 


Pro 


Aia 


Axg 


ulU 


r*l -r* 

Gin 














A 

4oo 










4 bU 










Ser 


Asn 


Gly Arg 


Asn 


rne 


Asn 


Cys 


Asn 


Ala 


Leu 


HIS 


lie 


Leu 


Pro 


oer 












Aid 










A 1 C 










4o0 


Tic* 


Lys 


Gly 


Cys 


Glu 


Ala 
Ala 


Leu 


biy 


Aid 


bin 


Lys 


Gly 


t>er 


Aia 


Asp 


Gin 










485 










A on 

4 y u 










4 yo 




inr 


Vdl 


Asn 


Val 


Phe 


Asp 


Asn 


rne 


va l 


Aid 


O a 


HIS 


Met 


Asp 


T 1 A 

ne 


Aia 








500 










jUj 










OlU 






Met 


i»ys 


Lys 


Gin 


Gly 


Ser 


Gly 


Lys 


i le 


Leu 


Gly 


Leu 


Leu 


Thr 


Ser 


Met 






515 










con 


















He 


Asp 


Arg 


Gin 


Gly 


Leu 


Thr 


Thr 


Ser 


Phe 


Pro 


Ser 


Ser 


Glu 


Ala 


Glu 




530 










535 










540 










Tyr 


Lys 


Lys 


Arg 


He 


His 


Asp 


Phe 


Thr 


Arg 


Tyr 


Val 


He 


Phe 


Ser 


Ser 


545 










550 










555 










560 


Thr 


Pro 


He 


Asn 


Asp 


Glu 


Leu 


Val 


Asn 


Ser 


Arg 


Cys 


He 


Leu 


Pro 


His 










565 










570 










575 




Ser 


Asn 


Val 


Leu 


Asn 


Ser 


Pro 


He 


Ser 


Leu 


Arg 


Asn 


He 


Asp 


Pro 


Glu 








580 










585 










590 






Ser 


Val 


Pro 


Asp 


Thr 


Arg 


Phe 


His 


Phe 


Leu 


Leu 


Met 


Met 


Trp 


Gin 


Arg 






595 










600 










605 








Pro 


Asn 


He 


Asp 


Glu 


Pro 


Asn 


Leu 


Ser 


Ala 


Leu 


Thr 


Thr 


Ser 


Gin 


Leu 




610 










615 










620 










Glu 


Leu 


Leu 


Leu 


Ser 


Lys 


Asn 


Gin 


Lys 


Trp 


Asp 


Lys 


Leu 


Thr 


Thr 


Arg 



625 630 635 640 
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Ala Phe Phe Asn 

lie Lys Asn Val 
660 

Ser Ser Ser Ser 
675 

Glu Cys Thr Ala 
690 

Ser Met Gin His 
705 

Asn Arg Gly Val 

Phe Glu Leu Leu 
740 

Ser Leu lie Ala 
755 

Asp Glu Leu Leu 
770 



He Asp Arg He 
645 

Ser Gly Ser Gly 

Ser Ala Pro Asn 
680 

Lys Gin Leu Gin 
695 

Val Gin Lys Glu 
710 

Glu Asn Tyr Asp 
725 

Tyr Lys Leu He 

Ser Ser Glu Phe 
760 

He Arg Ala Asn 
775 



Asn Phe Gin Met 
650 

Phe Leu Asp Gly 
665 

Phe Phe Gin lie 

Ser He Arg Lys 
700 

Trp Ser Ser Ala 
715 

Gin Ala Gin Phe 
730 

He Glu Glu Asp 
745 

Leu Ser Asn Tyr 
Ala Ser 



Ala Asp Ala lie 
655 

Ser Lys Thr Ala 
670 

Phe Ser Gly Ala 
685 

Phe He Gly Glu 

Val Asn Asn Gly 
720 

Ser Glu Glu Leu 
735 

Met Arg Pro Ser 
750 

Val Asn Ala Met 
765 



<210> 60 
<211> 18234 
<212> DNA 
<213> SHRIMP 



<400> 60 

atggaccagt 

caacaacaac 

tacagtaaca 

aggtcaatcg 

cgtgtccaat 

ggtagttctg 

cccaactaca 

tcaactagtc 

agtaatagca 

tgtttttcca 

aacgccgcca 

gcagcatgga 

aagtacagtc 

cagtcacaat 

gatatcctaa 

gatactgctc 

gcagccgcag 

aatagcaggc 

tcaaaatatt 

gctaaga tag 

agaagagcag 

acttccatta 

gagtatgaat 

ggaagtt tat 

gctatcgaaa 

ccttcctctc 

cacgtttcag 

ttgagtatca 

agggataggc 

tccacccctt 

tctggttgcc 

attagtggtc 

tcggctgcag 

tcacccatga 

atcaagaagg 



acccagaagt 
aagcagcaac 
ctgtttctgc 
cagcttcgtg 
tccatccact 
ttgtgttttt 
ttgcagtgcc 
tataccaatg 
aacatagtca 
aacaaaaaaa 
aggcattttc 
acgcgggaag 
ccgatagaaa 
caggtacttc 
aacgtcacaa 
tttctagggt 
attcagatca 
gtgagggtat 
ctgtcgcaga 
ggacaaagat 
atacttcagt 
aaaataccgg 
tttctctcca 
tgagctgtgc 
ggattatacg 
tttcagatca 
atcctaagga 
ttactagatc 
ttcttgttaa 
ctcaaagaag 
tacctatcat 
ttgtgggtga 
ctattgccgc 
tgaaaaaatt 
ctatgcgtag 



gagggatact 
aacaacagct 
agaaacttta 
ctggatcaat 
tagttccacg 
gaagcctaga 
tactcttcgt 
ttcaatgttc 
attcggtgat 
tgtagaaaat 
tcatgcagtt 
tgctgcaaac 
atacaaagca 
ttcttcctca 
gggaacgtcc 
ttttaccgag 
cctatccgcg 
tctagattca 
gtacttgatg 
tgcaacagat 
cgatgatctc 
cgatattaat 
catcacacag 
tttcggtgta 
taaaacagat 
atatctcttg 
tatcgttttc 
taaaaatgat 
cgataccgtg 
agtccccacc 
caggggaccc 
ttggtatatc 
tggccaccag 
ctcaaagaag 
gaatgcagat 



cctcagacag 
gccgctgctg 
tccgccattt 
aaccttaacc 
accacatacg 
gccctcccca 
gctgcgtcag 
aattcgtgga 
cgtgtcatca 
cttttaaagg 
caacagaaat 
ctcgaaaaat 
ggaggtcttt 
gttgaacata 
cttgatttag 
tttaaggaac 
tcggacccta 
gtgccaaata 
gcagacagag 
tttgaagcat 
aaagaatcac 
tcggtcacaa 
cttttcgcgc 
cagttcccct 
cctgatactg 
ttggtaggaa 
ggtagacaag 
aagaatgaaa 
ttgagagacg 
gccgcaggtg 
caggtagtga 
tctttggggg 
agagctctcg 
ggaggaaagt 
cgttctgcta 



aacaggaaca 
ctgccgctgc 
ctgaagatgg 
ctgatgaaaa 
attcagaaaa 
ccgggggcac 
aaattatcga 
atcttattcc 
agagatcaat 
aattgcgcag 
cagcagtaaa 
tagtagattt 
ttagtgcgtc 
cttctaatga 
attcagcaac 
aggctagagc 
ttttctctat 
tcggtatgtt 
acgagtctgc 
tgagaggaga 
tagcggattc 
atattcccac 
aggcattttt 
tctcagatga 
gaaaggtttc 
atttccaggt 
ttacgcctaa 
cgtctactat 
ccacgcaaaa 
aaccaaaaaa 
ccagggaaag 
tctattatgc 
cttcagccga 
atacagagga 
ggattt tggc 



ggcggcggcg 
tcctacgcag 
aaaattggag 
aatggctcaa 
tgtgaaccct 
gtgtttagcc 
ttctattgca 
catttttatg 
gatccgtaat 
gagaaaggtt 
tactgccctt 
ttgcaaattg 
agcaacagct 
tttcttactc 
aaacacattt 
agccgtagat 
tgttcgacac 
ggctccacga 
agatatcgct 
taataataag 
tattgaaaaa 
agatacagag 
ggaaactatg 
aggttttgcg 
agaaatggac 
ttctcccttc 
tacgcctatt 
tattaatttc 
cgtctcaact 
gccaatgctc 
tgatgatatg 
catgggttca 
gtcaattaat 
ggaaaagcga 
tttgttgggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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caaactgatg 
tcgaatgccg 
gcagttagga 
gagagatatc 
atgtttgatc 
gccaggaacg 
cgcgtattca 
ccagaacagg 
gccaataaga 
ggaatctctg 
tctcgcgtca 
attccagtta 
tgtgtggaca 
caactttgcg 
gcaggtaaag 
cttttgcgtt 
atgcccctca 
atcagaggca 
tccgataggt 
aagactcaag 
accaacaagg 
ccaattgcta 
atcactgaga 
attgaagagt 
aattttgtgt 
gcattgtcca 
cctttgatcg 
aaaacaacaa 
gccaatacta 
actgccatca 
cacttggggg 
aacgttgcga 
gggttaaccg 
ctcgtgcaag 
tctaggtccc 
ttcaaaattg 
agacagctat 
gctaggattg 
cccggaactt 
gtatggtcta 
atttctcacg 
cttaacagcg 
aaattctttg 
actcttttgg 
agtaacgata 
cttcaagttg 
actgacgtat 
tcatctttgt 
cccgctcttc 
gctcctcttt 
tacgaagttg 
agaaagatta 
cgcattgcta 
acagtaaacg 
tcaagattgg 
tatgatggtg 
agattctggt 
gctggtcatg 
gtcaacacct 
gcgactcagg 
accgatgtgg 



cccagtatgg 
ctattcgtgc 
aacaattgga 
ttagagactc 
aaggttttga 
cttggcaaca 
acattgaagg 
ggcgtaaaaa 
tcattaacgg 
atattattgg 
aagctcttaa 
gtcgagcagt 
t tccttcagt 
gaggagggat 
aacatacagg 
atatttggtt 
acacaatgtc 
tggttaataa 
acaattgcgg 
gtgaattgtt 
gaggcgtggg 
atgcggatgt 
ccatcaactt 
atcttaggag 
atccaaatat 
gtaatctcga 
taaaggctag 
gaacagcagc 
ttcctcaatt 
atgtcgttaa 
ttatgggtaa 
acaagcgttt 
tatctctcat 
atttcgctaa 
agaagagttt 
atagggaaat 
ctaagaatgc 
ccaagtcagg 
cttcctccgc 
tgggagctct 
atcaaatgtt 
a ttgcaaacc 
gcctttcttc 
cttcacctgc 
t tgtgcataa 
gtgcctcagc 
attcccttgt 
cgtctgaggg 
tcggtaccgc 
cagccgtgtt 
cagctgtttg 
tgggaatggt 
gtcttgttag 
attacaaaaa 
gaggaatatc 
ttgtttcttc 
cgggagtatt 
gaggtagtaa 
acgcagctct 
aaagaaagaa 
tggggggaga 



gtatgtcgaa 
aaaggcaaag 
cggaaaatgt 
atttttcagg 
tatgggaaga 
gtacgcagaa 
tcttttcagt 
gactgtaggt 
catcacagag 
ttcagttagc 
aacattgtcc 
aaagtgtgcc 
cattattgca 
gaacctcgct 
aaaggaatcc 
caacgcagcg 
cgagggcact 
ttacaacagt 
tagtggaaat 
aactgtcctt 
tgcaaccccc 
agttaagaac 
cttttcattt 
atataggcta 
tgcagctatt 
tactgatcgt 
caagggttat 
agaacaagct 
agtaaacagg 
agctttcaca 
cgcgattaat 
aacagttaac 
tcttgctcac 
gcttttactg 
cttccccatt 
gttcgataat 
taccgccgcc 
tgaaatttat 
tttaaccttg 
cccccatttc 
ccgcctatct 
agaagaatgg 
ggtgagcgat 
agagatttgc 
tattggatcg 
agtagaaaaa 
ttctgctctg 
acatctcacc 
taaaactaca 
ggactttaga 
tagtgtcatg 
ggaacaagaa 
tacagttgct 
ttatctcatt 
ccctactagt 
ctcatcatca 
ttctcagtgc 
catgttccaa 
gagctctggc 
taggattgca 
tactttggat 



cataattcta 
gaggatgctc 
tcctcttcaa 
tcagttaata 
tttgccgact 
gtaatgagag 
gctctaaatt 
ggaaggcata 
atgactctcc 
gatgggtggg 
aatttcagcg 
gcgggaagtc 
aacctaatct 
cacgaaatca 
gttttcttgt 
gttgtttccc 
ggcgatgaca 
agtctctcct 
acttcaacca 
caacagactg 
gatgctgcca 
accaacgtgg 
ttgtctcaga 
ggagaaggac 
gttaagcgag 
ccaattacta 
gcctctaacc 
cagatggagc 
ttgaccatcc 
gaaaatggag 
gaaatgcaac 
gtgggttcag 
tcaaaggcta 
gcagtcactg 
cctccttcag 
atgaagacag 
atcgaaaggt 
aacaaggatg 
ttcgccaata 
gatatggccg 
acatattatc 
gataattctc 
aacaaccgtt 
gatctggtga 
aattccaaca 
tacgacgagt 
gctaagagca 
tctaaggaga 
tcctattctg 
aggaatgttg 
agtaaagagg 
tcgcccgtta 
acccccaaac 
agaaaagttg 
ggtaacaccg 
atgaccccct 
ctagagactg 
atcactgcac 
gttgagcggc 
aagagcatcg 
caattgcgta 



ccctagattc 
ttagccgtgc 
gggatgaata 
ggtcaggagg 
ttttgagtga 
gactttctaa 
ctttcaagtt 
ggcttaacaa 
agtcagccat 
gaaatactac 
gaaacggaaa 
gaggtgggga 
ctgataagag 
caaactttat 
ctcctagatt 
ttacggatag 
tttataggga 
ctatttcagt 
gtaataagaa 
ctaatgcctt 
atatggccaa 
ttgtttcagg 
tcaaaacaat 
tagataagaa 
aattgggggt 
tcgacctgaa 
gctacgctaa 
agtataatgc 
ctggatccat 
aatttagtaa 
ctcttttcac 
ttagtaagct 
gcccctatgt 
cagagacttc 
tattttcttc 
attatgtagt 
gcaatgattc 
ttgcatcaac 
atctccagaa 
tggtaccaaa 
agggtattca 
ttcctggaaa 
cattcaattt 
cgagggaaat 
cggacgcgct 
ctactctttc 
aatctcctct 
ttgataggac 
tttctgaaga 
tggatgctac 
aggatgtgcg 
tgcaagacat 
agcatcgcag 
catcgaatcc 
attacaacct 
cgtccatgtc 
gcccttcaat 
ctaaacttta 
tgagagactc 
aagctctgga 
aggcccagaa 



gttctggtct 
agaaatcttg 
ttccatggta 
aggatatgaa 
caactctgct 
gcatgagaaa 
cccccttgtt 
tttgaaggcg 
cgatggtact 
agctcagccg 
tgttgtgtca 
aactctcaag 
aattctagat 
cgagacgatt 
gtctgtcatt 
caacattaaa 
ctatttggcc 
caaggctatt 
tgtgaccatt 
gtccgccttt 
cgttatttcc 
gctagatagg 
gaacgagaac 
agaattggat 
aagtggttcc 
cactgaacag 
attattcaac 
acaaatggct 
cacggcagac 
cgcagaaaca 
ggacggattc 
gattcagaat 
ctttaagcct 
tctggttgtc 
aggtggtctt 
ggaagtaatt 
cgattcagct 
cactgcagct 
ccctgcaaag 
acttcatggc 
taagatggaa 
tagggctagc 
ggcattggat 
ggtaaagacc 
tcaaaagagc 
tactaaagaa 
atcttcttcc 
atggaacacc 
cgctctcaat 
taaatctctg 
ttcttcgagt 
tggcattgac 
attcttacag 
ccttctctct 
taaagctgta 
tgtctctgac 
gtttgccgat 
cggttctaga 
tatttcttct 
aacgttcgta 
catgtacaac 
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aaactgtcag 
aaaatcatga 
cttagctctc 
actcaatttg 
aatatccacg 
tataatgtgt 
ctcggcggaa 
ttctcccagg 
ttggttaaac 
tcttcaatta 
tcactttatt 
actattaaca 
aggacttcct 
gaggatgtac 
agggcagctg 
atggaatctc 
gaaatgatgg 
gatgatattg 
tcagatgaat 
tctccttatt 
aagcctctta 
ctctatgaag 
agagatcttc 
gaacgttcaa 
ggtttcctat 
cgaggcaaga 
ctcctcttct 
ttcagcaagg 
agagttgtac 
atcaccccag 
aagagggaaa 
ttgcccgacg 
aggtctctta 
atctttgacg 
ttgggtgata 
ctaatgaaag 
aaagacaccc 
gccggctatc 
gttgatcgtc 
attgtttttg 
atgacctccg 
tctcagcgtc 
cacactgaag 
tctgattaca 
aattgtagcc 
acatcacgta 
tccattgttg 
gaggtaggaa 
gtcgattttt 
tttgttccca 
gtcaaggatg 
ggaaaattca 
aacctaactg 
acaccaacag 
ctcaaaacac 
gcaattgtag 
tccaagtcaa 
tccacatctg 
aatctgagaa 
catattagta 
ttaacccccg 



atattacttc 
agaatgtgac 
tcttgcacga 
ctctggcgag 
agaaggaaac 
cggcagctgc 
ccatgctcca 
tttctgataa 
tggtaggatt 
gacgtgtaca 
cagcatacag 
aacttttcgg 
tggtacatga 
ttgaggctcc 
cttctaggcg 
tttacgggga 
atatagagta 
cctttattga 
attccgattc 
caactacatc 
ctgccatcta 
aagaacaaga 
atgagagtga 
tgtcagaaac 
ctcattctaa 
aattcaggcc 
ctgacgaatc 
ggagaaaatg 
gtgcttttgt 
tgactagtga 
gggcgcgtct 
aatacgtaga 
tcaaggcagc 
ctctcaccac 
cacttttgtt 
acctaactga 
aaatgatggc 
taggtgtccc 
ttttggctac 
cacgttcatt 
ctttgacaga 
taaaagtaaa 
cagtcatggg 
cacagcgcag 
aattgccacg 
aatttgcagg 
atgctctttc 
agtggcagga 
gttcaaagaa 
atggaactaa 
acgcttcaat 
ttacagcctc 
aaaaactata 
aaatggccac 
cattcggggg 
ttagagccaa 
ggggaatcaa 
aatttgaaag 
ccattgagaa 
atgatggact 
tgtatgcgga 



caactctatc 
gagcaagaaa 
actcgctggc 
ccatgttatc 
attcagtcaa 
catgtgtcag 
acaaggcctg 
tattaaaatg 
ttgcggtaca 
gaacgaagag 
agatttgagg 
acatcaaaac 
cgctgtttcc 
ttccacggtt 
agtgaagagg 
agaagtcttg 
tggtgaggga 
ttccgaagaa 
atccgatgag 
ttattcgtat 
cgggtgcagg 
aaggaggaga 
tgatgacgac 
tattgcaacc 
tattcttaac 
ttctgcgtac 
taccacttct 
caagcgccga 
gcccaccaga 
aaatacagta 
gatcgaagaa 
gggtagagca 
tgcttatgtt 
aacatcaaac 
cgccaaacaa 
aatctctcct 
cgatgccaaa 
tctcagaact 
caaaaataag 
caacga tact 
catgtacagc 
gagtactctt 
tctcgtatac 
agagatgcaa 
aaaggatatt 
ttatggagga 
ttgcccctct 
tatgggagag 
caatatttct 
catggctgat 
gatcaggctt 
agccatgggt 
cgactcgttc 
tgctattatc 
agatattgcc 
ggaaatgcgt 
ctcgttctca 
gatactagaa 
tagactcgcc 
ttccgagaca 
tgacaccagc 



tatagtgatt 
atgaccgcta 
ctggtacaca 
aaggcaaagt 
ttgatggccg 
cgtctagttg 
ttcgtttcat 
aacgaattga 
gtttcagatg 
gataagaaat 
aagaagactg 
tttatgtctt 
ggccctaggc 
cacaaatcgt 
gctggactca 
aacgatatga 
ggattcatga 
gagtctgaat 
tatgattttg 
gatgctctag 
ggagaaggtg 
cgtcgctcat 
gatgactact 
agaagagctg 
cgtccggcta 
gatagattct 
tcttcctctt 
acaagcgagg 
gtaacaatga 
ggattctatg 
tacaaaattg 
tctaaacaag 
gcccgcaccc 
gccactctag 
ctagaggcaa 
tcacttttca 
cagatcgttt 
cttgcttcat 
aaccatctcg 
actttccatg 
gctttcacca 
ttagattcta 
cctacagcgt 
tcactcgctc 
ggagacactg 
gaaaggggag 
gacaatcggc 
gaaatcttct 
ctggaaaatg 
attggcatga 
aggcgcgcag 
aatttgtacg 
gttctgctcc 
aaccgtatga 
acctataaga 
aactctatta 
tcccgcagtg 
acatctgctg 
gaacactaca 
cgcgcagtcg 
gagagaggag 



tcggaaacat 
gacaacaatc 
aacaacaacc 
atgtcactaa 
tggccggtgt 
cttccgacgt 
tccttcttaa 
acgatgaaac 
cgctaggatc 
tagacaggag 
aactatacag 
acgaatcttc 
caagaaggta 
tcatggtttc 
gggctctggc 
ggtcttcggc 
tgatgattag 
catctactga 
atgatgataa 
accgtctgaa 
aagacgatga 
caaagatggg 
ttgatgacga 
gccgtattca 
aagcacgcgc 
ttatggagga 
ccgattctcc 
accaatgtgc 
tcaatggtcg 
aaaattacca 
ttaagggtgc 
tgtctcccag 
aagaaagtaa 
ttaatgaccc 
ttaccgagag 
catcattcgg 
caggaggaaa 
gtattaaggg 
aatggatgac 
cactcgaaga 
acctcgtcgg 
ttttcaacac 
tcatcaacca 
ttaacattct 
ctggcctgtt 
gtttgtctct 
tcaagggagc 
acaagaggag 
ccgtaggtcc 
ccgatatcat 
aagagggcgc 
gaggtattga 
aagattcaga 
agtcgaggaa 
acttcccatc 
gcactatcgt 
gttctacttt 
ttctttcaaa 
acaaactaaa 
ttgccgtaat 
catctgttag 



tgactgcgct 
agatactatt 
tcaattggct 
tgacctcaat 
tgccgattac 
aacaatgttc 
caacgtactt 
aaagtctctt 
taggcacgtg 
ttttgttaca 
ggaaactgat 
catgctcaag 
cagcaccctt 
ttacccagag 
tgataacagg 
ggtctcttcc 
tgatgatgag 
tttctcctca 
taatggccag 
ttctgccgct 
ggaaaatgac 
gaagatcctt 
atttgatggc 
atatggtcca 
tttcttgaca 
tgacgattcc 
attctcctcc 
ctttgttaag 
agttagcatg 
gaaggccaac 
ttcagctacc 
ggaactgagg 
cttgaatatt 
atctactctt 
gaggaatagg 
tgatgcaagt 
tttcaagtct 
cactaataca 
cacagccgct 
tacactaaaa 
atcggaacat 
taggatggct 
tgaaatgccc 
taggggagtt 
gacctttatt 
atacagaatg 
agtctctcta 
caacgatctg 
tattgctagg 
ttctagaaca 
tggcgcagca 
taccgttgtg 
ctcgttcaat 
acataaggct 
ctcctctgaa 
gatggacatt 
ggccaagatt 
tacaaaggcc 
acaattcagc 
tgctgaatct 
tgaactattg 
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acagacaata 
aaacgtcacg 
ctgctcgttg 
aaattgactg 
ataaagcaca 
atcaaggaca 
agaaatgcaa 
gacatttttg 
ggcgaagcac 
tctaacacaa 
ttctcaatgg 
caaaacaaac 
tcttcttcat 
caaacacaag 
gcgtctcata 
gttgttgcta 
ggaattctta 
ccatcatcat 
aaccagtacg 
aaggaagcag 
gaaatggtca 
caagctaaat 
gaggtttaca 
atggtttctt 
tcatcctctt 
attgatgaaa 
gctgccgttc 
gctattgaca 
tcagacacta 
gacaaagaac 
gacaagataa 
aacaacaaaa 
gatgtagcaa 
ctcttccaga 
gacatgagca 
cagattctcg 
ttttcaaagg 
tcttcagttg 
ataatgatgg 
aatactgtta 
aggctcacag 
aacgacgaga 
ttgatcaagg 
cagcagcttc 
agggttgttt 
cgtatttctt 
cccactagcg 
gactcttctt 
cctggactct 
tctcctcatt 
aacgctgcca 
gagtccaaca 
atcacccctg 
gttatccttc 
tccatggtta 
atcgtggaga 
gaccagaggc 
tcttcatctc 
ggcttcccag 
actatggaag 
caaacttata 



ctctcctcaa 
tgaccgccgc 
cctcagccga 
aaggaaatag 
aagaaacagg 
atgatcagcg 
ccatccgttc 
cacttgatga 
tcgcagagaa 
catcctcctc 
actattcaaa 
aatctaatgt 
ctgccaatat 
aatctaacga 
atcaagctgc 
taccaaatgc 
cccgcttctc 
catcatcatc 
ccaactcaat 
aaaggaaaat 
ccgtgtctaa 
cccttaagaa 
ccccagattc 
cttcctcggg 
cctctaatgt 
atactgaaga 
gcatccctct 
agttgaacga 
aacaggcttc 
aggtgatttc 
agatggctat 
ctaggagctt 
acttcaagga 
gcctgaaagc 
atggaagaga 
agggaggaat 
ctttggaggc 
tggagaggag 
agcgcaccga 
ataatgtaga 
agattaggaa 
ttgtagagtc 
tagatagagt 
ccaaattgac 
acgatccttc 
ctgtctataa 
atgccgactt 
ccacttcctc 
gccagcaatg 
cattcaattt 
cttcatccag 
atggttacgt 
ttaatacact 
ctacttctga 
acaggacaac 
ctcttgctcg 
gacacttcaa 
aattgaacac 
tgtttacagg 
gagctcgcaa 
tcgatatggg 



atttattgtt 
aattgaaggt 
catcaaccgt 
taactttgta 
tatctggctg 
tagagtagct 
tcgtctacag 
tgccgatatt 
ggcatctcct 
ttcttctccc 
caaccttgcc 
caattcttct 
tgataatgtt 
tttgagtaac 
aactgttggc 
aaacaaggct 
agcccctgaa 
aaatggagat 
cttatcatct 
cgatctggcc 
gtgcccctct 
aagtgccctc 
atctattgca 
atcaggatct 
gacagactat 
aggggcagaa 
tctagtatca 
atactaccaa 
ctcatgggcc 
aaagattaat 
tagtgttttg 
tattgaaact 
gattatcgat 
attcgctgct 
cacaaaatca 
caaactcatc 
tttccctaga 
acagcgagaa 
gagtctgagg 
agaaactgtt 
tcaagcacaa 
tcctctcacc 
aggaagtatc 
agctacagaa 
atacacctgc 
ttcaaagaac 
gcaactgatg 
caagaaaatg 
cgcaatgatg 
cgagaacaaa 
tgacggtcct 
caaagat ttt 
tctgggaggt 
gttgtttaac 
cgacaagaat 
cacgtctccc 
ctccatcacc 
tacttgtgat 
aagaaagcag 
ggacaagaac 
cacaaaattc 



caaaatgaac 
tcatcccaac 
atgtccgccc 
ccaatgacta 
aagaccgatg 
aaaaccatcc 
tctctttgct 
aagaatatgg 
tctagctcgg 
agttcttccc 
aaaactatcc 
gacgcatcat 
gagcacaaaa 
gtactttctg 
atcttcaacg 
aataataatg 
aatgtttcct 
gataataagg 
attgaaaacg 
atccaggcag 
gctaaccaga 
gaattattgg 
gccgtttctc 
gctccctctt 
ttcaactatg 
actgtccaga 
tatgctccat 
ctgattgatg 
atcaaggaaa 
aacttgcaac 
gacaacaaaa 
acaaagagcc 
tacgaaaaca 
gataactcgg 
gacagtaaat 
aatgagggac 
caa tccaacg 
cgtcttcagg 
aagaggttgg 
aatagtggta 
atcgctgaaa 
ctctctttgg 
caacagcagc 
cagagaaagg 
ttcctgcaac 
aagggtcctc 
accatcactg 
ctgtacgaaa 
atcaccaacg 
agatccctga 
gccgtgagac 
ggattcactc 
actttcagtg 
tgcccaggag 
gtggctgacg 
aacgccgagc 
gacgccatca 
caaaatctgg 
ggcgaaagaa 
agtggcatcc 
atggt tgctc 



tgaaaaacat 
tgcacgaaaa 
aaaataacct 
acgaccaagg 
aagaaaataa 
tcgcaattgt 
ttggaaaata 
acaaactcat 
ccatttcttc 
catcatcatc 
cctacatgcc 
cctcatcacc 
aagtggctct 
ttaccaccaa 
gaaggcaaca 
ccaccgtttc 
ccaccagcat 
taccagtaac 
catcagaatt 
cttccaccac 
ctgccatcac 
aaagagttat 
ttcccgttaa 
catcatcctc 
cttacggaaa 
aaaacatggt 
tcagcgaaat 
ccatcaaaac 
cggacaagga 
aaaacttttc 
ggaacgaatt 
gtatcgaggc 
catctgaaaa 
ggacagttta 
ttgtcgacat 
aaaatactgt 
gtgcttcaga 
ctgtcgagat 
cagattcggc 
tggttaacat 
gcactgcact 
gagcacgagt 
aacagcaaca 
aacaacaata 
ctcttcacga 
tcagtaacac 
acttgtctag 
atgttccctc 
tccacgaagc 
agcagctgac 
acgatgtact 
accgccaaaa 
gaaatgttgc 
ttgaaaatga 
cacccaagtc 
acctttactt 
tttctggtat 
taaacattga 
ggattgtgca 
cttcatgtac 
caggctctct 



tgaagaggca 
aatgttgagc 
cgaatgtaag 
tggtacattc 
caccagttct 
agaggacaat 
tgccatgaac 
tgaaaaacta 
ttcctcatca 
atcttcctct 
tatcgtcttc 
atcatcatct 
ccaacaactt 
gcacagattt 
cgcagagaca 
cgcaggccaa 
gcaattgcct 
tgtcaggctt 
taaggacttg 
agaaacaaag 
tgccatctct 
caaggcagtc 
tggagattct 
ctcctcatcc 
attgaagaac 
cgaacaagat 
gatgagacgt 
aaagatcgtg 
gcttgatatg 
aaacgaatca 
agagcttcag 
tggtggagga 
tgacaacaat 
cacccccact 
gtacaacaaa 
aaaggtagac 
gcctgtatct 
gtttatggca 
tgctcagtgg 
caagagtgaa 
aaactccatc 
cgaccagctc 
gcagcaacag 
cgctgcagat 
gacaattaaa 
acgtggtgtt 
gtctgtactc 
atcaattgtt 
cactcatact 
agaaatgttg 
aacaatgtta 
ggttgcgtgt 
acctaatact 
caaatttaga 
atctgcaagc 
ccccttcaag 
gagcggcgaa 
tcaaactact 
cactgaaaac 
aaaggaccgt 
tctgaatgct 
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aacaaggaag 
ggcactgatg 
gccgcctcct 
ggatctgtat 
ttgttgagca 
tgttcctcta 
cagcgcctgt 
atggatttga 
aaaatgacat 
tctaacgttc 
aggaataggc 
gtacaggcca 
cgccagaatg 
agtggccgtc 
tcccgcatca 
tcttccttct 
ggaaatgttg 
aacaagagat 
gccatccagc 
gacttgtgcg 
ttgaagaggt 
accaacaatg 
atgaacaaca 
ctcgttaggc 
aggagcatgc 
cctcttctca 
gacagactac 
ccagccttcc 
catacaaagt 
ttcaagaaca 
tcccacaagc 
aacgagcatc 
agagccacac 
tccaagatcg 
gaaatgagaa 
aactctggca 
agaagtgtac 
ggaactgtcg 
tctgaactga 
gctcaaactg 
atcgtgaacg 
accaaggaaa 
ggagtagaat 
atctcagaag 
ccggaaaatg 
tcaaagggag 
tctacctatt 
gccatctcta 
aatatttctt 
gctatcaaga 
acacccgcat 
cttctttcaa 
tcagttgatc 
gacaca ttca 
acagctctct 
cttaccacct 
gtgggtgcag 
tctaccgtca 
tctgttacac 
gttaacaaca 
gtatccatgc 



aaactctccg 
ttcatgtggc 
cattccctga 
ctgccatctc 
ccaacacgtc 
attacttggg 
tccctaacgg 
tgggaagtac 
ctactaacaa 
ctgctatcaa 
tttctcccct 
tgaccgatac 
gcagaaatac 
catcactctc 
atagagacgc 
ttaactcttt 
cagcagcgga 
tcgagatgct 
gcaatacaat 
aaattgaaaa 
ctatgctgat 
ttcccctttc 
acagtgccaa 
acatgatgtt 
tgacccaacc 
ctgaaatctg 
taacttcagc 
ctgccgccct 
catctttcat 
ttgaacgatt 
aaaattgtaa 
tggtatcctc 
cctatcaggc 
attctagagt 
tgaactccaa 
tgacaacggc 
taggaatatt 
cttcagccat 
tccagaatgc 
tcgtgtttgc 
aactggccgg 
tgattgatag 
caatggatta 
ccatgaagaa 
taccattcgc 
atacattcga 
cttcctcttc 
agcaagtaac 
ctgactatac 
gagaattcag 
tagtcgaccc 
ccaaatcaaa 
ttagccatct 
tgtacagaaa 
tgcaacagga 
tccttaatca 
gaggaggaaa 
caacatcaaa 
aagatcctgt 
acaatactgg 
catctgaatc 



tctaaacagg 
aggcgcaaac 
cggagataag 
tgctcaaaaa 
tatccagaat 
gtctgctgaa 
agatgacgaa 
aaagcgctac 
aaaggggtca 
cactgccttt 
ttatgaagac 
tgctgtggat 
tcttctttct 
ttattcttcg 
tagcctcctg 
ccttcgttct 
aattgttctg 
tggaggagga 
gtcctctatt 
taaaattgag 
gactccagga 
tcttctcatg 
tgtaatggaa 
ggatagtgga 
aagagctctc 
cctctacaac 
ctatctagta 
cacctgcgct 
ggacaacatc 
tgagaaattc 
ctgccccttc 
tttcgcattc 
caacaagctt 
aacaggatct 
ctttaatgga 
aggagtcaac 
ggaatgtcat 
gccagccgtc 
tctgccaagg 
taatgttttg 
caccatcgct 
actaggctct 
tgaagatagc 
tgccgtctat 
ctcatgcgcc 
agaaaagaac 
taacactact 
tgatgctgaa 
taactgtcca 
cagaattgtt 
taaaggcgcg 
actagaaaac 
aaaacccatt 
tgtccaccct 
acaaatggac 
tcccaacact 
cccaatgggc 
ccccgtcaca 
tatggtagtg 
aattgatgtc 
atctggcctc 



ctttcagaca 
tctgcatgga 
gaatcggcta 
tctgccagtc 
ctggtcaagg 
tctacgttcg 
aacgttacaa 
aatgacgctt 
aattgtgaaa 
ggagcctttg 
agcaccaaat 
gctttgtccg 
ctccctactt 
gacatgaaat 
tcaatgggag 
tcttccatcc 
gggactattc 
aaaatggtcg 
ctccagatga 
actaggcaac 
ggcgtgggag 
tcacgtgtcg 
gctgtggata 
aagtcccccg 
accgcccgcg 
acccgcgaca 
aaacaagcta 
tctcacctca 
aaattgcaca 
ttgggaagat 
catctccacc 
gcccgcccag 
attagtgaca 
tccctcctta 
acatttgaac 
ctcgacgtta 
cgccagcacg 
ttccaggcaa 
aacagataca 
gaacaactta 
gaatctgtac 
gacgacctct 
gaaacaacat 
cacacactaa 
agcggccctc 
gccgaacaag 
cttcgtaagc 
ttcaaggata 
ccaaatacta 
tccttcttaa 
ttacacgaga 
tttttccaat 
aattgtagca 
attcttatta 
cccgatactg 
gcttcaatgg 
ttgtatcttt 
gacaccacag 
aaccccttca 
ttgaatgata 
gtcaccaata 



ttaacaacgt 
gaattggtga 
tgaaaaagat 
acattaacga 
aagctttccc 
ccactcaact 
ccgtttcaaa 
tcaacaacat 
atctactgaa 
aagaagcttc 
attcgtccaa 
ctgtttctac 
ctattacttc 
ctaacctcat 
acagccaagt 
ctgtcaccac 
tcgacaagac 
ccgggagtcc 
acgaaaatga 
tgagggatgc 
ccatttcttc 
atgcatccag 
gtttcaatac 
ttcccatggc 
ctctactgag 
ctcaaccaga 
aaagattcga 
tgctttcttc 
tgactgatac 
atggggacga 
acacttttac 
aagtctccat 
aacattacgt 
agaaggttag 
catcaagact 
ttgtcaaacc 
tgtgcaccgc 
ccgatggaaa 
tccaaaagag 
tcgccgatct 
cagaaagcgt 
tcaaatctaa 
ccaacaatgg 
tttccggcaa 
tcgcctttga 
gtgcagcagc 
atttggctcg 
tcctcaacga 
accaaaatgc 
ccattcttcg 
aagtagccat 
acggtctcag 
acaatgtcaa 
tggccctccc 
ccattgaaag 
cgaacggtgc 
cttcccacat 
aaaacgtcaa 
aggattctgc 
agtcgtgcaa 
ctggatgctc 



gagacattat 
ggtggtgaga 
gcttcttcta 
tcctactgct 
agaccctgtt 
cgcctaccgc 
tatctgccct 
ctttggctct 
atctgccatg 
atcttctgtc 
ccaacttgct 
tgttgtcggt 
tatcgcaacc 
caagacaatt 
agctgcaggt 
cagccaggat 
tgtggagatc 
tgaagctcgt 
actcgctcgt 
tttccaggat 
tggagcaagt 
cggtcttctg 
tactcctttg 
caaggaaatt 
cgaatcttcc 
aagggcagtc 
cggagttgac 
catggattcc 
tcaatgcttc 
atacgccatg 
tccctcagat 
ggaagaaatt 
gatgaacatg 
cgaatggact 
cgccctctcc 
aaataatgca 
cgacgccaag 
cggtaacgaa 
cacaatgaac 
tggaaaggtt 
atatgaaaac 
taataatgga 
tcccgtcctc 
ggcagctcgc 
tttccttctg 
tgccgtatcc 
agttttcgaa 
tatcgaacgt 
ctttgctcta 
taagaacatt 
ctatttgacc 
taattcgtcc 
gaatattgaa 
agaaaatttc 
cagacgctcc 
aagagccgct 
tcttcacgag 
ctatcattcc 
taggttgatc 
ctacttgcaa 
ttcttcttct 



13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
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tcctcatctt cgtctgatac 
ccccgtgtca caccagccgt 
tccagggcag atattgtcct 
attgctgccg tctccaaatt 
tctcagaaca acatcaacaa 
gtcgatccag tcactgggga 
ctatacactg cagcaaatgg 
aaaatgcacg ccaaggcttt 
gctaacggag tgcctcttac 
atcgcaggca tctctgaagg 
cgcctctcat ccgacatgga 
caaacattca agaaagcttc 
cccagaagcc aagaaacaaa 
gaggaatgtc gtagggtgat 
ttgaccggtg gctacggagg 
atgactggtc ttattggtgc 
ctgggaggtg tttctgctgc 
aacgggaaaa cactaccaga 
gattcagact ctattagcaa 
agggataaca acaatttggg 
ggggatttgg tttattctgc 
aacaatcgcg atcttaccaa 
attctagaaa atgttgcaga 
gtatctgctc aggaaatggc 
ctgtcatcag acgacaatgt 

<210> 61 
<211> 6025 
<212> PRT 
<213> SHRIMP 



<400> 61 



Met 
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Gin 


Tyr 


Pro 


Glu 


Val 


Arg 


Asp 


Thr 


Pro 


Gin 


Thr 


Glu 


Gin 


Glu 


1 
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Gin 


Ala 


Ala 


Ala 


Gin 


Gin 


Gin 


Gin 


Ala 


Ala 


Thr 


Thr 


Thr 


Ala 


Ala 


Ala 
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Ala 


Ala 


Ala 


Ala 


Ala 


Pro 


Thr 


Gin 


Tyr 


Ser 


Asn 


Thr 


Val 


Ser 


Ala 


Glu 
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Ala 


He 


Ser 


Glu 


Asp 


Gly 


Lys 


Leu 


Glu 


Arg 


Ser 


He 


Ala 
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Ala 


Ser 


Cys 


Trp 


He 


Asn 


Asn 


Leu 


Asn 


Pro 


Asp 


Glu 


Lys 


Met 


Ala 


Gin 


65 










70 










75 










80 


Arg 


Val 


Gin 


Phe 


His 


Pro 


Leu 


Ser 


Ser 


Thr 


Thr 


Thr 


Tyr 


Asp 


Ser 


Glu 










85 










90 










95 




Asn 


Val 


Asn 


Pro 


Gly 


Ser 


Ser 


Val 


Val 


Phe 


Leu 


Lys 


Pro 


Arg 


Ala 


Leu 








100 










105 










110 






Pro 


Thr 


Gly 


Gly 


Thr 


Cys 


Leu 


Ala 


Pro 


Asn 


Tyr 


He 


Ala 


Val 


Pro 


Thr 






115 










120 










125 








Leu 


Arg 


Ala 


Ala 


Ser 


Glu 


He 


He 


Asp 


Ser 


He 


Ala 


Ser 


Thr 


Ser 


Leu 




130 










135 










140 










Tyr 


Gin 


Cys 


Ser 


Met 


Phe 


Asn 


Ser 


Trp 


Asn 


Leu 


He 


Pro 


He 


Phe 


Met 


145 










150 










155 










160 


Ser 


Asn 


Ser 


Lys 


His 


Ser 


Gin 


Phe 


Gly 


Asp 


Arg 


Val 


He 


Lys 


Arg 


Ser 










165 










170 










175 




Met 


He 


Arg 


Asn 


Cys 


Phe 


Ser 


Lys 


Gin 


Lys 


Asn 


Val 


Glu 


Asn 


Leu 


Leu 








180 










185 










190 






Lys 


Glu 


Leu 


Arg 


Arg 


Arg 


Lys 


Val 


Asn 


Ala 


Ala 


Lys 


Ala 


Phe 


Ser 


His 






195 










200 










205 








Ala 


Val 


Gin 


Gin 


Lys 


Ser 


Ala 


Val 


Asn 


Thr 


Ala 


Ala 


Trp 


Asn 


Ala 


Gly 




210 










215 










220 










Ser 


Ala 


Ala 


Asn 


Leu 


Glu 


Lys 


Leu 


Val 


Asp 


Phe 


Cys 


Lys 


Leu 


Lys 


Tyr 



cttcaagtac 
tctctgttct 
cgaaaacatg 
caaggggctg 
caacagcaac 
tattgttatc 
aggaacatca 
ccccaccttc 
atctgaggga 
atcaattggc 
acctgtcatg 
agataaactc 
cgctattatt 
ccaagacgag 
agatgctgct 
aatctccgcg 
taacgcagct 
acatggaatc 
cctatacaac 
aagaattatg 
tagacaactg 
gcgtcaagct 
taccatttac 
ttgtgcgtct 
tcaatccctg 



gtcaggagag 
gatgcttcct 
aacgtgagat 
accaaggaag 
aacaacaacg 
accaatgcca 
tcattcaaat 
tttattggta 
atttccctca 
acgggggctc 
aagggatgga 
actcatcttt 
aacaagatgc 
gctgctctgc 
ctggccatgg 
ccagttagag 
atccgcaagc 
gtacacaaat 
actgatcttc 
caatctttgg 
acggacctta 
atccttaaaa 
cttacaacag 
ctgacagtcg 
aaccgccttt 



acaatacgcc 
ctaatctctt 
ttggtttcat 
aggttattaa 
gaaatgggaa 
cattccccga 
ggggagatat 
acccaaccgc 
ctgaagaaaa 
tgcgtgcagc 
acaacattgt 
tgagatcggg 
acgacagctt 
tcgttgccac 
tttctccagt 
gtattagcca 
gcctcaacct 
cagccaagac 
aagacgttgt 
gacttaaggg 
ttactgtacc 
tgctcatttc 
gtaaaaatgc 
gaggaagtgg 
attttcgggt 



tgtgaatctt 
ggacgtgttc 
gcccgagatt 
gcaaatggtt 
gaaaacaacc 
cactcgtcct 
caacgacaga 
cgccgcaaca 
acgcaagaaa 
cgccaacacc 
tcagcttcaa 
aggaattcca 
caagacattg 
cagcgatctt 
acgtccagaa 
cttgttgaaa 
acctacatcc 
acttttgctt 
ctctaacgct 
gaataatgca 
agaatatgga 
taaccctgaa 
tctcgcaccg 
aggaggaaaa 
ctag 



16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18234 
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225 230 235 240 

Ser Pro Asp Arg Lys Tyr Lys Ala Gly Gly Leu Phe Ser Ala Ser Ala 

245 250 255 

Thr Ala Gin Ser Gin Ser Gly Thr Ser Ser Ser Ser Val Glu His Thr 

260 265 270 

Ser Asn Asp Phe Leu Leu Asp lie Leu Lys Arg His Lys Gly Thr Ser 

275 280 285 

Leu Asp Leu Asp Ser Ala Thr Asn Thr Phe Asp Thr Ala Leu Ser Arg 

290 295 300 

Val Phe Thr Glu Phe Lys Glu Gin Ala Arg Ala Ala Val Asp Ala Ala 
305 310 315 320 

Ala Asp Ser Asp His Leu Ser Ala Ser Asp Pro lie Phe Ser lie Val 

325 330 335 

Arg His Asn Ser Arg Arg Glu Gly He Leu Asp Ser Val Pro Asn He 

340 345 350 

Gly Met Leu Ala Pro Arg Ser Lys Tyr Ser Val Ala Glu Tyr Leu Met 

355 360 365 

Ala Asp Arg Asp Glu Ser Ala Asp He Ala Ala Lys He Gly Thr Lys 

370 375 380 

He Ala Thr Asp Phe Glu Ala Leu Arg Gly Asp Asn Asn Lys Arg Arg 
385 390 395 400 

Ala Asp Thr Ser Val Asp Asp Leu Lys Glu Ser Leu Ala Asp Ser He 

405 410 415 

Glu Lys Thr Ser He Lys Asn Thr Gly Asp He Asn Ser Val Thr Asn 

420 425 430 

He Pro Thr Asp Thr Glu Glu Tyr Glu Phe Ser Leu His He Thr Gin 

435 440 445 

Leu Phe Ala Gin Ala Phe Leu Glu Thr Met Gly Ser Leu Leu Ser Cys 

450 455 460 

Ala Phe Gly Val Gin Phe Pro Phe Ser Asp Glu Gly Phe Ala Ala He 
465 470 475 480 

Glu Arg He He Arg Lys Thr Asp Pro Asp Thr Gly Lys Val Ser Glu 

485 490 495 

Met Asp Pro Ser Ser Leu Ser Asp Gin Tyr Leu Leu Leu Val Gly Asn 

500 505 510 

Phe Gin Val Ser Pro Phe His Val Ser Asp Pro Lys Asp He Val Phe 

515 520 525 

Gly Arg Gin Val Thr Pro Asn Thr Pro lie Leu Ser He He Thr Arg 

530 535 540 

Ser Lys Asn Asp Lys Asn Glu Thr Ser Thr lie He Asn Phe Arg Asp 
545 550 555 560 

Arg Leu Leu Val Asn Asp Thr Val Leu Arg Asp Ala Thr Gin Asn Val 

565 570 575 

Ser Thr Ser Thr Pro Ser Gin Arg Arg Val Pro Thr Ala Ala Gly Glu 

580 585 590 

Pro Lys Lys Pro Met Leu Ser Gly Cys Leu Pro He He Arg Gly Pro 

595 600 605 

Gin Val Val Thr Arg Glu Ser Asp Asp Met He Ser Gly Leu Val Gly 

610 615 620 

Asp Trp Tyr He Ser Leu Gly Val Tyr Tyr Ala Met Gly Ser Ser Ala 
625 630 635 640 

Ala Ala He Ala Ala Gly His Gin Arg Ala Ser Ala Glu Ser He Asn 

645 650 655 

Ser Pro Met Met Lys Lys Phe Ser Lys Lys Gly Gly Lys Tyr Thr Glu 

660 665 670 

Glu Glu Lys Arg He Lys Lys Ala Met Arg Arg Asn Ala Asp Arg Ser 

675 680 685 

Ala Arg He Leu Ala Leu Leu Gly Gin Thr Asp Ala Gin Tyr Gly Tyr 

690 695 700 

Val Glu His Asn Ser Thr Leu Asp Ser Phe Trp Ser Ser Asn Ala Ala 
705 710 715 720 
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He 
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A 1 
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TV 1 
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He 
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1. 
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Mot- 
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ri lc 
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Val 


Asn 


Arg 
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oiy 
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PIC L. 
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rzl n 
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Asp 
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Ala 
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Asn 
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Gin 


CZ~\ n 
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nXd 


V3 J. U 


val 




Arg 


Gly 
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Leu 


Ocl 


Lys 


His 


Glu 
815 


Lys 


Arg 


Val 


rne 


Asn 
820 


lie 




fit \T 


Leu 


t*ne 
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Ser 


Ala 


Leu 


Asn 


Ser 
830 


Phe 


Lys 


Phe 


Pro 


Leu 


vax 


Pro 


m it 
oiu 


oin 


uiy 


Arg 


Lys 


Lys 


J. Ill 


vax 


Gly Gly Arg 
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840 
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His 


Arg 
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Leu 


Asn 


Asn 


Leu 


Lys 
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O ^ *J 
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Asn 
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T 1 £V 

x xe 


Asn 


Gly 


He 


Thr 


Glu 




inr 
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Gly 
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Asp 
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He 


He 


vjxy 


C A 1** 


vax 


C« 


Asp 


uiy 


Trp 


Gly Asn 


1 1 IX 


x nr 


Ala 


Gin 


Pro 
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895 




Ser 


Arg 


vax 


Lys 


rild 


Leu 


Lys 


l nr 


Leu 


Ser 


Asn 


file 


otr 


Gly Asn 


Gly 
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Asn 


Val 


Vdl 

y xo 


Ser 


lie 


Pro 


vax 


Ser 
920 


Arg 


Ala 


Val 


Lys 


Cys 
925 


Ala 


Ala 


Gly 


Ser 


Arg 
930 


Gly 


vaiy 


uiU 


lnr 


Leu 
935 


Lys 


Cys 


Val 


Asp 


He 
940 


Pro 


Ser 


Val 


He 


He 
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Asn 


Leu 
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lie 
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Asp 
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Arg 


He 


Leu 


Asp 
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Leu 


Cys 


Gly 
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Met 
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Glu 
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Asn 


Phe 


He 


Glu 


Thr 


He 
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Gly 


Lys 


Glu 
980 


His 


Thr 


Gly 


Lys 


Glu 
985 


Ser 


Val 


Phe 


Leu 


Ser 
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Pro 


Arg 


Leu 


Ser 


Val 


He 


Leu 


Leu 


Arg 


Tyr 


He 


Trp 


Phe 


Asn 


Ala 


Ala 


Val 


Val 
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Ser 


Leu 


Thr 


Asp 


Ser 


Asn 


He 


Lys 


Met 


Pro 


Leu 


Asn 


Thr 


Met 


Ser 


Glu 




1010 
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Gly Thr 


Gly 


Asp 


Asp 


He 


Tyr 


Arg 


Asp 


Tyr 


Leu 
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He 


Arg 


Gly Met 


1025 








1030 








1035 








1040 


Val 


Asn 


Asn 
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Ser 


Ser 
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Ser 


Ser 


He 


Ser 


Val 


Lys 
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He 
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Ser 
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Ser 


Gly 
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Ser 
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Ser 
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Gly 


Glu 
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Val 


Leu 
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Gly 
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Val 
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Val 
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Ser 
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Asp 
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He 
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He 
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Glu 


Glu 
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Arg 
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Leu 


Gly 


Glu 
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Glu 


Leu 


Asp 
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Val 
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He 
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Gly 


Val 
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Ser 


Ser 
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1205 1210 1215 

Pro Leu lie Val Lys Ala Ser Lys Gly Tyr Asn Arg Tyr Ala Lys Leu 

1220 1225 1230 

Phe Asn Lys Thr Thr Arg Thr Ala Ala Glu Gin Ala Gin Met Glu Gin 

1235 1240 1245 

Tyr Asn Ala Gin Met Ala Ala Asn Thr lie Pro Gin Leu Val Asn Arg 

1250 1255 1260 

Leu Thr lie Pro Gly Ser lie Thr Ala Asp Thr Ala lie Asn Val Val 
1265 1270 1275 1280 

Lys Ala Phe Thr Glu Asn Gly Glu Phe Ser Asn Ala Glu Thr His Leu 

1285 1290 1295 

Gly Val Met Gly Asn Ala He Asn Glu Met Gin Pro Leu Phe Thr Asp 

1300 1305 1310 

Gly Phe Asn Val Ala Asn Lys Arg Leu Thr Val Asn Val Gly Ser Val 

1315 1320 1325 

Ser Lys Leu He Gin Asn Gly Leu Thr Val Ser Leu He Leu Ala His 

1330 1335 1340 

Ser Lys Asp Tyr Val Phe Lys Pro Leu Val Gin Asp Phe Ala Lys Leu 
1345 1350 1355 1360 

Leu Leu Ala Val Thr Ala Glu Thr Ser Leu Val Val Ser Arg Ser Gin 

1365 1370 1375 

Lys Ser Phe Phe Pro He Pro Pro Ser Val Phe Ser Ser Gly Gly Leu 

1380 1385 1390 

Phe Lys He Asp Arg Glu Met Phe Asp Asn Met Lys Thr Asp Tyr Val 

1395 1400 1405 

Val Glu Val He Arg Gin Leu Ser Lys Asn Ala Thr Ala Ala He Glu 

1410 1415 1420 

Arg Cys Asn Asp Ser Asp Ser Ala Ala Arg lie Ala Lys Ser Gly Glu 
1425 1430 1435 1440 

He Tyr Asn Lys Asp Val Ala Ser Thr Thr Ala Ala Pro Gly Thr Ser 

1445 1450 1455 

Ser Ser Ala Leu Thr Leu Phe Ala Asn Asn Leu Gin Asn Pro Ala Lys 

1460 1465 1470 

Val Trp Ser Met Gly Ala Leu Pro His Phe Asp Met Ala Val Val Pro 

1475 1480 1485 

Lys Leu His Gly He Ser His Asp Gin Met Phe Arg Leu Ser Thr Tyr 

1490 1495 1500 

Tyr Gin Gly He His Lys Met Glu Leu Asn Ser Asp Cys Lys Pro Glu 
1505 1510 1515 1520 

Glu Trp Asp Asn Ser Leu Pro Gly Asn Arg Ala Ser Lys Phe Phe Gly 

1525 1530 1535 

Leu Ser Ser Val Ser Asp Asn Asn Arg Ser Phe Asn Leu Ala Leu Asp 

1540 1545 1550 

Thr Leu Leu Asp Ala Glu He Cys Asp Leu Val Thr Arg Glu Met Val 

1555 1560 1565 

Lys Thr Ser Asn Asp He Val His Asn He Gly Ser Asn Ser Asn Thr 

1570 1575 1580 

Asp Ala Leu Gin Lys Ser Leu Gin Val Gly Ala Ser Ala Val Glu Lys 
1585 1590 1595 1600 

Tyr Asp Glu Ser Thr Leu Ser Thr Lys Glu Thr Asp Val Tyr Ser Leu 

1605 1610 1615 

Val Ser Ala Lys Ser Lys Ser Pro Leu Ser Ser Ser Ser Ser Leu Ser 

1620 1625 1630 

Ser Glu Gly His Leu Thr Ser Lys Glu He Asp Arg Thr Trp Asn Thr 

1635 1640 1645 

Pro Ala Leu Leu Gly Thr Ala Lys Thr Thr Ser Tyr Ser Val Ser Glu 

1650 1655 1660 

Asp Ala Leu Asn Ala Pro Leu Ser Ala Val Leu Asp Phe Arg Arg Asn 
1665 1670 1675 1680 

Val Val Asp Ala Thr Lys Ser Leu Tyr Glu Val Ala Ala Val Cys Ser 
1685 1690 1695 
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Val Met Ser Lys Glu Glu Asp Val Arg Ser Ser Ser Arg Lys lie Met 

1700 1705 1710 

Gly Met Val Glu Gin Glu Ser Pro Val Met Gin Asp lie Gly lie Asp 

1715 1720 1725 

Arg lie Ala Ser Leu Val Ser Thr Val Ala Thr Pro Lys Gin His Arg 

1730 1735 1740 

Arg Phe Leu Gin Thr Val Asn Asp Tyr Lys Asn Tyr Leu lie Arg Lys 
1745 1750 1755 1760 

Val Asn Pro Leu Leu Ser Ser Arg Leu Gly Gly lie Ser Pro Thr Ser 

1765 1770 1775 

Gly Asn Thr Asp Tyr Asn Leu Lys Ala Val Tyr Asp Gly Val Val Ser 

1780 1785 1790 

Ser Ser Ser Ser Met Thr Pro Ser Ser Met Ser Val Ser Asp Arg Phe 

1795 1800 1805 

Trp Ser Gly Val Phe Ser Gin Cys Leu Glu Thr Gly Pro Ser Met Phe 

1810 1815 1820 

Ala Asp Ala Gly His Gly Gly Ser Asn Met Phe Gin lie Thr Ala Pro 
1825 1830 1835 1840 

Lys Leu Tyr Gly Ser Arg Val Asn Thr Tyr Ala Ala Leu Ser Ser Gly 

1845 1850 1855 

Val Glu Arg Leu Arg Asp Ser lie Ser Ser Ala Thr Gin Glu Arg Lys 

1860 1865 1870 

Asn Arg lie Ala Lys Ser lie Glu Ala Leu Glu Thr Phe Val Thr Asp 

1875 1880 1885 

Val Val Gly Gly Asp Thr Leu Asp Gin Leu Arg Lys Ala Gin Asn Met 

1890 1895 1900 

Tyr Asn Lys Leu Ser Asp lie Thr Ser Asn Ser lie Tyr Ser Asp Phe 
1905 1910 1915 1920 

Gly Asn lie Asp Cys Ala Lys lie Met Lys Asn Val Thr Ser Lys Lys 

1925 1930 1935 

Met Thr Ala Arg Gin Gin Ser Asp Thr lie Leu Ser Ser Leu Leu His 

1940 1945 1950 

Glu Leu Ala Gly Leu Val His Lys Gin Gin Pro Gin Leu Ala Thr Gin 

1955 1960 1965 

Phe Ala Ser His Val lie Lys Ala Lys Tyr Val Thr Asn Asp Leu Asn 

1970 1975 1980 

Asn lie His Glu Lys Glu Thr Phe Ser Gin Leu Met Ala Val Ala Gly 
1985 1990 1995 2000 

Val Ala Asp Tyr Tyr Asn Val Ser Ala Ala Ala Met Cys Gin Arg Leu 

2005 2010 2015 

Val Ala Ser Asp Val Thr Met Phe Leu Gly Gly Thr Met Leu Gin Gin 

2020 2025 2030 

Gly Leu Phe Val Ser Phe Leu Leu Asn Asn Val Leu Phe Ser Gin Val 

2035 2040 2045 

Ser Asp Asn lie Lys Met Asn Glu Leu Asn Asp Glu Thr Lys Ser Leu 

2050 2055 2060 

Leu Val Lys Leu Val Gly Phe Cys Gly Thr Val Ser Asp Ala Leu Gly 
2065 2070 2075 2080 

Ser Arg His Val Ser Ser lie Arg Arg Val Gin Asn Glu Glu Asp Lys 

2085 2090 2095 

Lys Leu Asp Arg Ser Phe Val Thr Ser Lys Ala Tyr Arg Asp Leu Arg 

2100 2105 2110 

Lys Lys Thr Glu Leu Tyr Arg Glu Thr Asp Thr lie Asn Lys Leu Phe 

2115 2120 2125 

Gly His Gin Asn Phe Met Ser Tyr Glu Ser Ser Met Leu Lys Arg Thr 

2130 2135 2140 

Ser Leu Val His Asp Ala Val Ser Gly Pro Arg Pro Arg Arg Tyr Ser 
2145 2150 2155 2160 

Thr Leu Glu Asp Val Leu Glu Ala Pro Ser Thr Val His Lys Ser Phe 

2165 2170 2175 

Met Val Ser Tyr Pro Glu Arg Ala Ala Ala Ser Arg Arg Val Lys Arg 
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2180 2185 2190 

Ala Gly Leu Arg Ala Asp Asn Arg Met Glu Ser Leu Tyr Gly Glu Glu 

2195 2200 2205 

Val Leu Asn Asp Met Arg Ser Ser Ala Val Ser Ser Glu Met Met Asp 

2210 2215 2220 

lie Glu Tyr Gly Glu Gly Gly Phe Met Met Met lie Ser Asp Asp Glu 
2225 2230 2235 2240 

Asp Asp lie Ala Phe lie Asp Ser Glu Glu Glu Ser Glu Ser Ser Thr 

2245 2250 2255 

Asp Phe Ser Ser Ser Asp Glu Tyr Ser Asp Ser Ser Asp Glu Tyr Asp 

2260 2265 2270 

Phe Asp Asp Asp Asn Asn Gly Gin Ser Pro Tyr Ser Thr Thr Ser Tyr 

2275 2280 2285 

Ser Tyr Asp Ala Leu Asp Arg Leu Asn Ser Ala Ala Lys Pro Leu Thr 

2290 2295 2300 

Ala lie Tyr Gly Cys Arg Gly Glu Gly Glu Asp Asp Glu Glu Asn Asp 
2305 2310 2315 2320 

Leu Tyr Glu Glu Glu Gin Glu Arg Arg Arg Arg Arg Ser Ser Lys Met 

2325 2330 2335 

Gly Lys lie Leu Arg Asp Leu His Glu Ser Asp Asp Asp Asp Asp Asp 

2340 2345 2350 

Tyr Phe Asp Asp Glu Phe Asp Gly Glu Arg Ser Met Ser Glu Thr lie 

2355 2360 2365 

Ala Thr Arg Arg Ala Gly Arg lie Gin Tyr Gly Pro Gly Phe Leu Ser 

2370 2375 2380 

His Ser Asn lie Leu Asn Arg Pro Ala Lys Ala Arg Ala Phe Leu Thr 
2385 2390 2395 2400 

Arg Gly Lys Lys Phe Arg Pro Ser Ala Tyr Asp Arg Phe Phe Met Glu 

2405 2410 2415 

Asp Asp Asp Ser Leu Leu Phe Ser Asp Glu Ser Thr Thr Ser Ser Ser 

2420 2425 2430 

Ser Ser Asp Ser Pro Phe Ser Ser Phe Ser Lys Gly Arg Lys Cys Lys 

2435 2440 2445 

Arg Arg Thr Ser Glu Asp Gin Cys Ala Phe Val Lys Arg Val Val Arg 

2450 2455 2460 

Ala Phe Val Pro Thr Arg Val Thr Met lie Asn Gly Arg Val Ser Met 
2465 2470 2475 2480 

lie Thr Pro Val Thr Ser Glu Asn Thr Val Gly Phe Tyr Glu Asn Tyr 

2485 2490 2495 

Gin Lys Ala Asn Lys Arg Glu Arg Ala Arg Leu lie Glu Glu Tyr Lys 

2500 2505 2510 

lie Val Lys Gly Ala Ser Ala Thr Leu Pro Asp Glu Tyr Val Glu Gly 

2515 2520 2525 

Arg Ala Ser Lys Gin Val Ser Pro Arg Glu Leu Arg Arg Ser Leu lie 

2530 2535 2540 

Lys Ala Ala Ala Tyr Val Ala Arg Thr Gin Glu Ser Asn Leu Asn lie 
2545 2550 2555 2560 

lie Phe Asp Ala Leu Thr Thr Thr Ser Asn Ala Thr Leu Val Asn Asp 

2565 2570 2575 

Pro Ser Thr Leu Leu Gly Asp Thr Leu Leu Phe Ala Lys Gin Leu Glu 

2580 2585 2590 

Ala lie Thr Glu Arg Arg Asn Arg Leu Met Lys Asp Leu Thr Glu lie 

2595 2600 2605 

Ser Pro Ser Leu Phe Thr Ser Phe Gly Asp Ala Ser Lys Asp Thr Gin 

2610 2615 2620 

Met Met Ala Asp Ala Lys Gin He Val Ser Gly Gly Asn Phe Lys Ser 
2625 2630 2635 2640 

Ala Gly Tyr Leu Gly Val Pro Leu Arg Thr Leu Ala Ser Cys lie Lys 

2645 2650 2655 

Gly Thr Asn Thr Val Asp Arg Leu Leu Ala Thr Lys Asn Lys Asn His 
2660 2665 2670 
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Leu Glu Trp Met Thr Thr Ala Ala lie Val Phe Ala Arg Ser Phe Asn 

2675 2680 2685 

Asp Thr Thr Phe His Ala Leu Glu Asp Thr Leu Lys Met Thr Ser Ala 

2690 2695 2700 

Leu Thr Asp Met Tyr Ser Ala Phe Thr Asn Leu Val Gly Ser Glu His 
2705 2710 2715 2720 

Ser Gin Arg Leu Lys Val Lys Ser Thr Leu Leu Asp Ser lie Phe Asn 

2725 2730 2735 

Thr Arg Met Ala His Thr Glu Ala Val Met Gly Leu Val Tyr Pro Thr 

2740 2745 2750 

Ala Phe lie Asn His Glu Met Pro Ser Asp Tyr Thr Gin Arg Arg Glu 

2755 2760 2765 

Met Gin Ser Leu Ala Leu Asn lie Leu Arg Gly Val Asn Cys Ser Gin 

2770 2775 2780 

Leu Pro Arg Lys Asp lie Gly Asp Thr Ala Gly Leu Leu Thr Phe lie 
2785 2790 2795 2800 

Thr Ser Arg Lys Phe Ala Gly Tyr Gly Gly Glu Arg Gly Gly Leu Ser 

2805 2810 2815 

Leu Tyr Arg Met Ser lie Val Asp Ala Leu Ser Cys Pro Ser Asp Asn 

2820 2825 2830 

Arg Leu Lys Gly Ala Val Ser Leu Glu Val Gly Lys Trp Gin Asp Met 

2835 2840 2845 

Gly Glu Glu lie Phe Tyr Lys Arg Ser Asn Asp Leu Val Asp Phe Cys 

2850 2855 2860 

Ser Lys Asn Asn lie Ser Leu Glu Asn Ala Val Gly Pro lie Ala Arg 
2865 2870 2875 2880 

Phe Val Pro Asn Gly Thr Asn Met Ala Asp lie Gly Met Thr Asp lie 

2885 2890 2895 

lie Ser Arg Thr Val Lys Asp Asp Ala Ser Met lie Arg Leu Arg Arg 

2900 2905 2910 

Ala Glu Glu Gly Ala Gly Ala Ala Gly Lys Phe lie Thr Ala Ser Ala 

2915 2920 2925 

Met Gly Asn Leu Tyr Gly Gly He Asp Thr Val Val Asn Leu Thr Glu 

2930 2935 2940 

Lys Leu Tyr Asp Ser Phe Val Leu Leu Gin Asp Ser Asp Ser Phe Asn 
2945 2950 2955 2960 

Thr Pro Thr Glu Met Ala Thr Ala He He Asn Arg Met Lys Ser Arg 

2965 2970 2975 

Lys His Lys Ala Leu Lys Thr Pro Phe Gly Gly Asp He Ala Thr Tyr 

2980 2985 2990 

Lys Asn Phe Pro Ser Ser Ser Glu Ala He Val Val Arg Ala Lys Glu 

2995 3000 3005 

Met Arg Asn Ser He Ser Thr He Val Met Asp He Ser Lys Ser Arg 

3010 3015 3020 

Gly He Asn Ser Phe Ser Ser Arg Ser Gly Ser Thr Leu Ala Lys He 
3025 3030 3035 3040 

Ser Thr Ser Glu Phe Glu Arg He Thr Ser Ala Val Leu Ser Asn Thr 

3045 3050 3055 

Lys Ala Asn Leu Arg Thr He Glu Asn Arg Leu Ala Glu His Tyr Asn 

3060 3065 3070 

Lys Leu Lys Gin Phe Ser His Asn Asp Gly Leu Ser Glu Thr Arg Ala 

3075 3080 3085 

Val Val Ala Val He Ala Glu Ser Leu Thr Pro Val Tyr Ala Asp Asp 

3090 3095 3100 

Thr Ser Gly Ala Ser Val Ser Glu Leu Leu Thr Asp Asn Thr Leu Leu 
3105 3110 3115 3120 

Lys Phe He Val Gin Asn Glu Leu Lys Asn He Glu Glu Ala Lys Arg 

3125 3130 3135 

His Val Thr Ala Ala He Glu Gly Ser Ser Gin Leu His Glu Lys Met 

3140 3145 3150 

Leu Ser L u Leu Val Ala Ser Ala Asp He Asn Arg Met Ser Ala Gin 
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3155 3160 3165 

Asn Asn Leu Glu Cys Lys Lys Leu Thr Glu Gly Asn Ser Asn Phe Val 

3170 3175 3180 

Pro Met Thr Asn Asp Gin Gly Gly Thr Phe lie Lys His Lys Glu Thr 
3185 3190 3195 3200 

Gly lie Trp Leu Lys Thr Asp Glu Glu Asn Asn Thr Ser Ser lie Lys 

3205 3210 3215 

Asp Asn Asp Gin Arg Arg Val Ala Lys Thr lie Leu Ala lie Val Glu 

3220 3225 3230 

Asp Asn Arg Asn Ala Thr lie Arg Ser Arg Leu Gin Ser Leu Cys Phe 

3235 3240 3245 

Gly Lys Tyr Ala Met Asn Asp lie Phe Ala Leu Asp Asp Ala Asp lie 

3250 3255 3260 

Lys Asn Met Asp Lys Leu lie Glu Lys Leu Gly Glu Ala Glu Lys Asp 
3265 3270 3275 3280 

Ser Ser Ser Ala lie Ser Ser Ser Ser Ser Ser Asn Thr Thr Ser Ser 

3285 3290 3295 

Ser Ser Ser Pro Ser Ser Ser Pro Ser Ser Ser Ser Ser Ser Phe Ser 

3300 3305 3310 

Met Asp Tyr Ser Asn Asn Leu Ala Lys Thr lie Pro Tyr Met Pro lie 

3315 3320 3325 

Val Phe Gin Asn Lys Gin Ser Asn Val Asn Ser Ser Asp Ala Ser Ser 

3330 3335 3340 

Ser Ser Pro Ser Ser Ser Ser Ser Ser Ser Ala Asn lie Asp Asn Val 
3345 3350 3355 3360 

Glu His Lys Lys Val Gin Gin Leu Gin Thr Gin Glu Ser Asn Asp Leu 

3365 3370 3375 

Ser Asn Val Leu Ser Val Thr Thr Lys His Arg Phe Ala Ser His Asn 

3380 3385 3390 

Gin Ala Ala Thr Val Gly He Phe Asn Gly Arg Gin His Ala Glu Thr 

3395 3400 3405 

Val Val Ala He Pro Asn Ala Asn Lys Ala Asn Asn Asn Ala Thr Val 

3410 3415 3420 

Ser Ala Gly Gin Gly He Leu Thr Arg Phe Ser Ala Pro Glu Asn Val 
3425 3430 3435 3440 

Ser Ser Thr Ser Met Gin Leu Pro Pro Ser Ser Ser Ser Ser Ser Asn 

3445 3450 3455 

Gly Asp Asp Asn Lys Val Pro Val Thr Val Arg Leu Asn Gin Tyr Ala 

3460 3465 3470 

Asn Ser He Leu Ser Ser He Glu Asn Ala Ser Glu Phe Lys Asp Leu 

3475 3480 3485 

Lys Glu Ala Glu Arg Lys He Asp Leu Ala He Gin Ala Ala Ser Thr 

3490 3495 3500 

Thr Glu Thr Lys Glu Met Val Thr Val Ser Lys Cys Pro Ser Ala Asn 
3505 3510 3515 3520 

Gin Thr Ala He Thr Ala He Ser Gin Ala Lys Ser Leu Lys Lys Ser 

3525 3530 3535 

Ala Leu Glu Leu Leu Glu Arg Val He Lys Ala Val Glu Val Tyr Thr 

3540 3545 3550 

Pro Asp Ser Ser He Ala Ala Val Ser Leu Pro Val Asn Gly Asp Ser 

3555 3560 3565 

Met Val Ser Ser Ser Ser Gly Ser Gly Ser Ala Pro Ser Ser Ser Ser 

3570 3575 3580 

Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Val Thr Asp Tyr Phe Asn 
3585 3590 3595 3600 

Tyr Ala Tyr Gly Lys Leu Lys Asn He Asp Glu Asn Thr Glu Glu Gly 

3605 3610 3615 

Ala Glu Thr Val Gin Lys Asn Met Val Glu Gin Asp Ala Ala Val Arg 

3620 3625 3630 

He Pro Leu Leu Val Ser Tyr Ala Pro Phe Ser Glu Met Met Arg Arg 
3635 3640 3645 
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Ala He Asp Lys Leu Asn Glu Tyr Tyr Gin Leu He Asp Ala He Lys 

3650 3655 3660 

Thr Lys He Val Ser Asp Thr Lys Gin Ala Ser Ser Trp Ala He Lys 
3665 3670 3675 3680 

Glu Thr Asp Lys Glu Leu Asp Met Asp Lys Glu Gin Val He Ser Lys 

3685 3690 3695 

He Asn Asn Leu Gin Gin Asn Phe Ser Asn Glu Ser Asp Lys He Lys 

3700 3705 3710 

Met Ala He Ser Val Leu Asp Asn Lys Arg Asn Glu Leu Glu Leu Gin 

3715 3720 3725 

Asn Asn Lys Thr Arg Ser Phe He Glu Thr Thr Lys Ser Arg He Glu 

3730 3735 3740 

Ala Gly Gly Gly Asp Val Ala Asn Phe Lys Glu He He Asp Tyr Glu 
3745 3750 3755 3760 

Asn Thr Ser Glu Asn Asp Asn Asn Leu Phe Gin Ser Leu Lys Ala Phe 

3765 3770 3775 

Ala Ala Asp Asn Ser Gly Thr Val Tyr Thr Pro Thr Asp Met Ser Asn 

3780 3785 3790 

Gly Arg Asp Thr Lys Ser Asp Ser Lys Phe Val Asp Met Tyr Asn Lys 

3795 3800 3805 

Gin He Gly Gly He Lys Leu lie Asn Glu Gly Gin Asn Thr Val Lys 

3810 3815 3820 

Val Asp Phe Ser Lys Ala Leu Glu Ala Phe Pro Arg Gin Ser Asn Gly 
3825 3830 3835 3840 

Ala Ser Glu Pro Val Ser Ser Ser Val Val Glu Arg Arg Gin Arg Glu 

3845 3850 3855 

Arg Leu Gin Ala Val Glu Met Phe Met Ala He Met Met Glu Arg Thr 

3860 3865 3870 

Glu Ser Leu Arg Lys Arg Leu Ala Asp Ser Ala Ala Gin Trp Asn Thr 

3875 3880 3885 

Val Asn Asn Val Glu Glu Thr Val Asn Ser Gly Met Val Asn He Lys 

3890 3895 3900 

Ser Leu Thr Glu He Arg Asn Gin Ala Gin He Ala Glu Ser Thr Ala 
3905 3910 3915 3920 

Leu Asn Ser He Asn Asp Glu He Val Glu Ser Pro Leu Thr Leu Ser 

3925 3930 3935 

Leu Gly Ala Arg Val Asp Gin Leu Leu He Lys Val Asp Arg Val Gly 

3940 3945 3950 

Ser He Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Leu Pro 

3955 3960 3965 

Lys Leu Thr Ala Thr Glu Gin Arg Lys Glu Gin Gin Tyr Ala Ala Asp 

3970 3975 3980 

Arg Val Val Tyr Asp Pro Ser Tyr Thr Cys Phe Leu Gin Pro Leu His 
3985 3990 3995 4000 

Glu Thr He Lys Arg He Ser Ser Val Tyr Asn Ser Lys Asn Lys Gly 

4005 4010 4015 

Pro Leu Ser Asn Thr Arg Gly Val Pro Thr Ser Asp Ala Asp Leu Gin 

4020 4025 4030 

Leu Met Thr He Thr Asp Leu Ser Arg Ser Val Leu Asp Ser Ser Ser 

4035 4040 4045 

Thr Ser Ser Lys Lys Met Leu Tyr Glu Asn Val Pro Ser Ser He Val 

4050 4055 4060 

Pro Gly Leu Cys Gin Gin Cys Ala Met Met He Thr Asn Val His Glu 
4065 4070 4075 4080 

Ala Thr His Thr Ser Pro His Ser Phe Asn Phe Glu Asn Lys Arg Ser 

4085 4090 4095 

Leu Lys Gin Leu Thr Glu Met Leu Asn Ala Ala Thr Ser Ser Ser Asp 

4100 4105 4110 

Gly Pro Ala Val Arg His Asp Val Leu Thr Met Leu Glu Ser Asn Asn 

4115 4120 4125 

Gly Tyr Val Lys Asp Phe Gly Phe Thr Gin Lys Val Ala Cys He Thr 
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4130 4135 4140 

Pro Val Asn Thr Leu Leu Gly Gly Thr Phe Ser Gly Asn Val Ala Pro 
4145 4150 4155 4160 

Asn Thr Val lie Leu Pro Thr Ser Glu Leu Phe Asn Cys Pro Gly Val 

4165 4170 4175 

Glu Asn Asp Lys Phe Arg Ser Met Val Asn Arg Thr Thr Asp Lys Asn 

4180 4185 4190 

Val Ala Asp Ala Pro Lys Ser Ser Ala Ser lie Val Glu Thr Leu Ala 

4195 4200 4205 

Arg Thr Ser Pro Asn Ala Glu His Leu Tyr Phe Pro Phe Lys Asp Gin 

4210 4215 4220 

Arg Arg His Phe Asn Ser He Thr Asp Ala He He Ser Gly Met Ser 
4225 4230 4235 4240 

Gly Glu Ser Ser Ser Gin Leu Asn Thr Thr Cys Asp Gin Asn Leu Val 

4245 4250 4255 

Asn He Asp Gin Thr Thr Gly Phe Pro Val Phe Thr Gly Arg Lys Gin 

4260 4265 4270 

Gly Glu Arg Arg He Val His Thr Glu Asn Thr Met Glu Gly Ala Arg 

4275 4280 4285 

Lys Asp Lys Asn Ser Gly He Pro Ser Cys Thr Lys Asp Arg Gin Thr 

4290 4295 4300 

Tyr He Asp Met Gly Thr Lys Phe Met Val Ala Pro Gly Ser Leu Leu 
4305 4310 4315 4320 

Asn Ala Asn Lys Glu Glu Thr Leu Arg Leu Asn Arg Leu Ser Asp He 

4325 4330 4335 

Asn Asn Val Arg His Tyr Gly Thr Asp Val His Val Ala Gly Ala Asn 

4340 4345 4350 

Ser Ala Trp Arg He Gly Glu Val Val Arg Ala Ala Ser Ser Phe Pro 

4355 4360 4365 

Asp Gly Asp Lys Glu Ser Ala Met Lys Lys Met Leu Leu Leu Gly Ser 

4370 4375 4380 

Val Ser Ala lie Ser Ala Gin Lys Ser Ala Ser His He Asn Asp Pro 
4385 4390 4395 4400 

Thr Ala Leu Leu Ser Thr Asn Thr Ser He Gin Asn Leu Val Lys Glu 

4405 4410 4415 

Ala Phe Pro Asp Pro Val Cys Ser Ser Asn Tyr Leu Gly Ser Ala Glu 

4420 4425 4430 

Ser Thr Phe Ala Thr Gin Leu Ala Tyr Arg Gin Arg Leu Phe Pro Asn 

4435 4440 4445 

Gly Asp Asp Glu Asn Val Thr Thr Val Ser Asn He Cys Pro Met Asp 

4450 4455 4460 

Leu Met Gly Ser Thr Lys Arg Tyr Asn Asp Ala Phe Asn Asn He Phe 
4465 4470 4475 4480 

Gly Ser Lys Met Thr Ser Thr Asn Lys Lys Gly Ser Asn Cys Glu Asn 

4485 4490 4495 

Leu Leu Lys Ser Ala Met Ser Asn Val Pro AJ.a He Asn Thr Ala Phe 

4500 4505 4510 

Gly Ala Phe Glu Glu Ala Ser Ser Ser Val Arg Asn Arg Leu Ser Pro 

4515 4520 4525 

Leu Tyr Glu Asp Ser Thr Lys Tyr Ser Ser Asn Gin Leu Ala Val Gin 

4530 4535 4540 

Ala Met Thr Asp Thr Ala Val Asp Ala Leu Ser Ala Val Ser Thr Val 
4545 4550 4555 4560 

Val Gly Arg Gin Asn Gly Arg Asn Thr Leu Leu Ser Leu Pro Thr Ser 

4565 4570 4575 

He Thr Ser He Ala Thr Ser Gly Arg Pro Ser Leu Ser Tyr Ser Ser 

4580 4585 4590 

Asp Met Lys Ser Asn Leu He Lys Thr He Ser Arg He Asn Arg Asp 

4595 4600 4605 

Ala Ser Leu Leu Ser Met Gly Asp Ser Gin Val Ala Ala Gly Ser Ser 
4610 4615 4620 
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Phe Phe Asn Ser Phe Leu Arg Ser Ser Ser lie Pro Val Thr Thr Ser 
4625 4630 4635 4640 

Gin Asp Gly Asn Val Ala Ala Ala Glu lie Val Leu Gly Thr lie Leu 

4645 4650 4655 

Asp Lys Thr Val Glu lie Asn Lys Arg Phe Glu Met Leu Gly Gly Gly 

4660 4665 4670 

Lys Met Val Ala Gly Ser Pro Glu Ala Arg Ala lie Gin Arg Asn Thr 

4675 4680 4685 

Met Ser Ser lie Leu Gin Met Asn Glu Asn Glu Leu Ala Arg Asp Leu 

4690 4695 4700 

Cys Glu lie Glu Asn Lys He Glu Thr Arg Gin Leu Arg Asp Ala Phe 
4705 4710 4715 4720 

Gin Asp Leu Lys Arg Ser Met Leu Met Thr Pro Gly Gly Val Gly Ala 

4725 4730 4735 

He Ser Ser Gly Ala Ser Thr Asn Asn Val Pro Leu Ser Leu Leu Met 

4740 4745 4750 

Ser Arg Val Asp Ala Ser Ser Gly Leu Leu Met Asn Asn Asn Ser Ala 

4755 4760 4765 

Asn Val Met Glu Ala Val Asp Ser Phe Asn Thr Thr Pro Leu Leu Val 

4770 4775 4780 

Arg His Met Met Leu Asp Ser Gly Lys Ser Pro Val Pro Met Ala Lys 
4785 4790 4795 4800 

Glu He Arg Ser Met Leu Thr Gin Pro Arg Ala Leu Thr Ala Arg Ala 

4805 4810 4815 

Leu Leu Ser Glu Ser Ser Pro Leu Leu Thr Glu He Cys Leu Tyr Asn 

4820 4825 4830 

Thr Arg Asp Thr Gin Pro Glu Arg Ala Val Asp Arg Leu Leu Thr Ser 

4835 4840 4845 

Ala Tyr Leu Val Lys Gin Ala Lys Arg Phe Asp Gly Val Asp Pro Ala 

4850 4855 4860 

Phe Pro Ala Ala Leu Thr Cys Ala Ser His Leu Met Leu Ser Ser Met 
4865 4870 4875 4880 

Asp Ser His Thr Lys Ser Ser Phe Met Asp Asn He Lys Leu His Met 

4885 4890 4895 

Thr Asp Thr Gin Cys Phe Phe Lys Asn He Glu Arg Phe Glu Lys Phe 

4900 4905 4910 

Leu Gly Arg Tyr Gly Asp Glu Tyr Ala Met Ser His Lys Gin Asn Cys 

4915 4920 4925 

Asn Cys Pro Phe His Leu His His Thr Phe -Thr Pro Ser Asp Asn Glu 

4930 4935 4940 

His Leu Val Ser Ser Phe Ala Phe Ala Arg Pro Glu Val Ser Met Glu 
4945 4950 4955 4960 

Glu He Arg Ala Thr Pro Tyr Gin Ala Asn Lys Leu He Ser Asp Lys 

4965 4970 4975 

His Tyr Val Met Asn Met Ser Lys He Asp Ser Arg Val Thr Gly Ser 

4980 4985 4990 

Ser Leu Leu Lys Lys Val Ser Glu Trp Thr Glu Met Arg Met Asn Ser 

4995 5000 5005 

Asn Phe Asn Gly Thr Phe Glu Pro Ser Arg Leu Ala Leu Ser Asn Ser 

5010 5015 5020 

Gly Met Thr Thr Ala Gly Val Asn Leu Asp Val He Val Lys Pro Asn 
5025 5030 5035 5040 

Asn Ala Arg Ser Val Leu Gly He Cys His Arg Gin His Val Cys Thr 

5045 5050 5055 

Ala Asp Ala Lys Gly Thr Val Ala Ser Ala Met Pro Ala Val Phe Gin 

5060 5065 5070 

Ala Thr Asp Gly Asn Gly Asn Glu Ser Glu Leu He Gin Asn Ala Leu 

5075 5080 5085 

Pro Arg Asn Arg Tyr He Gin Lys Ser Thr Met Asn Ala Gin Thr Val 

5090 5095 5100 

Val Phe Ala Asn Val Leu Glu Gin Leu He Ala Asp Leu Gly Lys Val 
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5105 5110 5115 5120 

lie Val Asn Glu Leu Ala Gly Thr He Ala Glu Ser Val Pro Glu Ser 

5125 5130 5135 

Val Tyr Glu Asn Thr Lys Glu Met He Asp Arg Leu Gly Ser Asp Asp 

5140 5145 5150 

Leu Phe Lys Ser Asn Asn Asn Gly Gly Val Glu Ser Met Asp Tyr Glu 

5155 5160 5165 

Asp Ser Glu Thr Thr Ser Asn Asn Gly Pro Val Leu lie Ser Glu Ala 

5170 5175 5180 

Met Lys Asn Ala Val Tyr His Thr Leu He Ser Gly Lys Ala Ala Arg 
5185 5190 5195 5200 

Pro Glu Asn Val Pro Phe Ala Ser Cys Ala Ser Gly Pro Leu Ala Phe 

5205 5210 5215 

Asp Phe Leu Leu Ser Lys Gly Asp Thr Phe Glu Glu Lys Asn Ala Glu 

5220 5225 5230 

Gin Gly Ala Ala Ala Ala Val Ser Ser Thr Tyr Ser Ser Ser Ser Asn 

5235 5240 5245 

Thr Thr Leu Arg Lys His Leu Ala Arg Val Phe Glu Ala He Ser Lys 

5250 5255 5260 

Gin Val Thr Asp Ala Glu Phe Lys Asp lie Leu Asn Asp He Glu Arg 
5265 5270 5275 5280 

Asn He Ser Ser Asp Tyr Thr Asn Cys Pro Pro Asn Thr Asn Gin Asn 

5285 5290 5295 

Ala Phe Ala He Lys Arg Glu Phe Ser Arg He Val Ser Phe Leu Thr 

5300 5305 5310 

He Leu Arg Lys Asn lie Thr Pro Ala Leu Val Asp Pro Lys Gly Ala 

5315 5320 5325 

Leu His Glu Lys Val Ala He Tyr Leu Thr Leu Leu Ser Thr Lys Ser 

5330 5335 5340 

Lys Leu Glu Asn Phe Phe Gin Tyr Gly Leu Ser Asn Ser Ser Ser Val 
5345 5350 5355 5360 

Asp Leu Ser His Leu Lys Pro He Asn Cys Ser Asn Asn Val Lys Asn 

5365 5370 5375 

He Glu Asp Thr Phe Met Tyr Arg Asn Val His Pro lie Leu He Met 

5380 5385 5390 

Ala Leu Pro Glu Asn Phe Thr Ala Leu Leu Gin Gin Glu Gin Met Asp 

5395 5400 5405 

Pro Asp Thr Ala He Glu Ser Arg Arg Ser Leu Thr Thr Phe Leu Asn 

5410 5415 5420 

His Pro Asn Thr Ala Ser Met Ala Asn Gly Ala Arg Ala Ala Val Gly 
5425 5430 5435 5440 

Ala Gly Gly Gly Asn Pro Met Gly Leu Ser Ser His He Leu His Glu 

5445 5450 5455 

Ser Thr Val Thr Thr Ser Asn Pro Val Thr Asp Thr Thr Glu Asn Val 

5460 5465 5470 

Asn Tyr His Ser Ser Val Thr Gin Asp Pro Val Met Val Val Asn Pro 

5475 5480 5485 

Phe Lys Asp Ser Ala Arg Leu lie Val Asn Asn Asn Asn Thr Gly He 

5490 5495 5500 

Asp Val Leu Asn Asp Lys Ser Cys Asn Tyr Leu Gin Val Ser Met Pro 
5505 5510 5515 5520 

Ser Glu Ser Ser Gly Leu Val Thr Asn Thr Gly Cys Ser Ser Ser Ser 

5525 5530 5535 

Ser Ser Ser Ser Ser Asp Thr Phe Lys Tyr Val Arg Arg Asp Asn Thr 

5540 5545 5550 

Pro Val Asn Leu Pro Arg Val Thr Pro Ala Val Leu Cys Ser Asp Ala 

5555 5560 5565 

Ser Ser Asn Leu Leu Asp Val Phe Ser Arg Ala Asp He Val Leu Glu 

5570 5575 5580 

Asn Met Asn Val Arg Phe Gly Phe Met Pro Glu lie He Ala Ala Val 
5585 5590 5595 5600 
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Ser Lys Phe Lys Gly Leu Thr Lys Glu Glu Val lie Lys Gin Met Val 

5605 5610 5615 

Ser Gin Asn Asn lie Asn Asn Asn Ser Asn Asn Asn Asn Gly Asn Gly 

5620 5625 5630 

Lys Lys Thr Thr Val Asp Pro Val Thr Gly Asp lie Val lie Thr Asn 

5635 5640 5645 

Ala Thr Phe Pro Asp Trp Leu Tyr Thr Ala Ala Asn Gly Gly Thr Ser 

5650 5655 5660 

Ser Phe Lys Trp Gly Asp lie Asn Asp Arg Lys Met His Ala Lys Ala 
5665 5670 5675 5680 

Phe Pro Thr Phe Phe lie Gly Asn Pro Thr Ala Ala Ala Thr Ala Asn 

5685 5690 5695 

Gly Val Pro Leu Thr Ser Glu Gly lie Ser Leu Thr Glu Glu Lys Arg 

5700 5705 5710 

Lys Lys lie Ala Gly lie Ser Glu Gly Ser lie Gly Thr Gly Ala Leu 

5715 5720 5725 

Arg Ala Ala Ala Asn Thr Arg Leu Ser Ser Asp Met Glu Pro Val Met 

5730 5735 5740 

Lys Gly Trp Asn Asn lie Val Gin Leu Gin Gin Thr Phe Lys Lys Ala 
5745 5750 5755 5760 

Ser Asp Lys Leu Thr His Leu Leu Arg Ser Gly Gly lie Pro Pro Arg 

5765 5770 5775 

Ser Gin Glu Thr Asn Ala lie lie Asn Lys Met His Asp Ser Phe Lys 

5780 5785 5790 

Thr Leu Glu Glu Cys Arg Arg Val lie Gin Asp Glu Ala Ala Leu Leu 

5795 5800 5805 

Val Ala Thr Ser Asp Leu Leu Thr Gly Gly Tyr Gly Gly Asp Ala Ala 

5810 5815 5820 

Met Val Ser Pro Val Arg Pro Glu Met Thr Gly Leu lie Gly Ala lie 
5825 5830 5835 5840 

Ser Ala Pro Val Arg Gly lie Ser His Leu Leu Lys Leu Gly Gly Val 

5845 5850 5855 

Ser Ala Ala Asn Ala Ala lie Arg Lys Arg Leu Asn Leu Pro Thr Ser 

5860 5865 5870 

Asn Gly Lys Thr Leu Pro Glu His Gly lie Val His Lys Ser Ala Lys 

5875 5880 5885 

Thr Leu Leu Leu Asp Ser Asp Ser lie Ser Asn Leu Tyr Asn Thr Asp 

5890 5895 5900 

Leu Gin Asp Val Val Ser Asn Ala Arg Asp Asn Asn Asn Leu Gly Arg 
5905 5910 5915 5920 

lie Met Gin Ser Leu Gly Leu Lys Gly Asn Asn Ala Gly Asp Leu Val 

5925 5930 5935 

Tyr Ser Ala Arg Gin Leu Thr Asp Leu lie Thr Val Pro Glu Tyr Gly 

5940 5945 5950 

Asn Asn Arg Asp Leu Thr Lys Arg Gin Ala lie Leu Lys Met Leu lie 

5955 5960 5965 

Ser Asn Pro Glu lie Asn Val Ala Asp Thr lie Tyr Leu Thr Thr Gly 

5970 5975 5980 

Lys Asn Ala Pro Val Ser Ala Gin Glu Met Ala Cys Ala Ser Leu Thr 
5985 5990 5995 6000 

Val Gly Gly Ser Gly Gly Gly Lys Leu Ser Ser Asp Asp Asn Val Gin 

6005 6010 6015 

Ser Leu Asn Arg Leu Tyr Phe Arg Val 
6020 6025 



<210> 62 
<211> 2190 
<212> DNA 
<213> SHRIMP 
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<400> 62 

atggataaag tttgtgttat atcaaatact agggagcgca cgtttaaggt acctgccgat 60 
ctactgtgcg ttgcaacaga accggaaatc tctaccaagg aagaagatgc aggtatagaa 120 
atcgagacaa gagtggtggt gttttcaagg tgcgtctcgg tccaggaact acatacaata 180 
aatccaaatg atgaaggatt ttctgtccaa cttttcaagg actacctgaa attgcaatct 24 0 
gcacaaggaa aaaaacccat tggtttgtac atccaaataa aggctggaga ggatcttgaa 300 
aggagattaa tcagtggagg aactgcatac ctggatccgg caacacacct tttctatctt 360 
gatttctccc tttaccctaa ttattcaata ttcaatgaca tttcatcccg tctgaaaatc 420 
attgatgaag acacgtacaa tggtgttgtt ttctctaaca gtgaagaaaa agagaaggat 480 
gcactagtgc tgataagggt gactttttct acgcatgaaa aggcaattga agcagccata 540 
aaaaaaataa tgctaaggaa agtgtttttc aaggatggag atcttgattt cgggtactta 600 
cgtataccaa aatctaaact ggacaaattt actccctatt ttcggagtca atacgggata 660 
gtaaatgttg aaaaaaatat ccctggttac atatggggag aaattatgaa gcaacgagtg 720 
cgatgttcca gatggtacct ttacaacacc gactcggaat gggaatataa aaatgtggcc 780 
gaagaaagag ttggacctcg ccagttagtg aaaaaatatg gtgccaagtg tgaaaattta 840 
tgttttaggg acatagacct cagaaaaaag gaagcaaagg aaaaaaggga tatagaaaga 900 
gaaactgaaa gcagatatgt ggtcgtaaca ctaacccata agcatgaaat gcctgaaaat 960 
atgccctatt ttggaccaaa gtgttcagtg gtgaggttgg atgaaactag aatactttta 1020 
tgttttgtgg atgaaatttc ttataatgat gaagatgtag acgaaatttt gtctgagaat 1080 
agatcactaa gaaatgtttc tattagacat aaggaaaatg tacctgtaca cacgttatta 1140 
aaaaaaggtg tgtctattca tgctagattt acccttaatg gtttggatga tgctttaata 1200 
attttaaaga gaataccaaa aacttatttt gaagatgagg aactacaagc cgcttgtgcg 1260 
catgttaacc ttgaacagta cgaatggctt. tgttctaata atagagggaa taaagtagaa 1320 
catgtaaagt cgcgggtagt gactcgagca gttaagcgta ggagaaaatg tagacactgg 1380 
atttattttg ataaagacac tttaaattta aactacaaat actttgataa aaaagttact 1440 
gctagtatgg catctaaaat atgtaatgca aaacacgact gtttagtttt ccatagaaaa 1500 
atggaattgg aagatttgac tgagagcgca tatttcaagg tagaaccttc cccaataaat 1560 
tttgccaagt taaaatcttg cccggatgtt aaatatgtgc agaaaaaaac agatggtaca 1620 
ttttctgtta taagattctt tagaaacatg acaaagggtg atcttattca aaggatggat 1680 
cttttttgta ggtttattcc cgactcacac actattacac ttttgagtag ggcggatttt 1740 
tatgcatgta aaagaggaga atctatgcat atgtgcacaa acaaacaccg tattcttcac 1800 
tacaaattct ccaacgctcc ccatgcggcc atcgaacaaa taaccaatat catcagtgat 1860 
acaagaggac gtaagggtat acacatagaa tacgcgatcg aaaatgtaca agaaatgtac 1920 
gaagaagatg gaagaagata tgaagctaaa tacactggaa ctttaaccga gtacaaaaga 1980 
aatgaggaca aaaccttcaa atctcttctt gctcctcatt taacacctgt caacaaacca 2040 
tataatatta accatttgta tgagcaatat ggaaattttg atgaagaatt agaagacaag 2100 
ttgaggagtg gtttcatttc ttatgacacg tatgttactg caaaagacaa ctggggcagg 2160 
tgtgcaactg gaaagggggc gtgcatctag 2190 

<210> 63 
<211> 727 
<212> PRT 
<213> SHRIMP 



<400> 63 
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<210> 64 
<211> 966 
<212> DNA 
<213> SHRIMP 



<400> 64 

atgtgcacat taaaaacata caaaatgact acttcaacag aaatatcaaa gaacctttca 60 
gatgtgttat ccatcaaggc aactggagat tggtgcagta atatcaagac ggtattttca 120 
cccttcacag aaggcaaggg aaatttacca aacagtctcc cgtttacgag aagtcccaat 180 
acaacatgtg gttcaagaga ggcggcaaac gccacagagc attttatcac cgtctttgca 24 0 
aaggacaaat atgagcggaa aagagtaaaa cgtacaatcg gattcaccct cgacaacaca 300 
aaggagttga cgcccaacag atacttggta gcagatgtat actcttggca agaagagaaa 360 
atggtgtttg aaggattttg tgtcccacca ggaaagtcgg gaacatttgt acgctactct 420 
aatgaagata aaagttttct actagcagat accggaagat atatgaaaaa gaagtacgat 480 
gatccagaaa ataagaccag tagtgggggt gatgatgacg atgacgacga tgatgatgat 540 
gacgacaaca acaatgttga cgtgtatgaa gaaaacgacc ccagaaatgt attcgaggtc 600 
gaaaaggatg aaaaatatgc ctgtactttt tcaattttgg tctatagagc aatgaaaaag 660 
tctcctcctg tatgtagagg gttattagta gagacagatg gaccctcatc tcaccctaaa 720 
cgggccccgt cagcatttaa tccattcgga ggaagttcta tgttgaacgg ttatggtgca 7 80 
ggtgcagatg cactagaaga agaggatgaa gttgatggag ttcctgaaag agagaggatt 840 
acaaattttg ctctcaagag aggacctgca actggccaga actttgtatc tgttaaactg 900 
gaacatgatg gatctaaagc agacctgtac aacgtcacgt gcttctccaa gcagcgtgga 960 
gtataa 966 



<210> 65 
<211> 321 
<212> PRT 
<213> SHRIMP 



<400> 65 
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Glu 


Glu 


Glu 


Asp 


Glu 
270 


Val 


Asp 


Gly 


Val 


Pro 


Glu 


Arg 


Glu 


Arg 


He 


Thr 


Asn 


Phe 


Ala 


Leu 


Lys 


Arg 


Gly 




275 










280 










285 








Pro 


Ala 


Thr 


Gly 


Gin 


Asn 


Phe 


Val 


Ser 


Val 


Lys 


Leu 


Glu 


His 


Asp 


Gly 




290 








295 










300 










Ser 


Lys 


Ala 


Asp 


Leu 


Tyr 


Asn 


Val 


Thr 


Cys 


Phe 


Ser 


Lys 


Gin 


Arg 


Gly 



305 310 315 320 

Val 



<210> 66 
<211> 1197 
<212> DNA 
<213> SHRIMP 



<400> 66 

atgcaactca 

cccatgtttg 

aataaacatt 

acaatagtca 

gaactaatca 

ttcacggatt 

gcacttgatg 

acaaattcca 

gctcctttta 

gctgaaatgt 

aaggggaaac 

ctcttgttga 

aggagcagcc 

gatcatgcag 

cagttgttag 

ttctctttag 

cctcagcccg 

acttttggtg 

ctagacctcg 

attgaggagg 



ttctttctca 
cgggcaagtc 
gcctgtttgt 
ctcatgccgg 
gtgtgttacc 
tggtgctagt 
gaacttctga 
ttgttaagct 
ctgtaaggtt 
acgctgctgc 
ttgttgtact 
ccaataaaaa 
atacgggtaa 
ctcacttgtt 
acgatggaat 
ctagaggagt 
atcttgtatt 
tcgaaagatt 
caaataagga 
tgcaaactaa 



tcatctaacc 
tacctacctg 
caaacactcc 
agaagtgatt 
agaagttgtg 
caatagactg 
ccagcaaatg 
agcatctaaa 
tggtaatgac 
ctgccgggac 
tgaaggaggt 
ctcgcctctt 
actcatcaat 
attttctgca 
ccatgttgtg 
ggataccgtt 
gttgatgctt 
tgagacaaat 
cgaaaagaat 
aattataaat 



atggctggtc 
aaaaacatat 
ctagaaacta 
gaaggttgta 
gatgtgattc 
gctgacaagg 
ttcagtccta 
tgtatgattt 
aatgataata 
tgttacaaaa 
gacaggtgcg 
tatggaggag 
gattatttaa 
aatagatggg 
atggatagat 
gagtggtgct 
ttagatgttg 
tccattcaag 
gtatggatta 
attgtatata 



gtgtagagct 
accaacaaga 
ggtacggttg 
ctacagtttc 
tcattgacga 
ggaaaaggat 
ttcataagct 
gtaaaattga 
atgttatatg 
aaattaacaa 
gtaagagtac 
aatatatgtg 
ctaagaaaat 
aagtttgtag 
attactactc 
ctgctagcga 
aaaagtgttc 
aacgtgctag 
aggtagacgc 
atattgttga 



cgtcactgga 
aaatggaggc 
tggaactgga 
ttctatcaag 
agggcaattc 
tgtgattgca 
attgccttat 
taccaaagaa 
tgtaggagga 
gaaaaagaac 
ccaagccaaa 
ctttcccgac 
tgaactagat 
taaaattaag 
ggggattgtt 
tgagggactt 
aaatagggat 
agccctattc 
tcgcggcacc 
agaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1197 



<210> 67 
<211> 394 
<212> PRT 
<213> SHRIMP 
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<400> 67 












Met 


Gin 


Leu 


He 


Leu 


Ser 


His 


His 


1 








5 








Leu 


Val 


Thr 


Gly 


Pro 


Met 


Phe 


Ala 








20 










Tip 

lie 


i yr 


Gin 


Gin 


V3X U 


Asn 


uiy 


oiy 






35 












His 


Ser 


Leu 


Glu 


Thr 


Arg 


Tvr 

xyx 


Gly 




50 










SS 






Ala 


uiy 


VJl u 


Va 1 
vox 


Tip 
IXC 




oiy 


65 










/ u 






O X U 


L6U 


lie 


Se x* 


Val 


liCU 














ft s 








Glu 


r; "1 v 

OX _y 




Phe 


Phe 


Thr 


A en 
nop 


Leu 








l nn 

XvU 










Lys 




jjys 


Arg 1 


He 


Val 


He 


Ala 






115 












fXl n 


ne t 


Pho 
t life: 




XT X V 


Tip 
X xc 


nx o 


Lys 




130 










1 

1-^3 




v ax 


Ly s 


lieu 


Ala 


Cpr 


Jjys 


Cys 


l Ifci L- 


i jt: 










IjU 






Ala 
nxa 


Pro 


IT IlC 


i nr 


Vol 


Arg 


rne 


oiy 










lOJ 








cys 


va i 


r*i w 


oiy 


Ala 

nXa 


OX LI 


l itr l_ 


Tyr 








1 on 










Ly s 


Lys 


Tic* 

lie 


A en 
noil 


T 

iiy s 


i»yo 


JLys 


Asn 






1 QS 
l ^ j 










£. KJ\J 


uiy 




Asp 


Arg 


Cys 


uiy 


Lys 


Ocl 














£lJ 




Asn 


Lys 


Asn 




fro 


Leu 


Tyr 


Gly 


^. ^ j 
















Arg 




Car 

oer 


nis 


Th r 
x ill 


oiy 


Lys 


Leu 


















T "1 ~ 

1 le 


olU 


Leu 


Asp 


Asp 


nis 


Ala 


Ala 

ax a 








^ by 










Trp 


olU 


vdi 


Cys 


Car 


Lys 


He 


Lys 






£ 1 D 










*>ft n 


V3± 


vai 


Met 


Asp 


Arg 


Tyr 


Tyr 


Tyr 




290 










295 




Val 


Asp 


Thr 


Val 


Glu 


Trp 


Cys 


Ser 


305 










310 






Pro 


Asp 


Leu 


Val 


Leu 


Leu 


Met 


Leu 










325 








Arg 


Asp 


Thr 


Phe 


Gly 


Val 


Glu 


Arg 








340 










Arg 


Ala 


Arg 


Ala 


Leu 


Phe 


Leu 


Asp 






355 










360 


Val 


Trp 


He 


Lys 


Val 


Asp Arg 


Thr 




370 










375 




lie 


Asn 


He 


Val 


Tyr 


Asn 


He 


Val 


385 










390 







<210> 68 

<211> 486 

<212> DNA 

<213> SHRIMP 

<400> 68 

atgttaccta gaaagacttt gcccgacact 



Leu 


X 11X 


l ie L~ 


Ala 
nla 


HI V 

oiy 


Arg 


Val 


V7X U 




1 0 

X \J 










15 




Gly 


Jjyo 


Ser 


Thr 




Leu 


Lys 


Asn 


















Asn 


Lys 


nlo 


Cys 


Leu 




Va 1 

vox 


Lys 










45 










oiy 


Thr 

XI1X 


oiy 


Thr 


Tip 

X X<? 


Va 1 


Th r 

i in 


















Cys 


Th t- 

x nr 


Thr- 


Vdl 


bsr 




Tl 0 

ixe 


Lys 






7S 












Va 1 
Val 


Vox 


Asp 


vol 


Tie 
lie 


Leu 


Tl 0 

i ie 


Asp 


















Va 1 
Val 


Leu 


\7a 1 
vol 


Asn 


Arg 


Leu 


Ala 
nla 


Asp 


IUj 










inn 






Ala 


Leu 


Asp 


uiy 


x nr 




Asp 


oin 










125 








Leu 


Leu 


Pro 


Tyr 


Th r- 

x nr 


Asn 




Tin 

ne 








1 *i W 










1 Its 


Cys 


Lys 


lie 


Asp 


Th r- 
1 11 L 


Lys 


ulu 
















1 Cfl 
1 OU 


Asn 


Asp 


Asn 


Asp 


Asn 


Asn 


vol 


T1 A 

x ie 




1 / \J 










iic 

1 / «J 




Ala 
nla 




Ala 
nla 


Cys 


Arg 


Asp 


Cys 


Tyr 


1 ft S 
lOJ 










i an 






T ire 

Lys 


«iy 


Lys 


Leu 


\/a 1 

v ai 


Vdl 


Leu 


olU 


















inr 


olil 


Aid 


Lys 


Leu 


Leu 


Leu 


Txir 


















biy 


olU 


Tyr 


net 


Cys 


rne 


fro 


Asp 






r> 1 q 










^4 U 


j. ie 


Asn 


ASp 


Tyr 


i«eu 


Thr 


Lys 


Lys 




•? <^n 










zoo 




His 


Leu 


Leu 


pne 


Ser 


Ala 


Asn 


Arg 


Oct 










O *7 A 






(jin 


Leu 


Leu 


Asp 


Asp 


Gly 


lie 


Mi- 
nis 










zoo 








Ser 


Gly 


He 


Val 


Phe 


Ser 


Leu 


Arg 








300 










Ala 


Ser 


Asp 


Glu 


Gly 


Leu 


Pro 


Gin 






315 










320 


Leu 


Asp 


Val 


Glu 


Lys 


Cys 


Ser 


Asn 




330 










335 




Phe 


Glu 


Thr 


Asn 


Ser 


He 


Gin 


Glu 


345 










350 






Leu 


Ala 


Asn 


Lys 


Asp 


Glu 


Lys 


Asn 










365 








lie 


Glu 


Glu 


Val 


Gin 


Thr 


Lys 


He 








380 










Glu 


Glu 















gaaaatggtt attttgtctt ggacgagtct 60 
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cttctggaga 
tatatgcaga 
tatagtgtgt 
gatggggtat 
gggtgctaca 
cagcaaatga 
tctagaacgt 
tgttaa 



aggtgtacta 
tatgcaagtc 
tggaggatta 
ttttaggacc 
aagctctttt 
aaaccaacct 
acaatacacc 



tgataacaac 
aaaatacatc 
tcaccccatc 
gtgggagttt 
gtcattgcca 
ggaaaaaatg 
ccagatagaa 



aatgaactga 
ttccatcacg 
aaagagattg 
ttatcgcgca 
gaggacaaat 
gaagaaatac 
ttgtctgacc 



ttgtaagagt 
atgatccaga 
ttgaacgact 
aacaagtgaa 
attgtaacct 
agcgtactag 
agctagatgg 



tggtgggatt 
gaggttcttt 
agcagaagag 
cctccaacac 
attattaccc 
actcattcac 
atgtgttata 



120 
180 
240 
300 
360 
420 
480 
486 



<210> 69 
<211> 161 
<212> PRT 
<213> SHRIMP 



<400> 69 



Met 


Leu 


Pro 


Arg 


Lys 


Thr 


Leu 


Pro 


Asp 


Thr 


Glu 


Asn 


Gly 


Tyr 


Phe 


Val 


1 








5 










10 










15 




Leu 


Asp 


Glu 


Ser 


Leu 


Leu 


Glu 


Lys 


Val 


Tyr 


Tyr 


Asp 


Asn 


Asn 


Asn 


Glu 








20 










25 










30 






Leu 


He 


Val 


Arg 


Val 


Gly 


Gly 


He 


Xyr 


Met 


Gin 


He 


Cys 


Lys 


Ser 


Lys 






35 










40 










45 








Tyr 


He 


Phe 


His 


His 


Asp 


Asp 


Pro 


Glu 


Arg 


Phe 


Phe 


Tyr 


Ser 


Val 


Leu 




50 










55 










60 










Glu 


Asp 


Tyr 


His 


Pro 


He 


Lys 


Glu 


He 


Val 


Glu 


Arg 


Leu 


Ala 


Glu 


Glu 


65 










70 










75 










80 


Asp 


Gly 


Val 


Phe 


Leu 


Gly 


Pro 


Trp 


Glu 


Phe 


Leu 


Ser 


Arg 


Lys 


Gin 


Val 










85 










90 










95 




Asn 


Leu 


Gin 


His 


Gly 


Cys 


Tyr 


Lys 


Ala 


Leu 


Leu 


Ser 


Leu 


Pro 


Glu 


Asp 








100 










105 










110 






Lys 


Tyr 


Cys 


Asn 


Leu 


Leu 


Leu 


Pro 


Gin 


Gin 


Met 


Lys 


Thr 


Asn 


Leu 


Glu 






115 










120 










125 








Lys 


Met 


Glu 


Glu 


He 


Gin 


Arg 


Thr 


Arg 


Leu 


He 


His 


Ser 


Arg 


Thr 


Tyr 


130 










135 










140 










Asn 


Thr 


Pro 


Gin 


He 


Glu 


Leu 


Ser 


Asp 


Gin 


Leu 


Asp 


Gly 


Cys 


Val 


He 


145 










150 










155 










160 



Cys 



<210> 70 
<211> 1926 
<212> DNA 
<213> SHRIMP 



<400> 70 

atggttgctt 

acaaactttc 

gatgtagttt 

agtatagttt 

caaatggttt 

aatgaagaag 

ctcctcaact 

tttgacaggg 

tccacgtcaa 

aatatggaac 

tcttttacta 

ttcatatgca 

tgccctgata 

aattaccctg 

ggggatcata 

gaaattatag 



caactccgtg 
ttgaagctca 
attgtgactc 
ctaccattgg 
cttccgtctt 
ataaaggtat 
cattgttggt 
agaatatatg 
cttttcagac 
gtggcaaaca 
actttgaaaa 
atgaaaagga 
aaactaa tga 
cattgctata 
atgaatctct 
agtttataag 



tccaggccca 
caagcttgtc 
tgagacgtac 
agactatgtc 
agaaaaattt 
tcctctgtgg 
tcataacaac 
tgatgcttca 
aatttgcaaa 
aggagggagt 
tgaagaaatg 
aaaactagac 
tgaagggtac 
tctcgtttgt 
caaagcgttt 
tgatcactac 



ggaccagttc 
gtggaacttc 
accaaaccta 
ttatcaaacc 
cccttgctat 
aagaagttgt 
aagaactgga 
ggaaggagtg 
aacaacacac 
tttcttcact 
gactctcatg 
tctttcatac 
atcccgctgg 
aggtatggag 
gcaataagaa 
agtttcaaca 



caacccaaga 
ttctcccgtc 
taccgatttt 
ccaatgaaga 
tccactgcac 
acaacaaaag 
ctcctgttcc 
ttcttatgag 
attacttgtt 
tctttgcatc 
tgctcagtaa 
ctgccaacgg 
aaatagcaat 
catcttgggc 
atgatgcaaa 
aaaatgtgac 



acttctttct 
ctacagtagt 
tgggaacaag 
tgtgagttac 
ttataagacg 
aaaattcaaa 
agctatcccg 
tgaaataatg 
tgatatgtta 
taggaagaat 
catagcgaaa 
aaaaatacca 
tatggaagac 
aaacacatac 
agattgtctg 
gaaggaagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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tttgttaaag 
cgttgttaca 
tctaaagcta 
aaatgtcccc 
atggatttgt 
aaattagaca 
ctagcaaaaa 
aaaaggacag 
acagcagaaa 
aaaataattg 
tcaactgcag 
gattccgatc 
gaagaaacaa 
ctaactaaag 
ataattgcga 
acttctcata 
gaataa 



agaagactgt 
aactcccatg 
actttagatg 
caactataca 
tcaatcgtgc 
aaaaatcaaa 
ggacagcaga 
cagaaattga 
ttgaagaggc 
aaaagggaaa 
ccataaacat 
aggagttggc 
aacttgacag 
atgatgaaac 
agaaaaacaa 
ttgagaaact 



agaatgtgtt 
tggacatttc 
tgttaaatgt 
atggaaaatg 
atttgacaca 
acctaaacac 
aattgaagag 
agaggccata 
catgaagaaa 
aagacgactg 
gtacaaagag 
tgaggcgaaa 
atttttgttt 
gttagccttc 
ccaaaaaggc 
acctcaatcc 



ggatgtttat 
atgcatacat 
ttccaaacct 
ggtataaacc 
tatttagatt 
aaacctgaaa 
gccataaaga 
aagaaaaagg 
aaggaagaag 
aatgaagaat 
aaagtgagaa 
gagaggttga 
agaccgaaac 
aagttagccc 
agtgaaagaa 
actgctttgg 



atgatattga 
tttgcttgtc 
ttgatgacac 
aaacgactaa 
ttatttgctc 
acaaaaaggt 
aaaaggaaga 
aagaagaact 
aagaactctc 
gtgtcaagct 
ttaatggtgt 
ggaaaatttt 
gagtagaaga 
tagaaaagaa 
gagatggaga 
ctagtgtgtg 



agacgagaaa 
taataagtgt 
aatttttaga 
ccataaggaa 
atataacgtc 
ggaagaagaa 
agaactagca 
agcaaaaagg 
aaaatataat 
gagagatatt 
attactaaaa 
attgctagaa 
acgtatattc 
aacggaggac 
atatactata 
tgtgttaaac 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1926 



<210> 71 
<211> 637 
<212> PRT 
<213> SHRIMP 



<400> 71 



Met 


Val 


Ala 


Ser 


Thr 


Pro 


Cys 


Pro 


1 








5 








Glu 


Leu 


Leu 


Ser 


Thr 


Asn 


Phe 


Leu 








20 










Leu 


Leu 


Leu 


Pro 


Ser 


Tyr 


Ser 


Ser 






35 










40 


Thr 


Tyr 


Thr 


Lys 


Pro 


He 


Pro 


He 




50 










55 




Thr 


He 


Gly 


Asp 


Tyr 


Val 


Leu 


Ser 


65 










70 






Gin 


Met 


Val 


Ser 


Ser 


Val 


Leu 


Glu 










85 








Thr 


Tyr 


Lys 


Thr 


Asn 


Glu 


Glu 


Asp 








100 










Leu 


Tyr 


Asn 


Lys 


Arg 


Lys 


Phe 


Lys 






115 










120 


Asn 


Asn 


Lys 


Asn 


Trp 


Thr 


Pro 


Val 




130 










135 




Asn 


He 


Cys 


Asp 


Ala 


Ser 


Gly 


Arg 


145 










150 






Ser 


Thr 


Ser 


Thr 


Phe 


Gin 


Thr 


He 










165 








Phe 


Asp 


Met 


Leu 


Asn 


Met 


Glu 


Arg 








180 










His 


Phe 


Phe 


Ala 


Ser 


Arg 


Lys 


Asn 






195 










200 


Glu 


Met 


Asp 


Ser 


His 


Val 


Leu 


Ser 




210 










215 




Glu 


Lys 


Glu 


Lys 


Leu 


Asp 


Ser 


Phe 


225 










230 






Cys 


Pro 


Asp 


Lys 


Thr 


Asn 


Asp 


Glu 










245 








lie 


Met 


Glu 


Asp 


Asn 


Tyr 


Pro 


Ala 








260 










Gly 


Ala 


Ser 


Trp 


Ala 


Asn 


Thr 


Tyr 






275 










280 


Ala 


Phe 


Ala 


He 


Arg 


Asn 


Asp 


Ala 



Gly 


Pro 


Gly 


Pro 


Val 


Pro 


Thr 


Gin 




lu 










ID 




Glu 


Ala 


His 


Lys 


Leu 


Val 


Val 


Glu 


25 










30 






Asp 


Val 


Val 


Tyr 


Cys 


Asp 


Ser 


Glu 










45 








Phe 


Gly 


Asn 


Lys 


Ser 


He 


Val 


Ser 








60 










Asn 


Pro 


Asn 


Glu 


Asp 


Val 


Ser 


Tyr 






75 










80 


Lys 


Phe 


Pro 


Leu 


Leu 


Phe 


His 


Cys 




90 










95 




Lys 


Gly 


He 


Pro 


Leu 


Trp 


Lys 


Lys 


105 










110 






Leu 


Leu 


Asn 


Ser 


Leu 


Leu 


Val 


His 










125 








Pro 


Ala 


He 


Pro 


Phe 


Asp 


Arg 


Glu 








140 










Ser 


Val 


Leu 


Met 


Ser 


Glu 


He 


Met 






155 










160 


Cys 


Lys 


Asn 


Asn 


Thr 


His 


Tyr 


Leu 




170 










175 




Gly 


Lys 


Gin 


Gly 


Gly 


Ser 


Phe 


Leu 


185 










190 






Ser 


Phe 


Thr 


Asn 


Phe 


Glu 


Asn 


Glu 










205 








Asn 


He 


Ala 


Lys 


Phe 


He 


Cys 


Asn 








220 










lie 


Pro 


Ala 


Asn 


Gly 


Lys 


He 


Pro 






235 










240 


Gly 


Tyr 


He 


Pro 


Leu 


Glu 


He 


Ala 




250 










255 




Leu 


Leu 


Tyr 


Leu 


Val 


Cys 


Arg 


Tyr 


265 










270 






Gly 


Asp 


His 


Asn 


Glu 


Ser 


Leu 


Lys 










285 








Lys 


Asp 


Cys 


Leu 


Glu 


He 


He 


Glu 
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290 

Phe lie Ser Asp 
305 

Phe Val Lys Glu 

Glu Asp Glu Lys 
340 



inr 


fne 


Cys 


lieu 






"5 C C 

ODD 




i»ys 


Cys 


Pne 


Gin 




370 






Tnr 


T 1 _ 

lie 


Gin 


Trp 


385 








Met 


Asp 


Leu 


Phe 


Ser 


Tyr 


Asn 


Val 








420 


Glu 


Asn 


Lys 


Lys 






435 




Glu 


Glu 


Ala 


lie 




450 






Glu 


lie 


Glu 


Glu 


4 65 








Thr 


Ala 


Glu 


lie 


Ser 


Lys 


Tyr 


Asn 








500 


Glu 


Cys 


Val 


Lys 






515 




Lys 


Glu 


Lys 


Val 




530 






Glu 


Leu 


Ala 


Glu 


545 








Glu 


Glu 


Thr 


Lys 


Glu 


Arg 


He 


Phe 








580 


Ala 


Leu 


Glu 


Lys 






595 




Lys 


Gly 


Ser 


Arg 




610 






Leu 


Pro 


Gin 


Ser 


625 









295 



nlS 


Tyr 


■JCl 


iriifci 




Jiu 






Lys 


Thr 


Val 


Glu 


325 








Arg 


Cys 


Tyr 


Lys 




Asn 


Lys 


Cys 










lnr 


irne 


Asp 


Asp 






"7 C 




Lys 


Met 


Gly 


Ti- 
ne 




jyo 






Asn 


Arg 


Ala 


Phe 


405 








Lys 


Leu 


Asp 


Lys 


Val 




olU 












Lys 


Lys 


Lys 


GlU 






yl C C 

4 3D 




Ala 


lie 


Lys 


Lys 




4 /O 






Glu 


Glu 


Ala 


Met 


485 








Lys 


lie 


lie 


GlU 


Leu 


Arg 


Asp 


lie 








c o 


Arg 


lie 


Asn 


Gly 






535 




Ala 


Lys 


Glu 


Arg 




550 






Leu 


Asp 


Arg 


Phe 


565 








Leu 


Thr 


Lys 


Asp 


Lys 


Thr 


Glu 


Asp 








600 


Asp 


Gly 


Glu 


Tyr 






615 




Thr 


Ala 


Ser 


Val 




630 







300 



Asn 


Lys 




Val 












val 




Cys 










Leu 


Pro 


Cys 


Gly 


345 








Ser 


Lys 


Ala 


Asn 


inr 


lie 


irne 


Arg 








JtJU 


Asn 


ri n 

uin 


i nr 


x nr 










Asp 


inr 


Tyr 


Leu 




4 1U 






Lys 


Ser 


Lys 


Pro 


425 








T At 1 

Leu 


Ala 


Lys 


Arg 




GlU 


Leu 


Aia 








4 bU 


Lys 


Glu 


Glu 


GlU 






4 /o 




Lys 


Lys 


Lys 


GlU 




4 90 






Lys 


Gly 


Lys 


Arg 


505 








Ser 


Thr 


Ala 


Ala 


Val 


Leu 


Leu 


Lys 








540 


Leu 


Arg 


Lys 


He 






555 




Leu 


Phe 


Arg 


Pro 




570 






Asp 


Glu 


Thr 


Leu 


585 








He 


He 


Ala 


Lys 


Thr 


He 


Thr 


Ser 








620 


Cys 


Val 


Leu 


Asn 






635 





Thr 


Lys 


Glu 


Glu 








Jew 


Leu 




Asp 


He 












rile 














Phe 


Arg 


Cys 


Val 


365 








Lys 


^yo 


Pro 


XT X. U 


Asn 


nis 


Lys 


GXU 








4 uu 


Asp 


roe 


-lie 


Cys 






4 1D 




Lys 


rixs 


Lys 


jfro 










Thr 


Ala 


Glu 


He 


445 








Lys 


Arg 


inr 


ax a 


Leu 




Lys 


Arg 








4 oU 


GlU 


ri,. 

GlU 


GlU 


Leu 






4 273 




Arg 


Leu 


Asn 


GlU 




OlU 






Tl a 

lie 


Asn 


ritr t- 


iyr 


525 








Asp 


Ser 


Asp 


Gin 


Leu 


Leu 


Leu 


Glu 








560 


I*ys 


Arg 


Val 


Glu 






575 




Ala 


Phe 


Lys 


Leu 




590 






Lys 


Asn 


Asn 


Gin 


605 








His 


He 


Glu 


Lys 



Glu 



<210> 72 
<211> 780 
<212> DNA 
<213> SHRIMP 



<400> 72 

atgtctactt 

ccattcgacc 

tttttcatta 

tcaaagatta 

aatatataca 

attatactga 

attcttccca 

aacaatgaac 

caaaatctag 

ataaaagegt 



gttcgaattt 
tcgtccatac 
ctgaaacgtg 
ttgacacggg 
aagctataaa 
tggatgtaga 
agggtaaaaa 
ggcattacta 
tagattatta 
gcgcatattt 



gttgtcagta 
aegtcaagag 
taaaggagag 
taataatgat 
ccatttagaa 
aacaatgata 
tggaagtttc 
ccctgtattt 
taatgaaatt 
catgtacaac 



tt tggtggag 
tgtgataaaa 
aa tattggta 
tctacctcaa 
aatatcctaa 
ctggaaactc 
agtacatgcg 
gaaacagaga 
gtagctcaca 
tttctaactc 



gagattggac 
agagagagca 
tacattegta 
tagaggaact 
aactcaacaa 
ataaaatttt 
tacgetttge 
aagaagcgtt 
ccaatgacca 
tccacccttt 



aacaacattc 60 
agactactca 120 
tgaacacacg 180 
agaagtactg 240 
aggagaaaaa 300 
aatgaaaggg 360 
tgtaaataag 420 
caattctata 480 
aattaaaata 540 
caatgatggt 600 
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aatggaagaa cagctagatt attgtatagt tttctattga aaggtaatgg tatcgtacct 660 
catttttcac ccataacaca ccctagggat caatttgttg atactttagt gtattfctaga 720 
gaacatggag atggacgacc tttattgtat gttttgctgg aatcaataaa aaataagtaa 780 



<210> 73 
<211> 255 
<212> PRT 
<213> SHRIMP 



<400> 73 



Met 


Ser 


Thr 


Cys 


Ser 


Asn 


Leu 


Leu 


Ser 


Val 


Phe 


Gly 


Gly 


Gly 


Asp 


Trp 


1 








5 










10 










ID 




Thr 


Thr 


Thr 


Phe 
20 


Pro 


Phe 


Asp 


Leu 


Val 


His 


Thr 


Arg 


Gin 


Glu 
JU 


Cys 


Asp 


Lys 


Lys 


Arg 
35 


Glu 


Gin 


Asp 


Tyr 


Ser 
40 


Phe 


Phe 


He 


Thr 


Glu 

A C 


Thr 


Cys 


Lys 


Gly 


Glu 
50 


Asn 


He 


Gly 


He 


His 
55 


Ser 


Tyr 


Glu 


His 


Thr 
60 


Ser 


.Lys 


T 1 ^ 

lie 


lie 


Asp 


Thr 


Gly 


Asn 


Asn 


Asp 


Ser 


Thr 


Ser 


He 


Glu 


Glu 


Leu 


Glu 


Val 


Leu 


65 










70 










75 










80 


Asn 


He 


Tyr 


Lys 


Ala 


He 


Asn 


His 


Leu 


Glu 


Asn 


He 


Leu 


Lys 


Leu 


Asn 








85 










90 










95 




Lys 


Gly 


Glu 


Lys 
100 


He 


He 


Leu 


Met 


Asp 
105 


Val 


Glu 


Thr 


Met 


He 
110 


Thr 


His 


Lys 


He 


Leu 
115 


Met 


Lys 


Gly 


He 


Leu 
120 


Pro 


Lys 


Gly 


Lys 


Asn 
125 


Gly 


Ser 


Phe 


Ser 


Thr 
130 


Cys 


Val 


Arg 


Phe 


Ala 
135 


Val 


Asn 


Lys 


Asn 


Asn 
140 


Glu 


Arg 


His 


Tyr 


Tyr 


Pro 


Val 


Phe 


Glu 


Thr 


Glu 


Lys 


Glu 


Ala 


Phe 


Asn 


Ser 


He 


Gin 


Asn 


145 










150 










155 










160 


Leu 


Val 


Asp 


Tyr 


Tyr 
165 


Asn 


Glu 


He 


Val 


Ala 
170 


His 


Thr 


Asn 


Asp 


Gin 
175 


He 


Lys 


He 


He 


Lys 
180 


Ala 


Cys 


Ala 


Tyr 


Phe 
185 


Met 


Tyr 


Asn 


Phe 


Leu 
190 


Thr 


Leu 


His 


Pro 


Phe 
195 


Asn 


Asp 


Gly 


Asn 


Gly 
200 


Arg 


Thr 


Ala 


Arg 


Leu 
205 


Lys 


Phe 


Leu 


Leu 


Lys 
210 


Gly 


Asn 


Gly 


He 


Val 
215 


Pro 


His 


Phe 


Ser 


Pro 
220 


He 


Thr 


His 


Pro 


Arg 


Asp 


Gin 


Phe 


Val 


Asp 


Thr 


Leu 


Val 


Tyr 


Phe 


Arg 


Glu 


His 


Gly 


Asp 


225 










230 










235 










240 


Gly 


Arg 


Pro 


Leu 


Leu 
245 


Tyr 


Val 


Leu 


Leu 


Glu 
250 


Ser 


He 


Lys 


Asn 


Lys 
255 





<210> 74 
<211> 480 
<212> DNA 
<213> SHRIMP 



<400> 74 

atggaggacc 

caatggacaa 

gaggaacaaa 

ctcgcagtaa 

aaagaattga 

gcaacctcta 

acactcaaag 

atcaataaca 



taaaatccac 
atactgtaga 
aaagggccct 
aagaagggat 
taaatacgtg 
ccaataatca 
aaaagtacaa 
aaagggatta 



tatcgagaga 
ggaagaagaa 
ggacgcatgg 
atctgcactc 
gatagccgaa 
actgaagaac 
caaatattac 
tgaagcaagt 



gtatatgaag 
aggactgtct 
gaagcagcga 
gttttcaacg 
agggaaacgt 
cagatgtcat 
agaagaagtg 
caattttggg 



aaagagtgga 
cagcaatcga 
taaaggaacg 
cagcagacgc 
cagaaaaaag 
ctctagtcaa 
ccatactcaa 
tgtatacaaa 



gaatctagaa 
ttctgtcctg 
agaaaacgac 
caaaacacgt 
aagaaaggaa 
cacaaccaaa 
catgcaatac 
caatgcataa 



60 

120 

180 

240 

300 

360 

420 

480 
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<210> 75 
<211> 159 
<212> PRT 
<213> SHRIMP 



<400> 75 



Met 


Glu 


Asp 


Leu 


1 








Glu 


Asn 


Leu 


Glu 








20 


Val 


Ser 


Ala 


He 






35 




Ala 


Trp 


Glu 


Ala 




50 






Glu 


Gly 


He 


Ser 


65 








Lys 


Glu 


Leu 


He 


Arg 


Arg 


Lys 


Glu 








100 


Ser 


Ser 


Leu 


Val 






115 




Tyr 


Tyr 


Arg 


Arg 




130 






Arg 


Asp 


Tyr 


Glu 


145 









Lys Ser Thr He 
5 

Gin Trp Thr Asn 

Asp Ser Val Leu 
40 

Ala He Lys Glu 
55 

Ala Leu Val Phe 
70 

Asn Thr Trp He 
85 

Ala Thr Ser Thr 

Asn Thr Thr Lys 
120 

Ser Ala lie Leu 
135 

Ala Ser Gin Phe 
150 



Glu 


Arg 


Val 


Tyr 










Thr 


Val 


Glu 


Glu 


25 








Glu 


Glu 


Gin 


Lys 


Arg 


Glu 


Asn 


Asp 








60 


Asn 


Ala 


Ala 


Asp 






75 




Ala 


Glu 


Arg 


Glu 




90 






Asn 


Asn 


Gin 


Leu 


105 








Thr 


Leu 


Lys 


Glu 


Asn 


Met 


Gin 


Tyr 








140 


Trp 


Val 


Tyr 


Thr 






155 





Glu Glu Arg Val 
15 

Glu Glu Arg Thr 
30 

Arg Ala Leu Asp 
45 

Leu Ala Val Lys 

Ala Lys Thr Arg 
80 

Thr Ser Glu Lys 
95 

Lys Asn Gin Met 
110 

Lys Tyr Asn Lys 
125 

He Asn Asn Lys 
Asn Asn Ala 



<210> 76 
<211> 321 
<212> DNA 
<213> SHRIMP 



<400> 76 

atgcataagt tttcaaataa attttatttt ataataaaag gtgttttaat tataattttt £0 

gtccctgatg ttgtgttttc tatattttta ctgcctcctc ttggggtaag acataaaaac 120 

ggtggcggcg gaaacgagga acagaagagc ggacccagcc agaagcatca tatccctggt 180 

cctgttctta tatttgtcct catcatcgtt atcgttggca gtgtcgtcat catcatcggt 240 

gtccttatca gtgtcaggat cgctgtcctt ctttggtccc atccatacat tcatgacggc 300 

caggacgagg ataccaattg a 321 

<210> 77 
<211> 106 
<212> PRT 
<213> SHRIMP 

<400> 77 



Met 


His 


Lys 


Phe 


Ser 


Asn 


Lys 


Phe 


Tyr 


Phe 


He 


He 


Lys 


Gly 


Val 


Leu 


1 








5 










10 










15 




He 


He 


He 


Phe 
20 


Val 


Pro 


Asp 


Val 


Val 
25 


Phe 


Ser 


He 


Phe 


Leu 
30 


Leu 


Pro 


Pro 


Leu 


Gly 
35 


val 


Arg 


His 


Lys 


Asn 
40 


Gly 


Gly 


Gly 


Gly 


Asn 
45 


Glu 


Glu 


Gin 


Lys 


Ser 


Gly 


Pro 


Ser 


Gin 


Lys 


His 


His 


He 


Pro 


Gly 


Pro 


Val 


Leu 


He 




50 










55 










60 










Phe 


Val 


Leu 


He 


He 


Val 


He 


val 


Gly 


Ser 


Val 


Val 


He 


He 


He 


Gly 


65 










70 










75 










80 


Val 


Leu 


lie 


Ser 


Val 
85 


Arg 


He 


Ala 


Val 


Leu 
90 


Leu 


Trp 


Ser 


His 


Pro 
95 


Tyr 


He 


His 


Asp 


Gly 
100 


Gin 


Asp 


Glu 


Asp 


Thr 
105 


Asn 
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<210> 78 
<211> 1635 
<212> DNA 
<213> SHRIMP 



<400> 78 

atgttccgtc 

tccacggcgt 

ggagaaccac 

tgcgaggacc 

atacaggtaa 

attgatagga 

gacgggggtg 

aagaaattgt 

gaaaacagta 

gaatgtggat 

gagattaaag 

tcattaattg 

gcagtcaact 

tattgttgtg 

tctttaacag 

gagtgcgtgc 

agggactgct 

ttcctggaca 

tcttccaaca 

aggagtaact 

gcatcagcct 

atggtaagaa 

gcctctaaag 

gcatctttac 

aaatggatga 

tggagttaca 

ccctatcaga 

ccattatact 



aattctgttc 
ccgcttcggc 
cttctcataa 
cagaaagggt 
caaaagaatg 
gcggaattga 
gaaatggggg 
cctttgttgg 
cagaatttgc 
gcatcaaatc 
aggctgctga 
actggttttt 
ggtacttgaa 
tcttttccta 
tggcttcatc 
aacacttttt 
tcttccctct 
caatgaagag 
acaagagaaa 
tttctacagc 
gtattcttat 
aatcaactat 
acactggtgc 
tcctccctgg 
ctctagcaga 
cgctagcaga 
catgcggcaa 
tttaa 



actatatctt 
agcggcctca 
aatgagggga 
atatgttgat 
ggaattgttt 
gaataatggc 
aagaattgaa 
aagagaagat 
acgtcgttta 
aaaagaggaa 
cgttaggggt 
tgagttcgac 
atctcaagcg 
cataagacgc 
cttttctccc 
aaaatcagac 
tttaactgaa 
acctacctta 
gagaaatact 
atccaataac 
cgaagggtat 
tattccagaa 
gaatatctta 
actttttaga 
aaacaacatc 
ttattgtcct 
ttttacttcg 



cttcaacgcc 
ttaaaaggag 
ccttcttata 
attgtagtgt 
tccgataagt 
gaaggagaag 
gacagagaag 
ccagtcgctg 
acactcaagg 
ctctttgact 
atggcaaaca 
acatatagta 
tctccaattt 
caaacctttt 
acgcccgaca 
attaatatta 
atgccccgcc 
tctcttctac 
gccgctgcca 
aagagactga 
gcgaatggaa 
gtgtttaacc 
ttttttatca 
caccccaaac 



gggttaacga 
atggaactga 
gcgttttagg 
ctattttgca 
tgagaaaatt 
aagatggaga 
cacatcgacg 
tagatttacc 
aattgtgcga 
tcatttttga 
ggagtaaatt 
aatgtgtagt 
cattggtact 
taactagggc 
caaagctttt 
gccagatggc 
aacaaaaaaa 
cttccacctc 
atattcttct 
aaactgatga 
aaataagccc 
atcttttgtt 
aaatgaaatc 
aatttctcaa 



aacgacaaca acataaactc 
ctgggctatt acacccaaga 
actacaaaca agagactaca 



caatcttaga 
atttataacc 
acctgatccg 
gacaaataat 
gggtccatgg 
tgaaaatgaa 
aaaaatgatg 
cacgtggcga 
tttaatagtt 
agaaccgtgg 
caccaaggaa 
attttttgaa 
agatgatata 
aaaaaaccca 
ggctatcgac 
attaactgaa 
agtaaacacc 
ctcctcctct 
tccagtgtac 
tggggaaaat 
tataaggatt 
ccctgtcttt 
ctttgcaagt 
cgggccgtgc 
ttccacgatg 
gagccctcaa 
aaacgtgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1635 



<210> 79 
<211> 540 
<212> PRT 
<213> SHRIMP 



<400> 79 



Met 


Phe 


Arg 


Gin 


Phe 


Cys 


Ser 


Leu 


1 








5 








Asp 


Asn 


Leu 


Arg 


Ser 


Thr 


Ala 


Ser 








20 










Gly 


Asp 


Gly 


Thr 


Glu 


Phe 


He 


Thr 






35 










40 


Arg 


Gly 


Pro 


Ser 


Tyr 


Ser 


Val 


Leu 




50 










55 




Glu 


Arg 


Val 


Tyr 


Val 


Asp 


He 


Val 


65 










70 






lie 


Gin 


Val 


Thr 


Lys 


Glu 


Trp 


Glu 










85 








Leu 


Gly 


Pro 


Trp 


He 


Asp 


Arg 


Ser 








100 










Glu 


Glu 


Asp 


Gly 


Asp 


Glu 


Asn 


Glu 






115 










120 


He 


Glu 


Asp 


Arg 


Glu 


Ala 


His 


Arg 




130 










135 





Tyr 


Leu 
10 


Leu 


Gin 


Arg 


Arg 


Val 
15 


Asn 


Ala 


Ser 


Ala 


Ala 


Ala 


Ser 


Leu 


Lys 


25 










30 






Gly 


Glu 


Pro 


Pro 


Ser 
45 


His 


Lys 


Met 


Gly 


Pro 


Asp 


Pro 
60 


Cys 


Glu 


Asp 


Pro 


Val 


Ser 


He 
75 


Leu 


Gin 


Thr 


Asn 


Asn 
80 


Leu 


Phe 
90 


Ser 


Asp 


Lys 


Leu 


Arg 
95 


Lys 


Gly 


He 


Glu 


Asn 


Asn 


Gly 


Glu 


Gly 


105 










110 






Asp 


Gly 


Gly Gly 


Asn 


Gly 


Gly 


Arg 










125 








Arg 


Lys 


Met 


Met 
140 


Lys 


Lys 


Leu 


Ser 
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dk a 


vaj. 


ijiy 


Arg 


145 








Glu 


Asn 


Ser 


Thr 


Asp 


lieu 


lie 


va i 








.LOU 


Asp 


rne 


lie 












Arg 


taiy 


M a 4- 


nici 










Trp 


OK a 


Fne 


CjlU 


225 








Ala 


Val 


Asn 


Trp 


Asp 


Tl . 

He 


Tyr 


Cys 








^ bU 


Tnr 


Arg 


Ala 


Lys 






one 




Thr 


Pro 


Asp 


inr 










T All 

Lieu 


Lys 


ber 


Asp 


305 








Cys 


Phe 


Phe 


Pro 


Asn 


Thr 


Pne 


Leu 










Ser 


Thr 


Ser 


Ser 






•ace 




Ala 


Ala 


Ala 


Asn 




"a t a 
J /U 






Asn 


Asn 


Lys 


Arg 


385 








He 


Leu 


He 


Glu 


Met 


Val 


Arg 


Lys 








420 


Phe 


Pro 


Val 


Phe 










He 


Lys 


Met 


Lys 




450 






Phe 


Arg 


His 


Pro 


4 65 








Leu 


Ala 


Glu 


Asn 


Trp 


Ser 


Tyr 


Thr 








500 


Glu 


Ser 


Pro 


Gin 






515 




Asn 


Lys 


Arg 


Leu 




530 











Pro 


Val 




150 








rile 




Arg 


165 








Glu 


Cys 


Gly 


Cys 


LilU 


r* i ii 

ulu 


Pro 


Trp 










Asn 


Arg 


Se r 


Lys 










rne 


Asp 


i or 


Tyr 




Ton 






Tyr 


Leu 


Lys 




245 








Cys 


Val 


Phe 


Ser 


Asn 


Pro 




Leu 










Lys 


Leu 


Leu 


nla 










Tl Q 

i le 


Asn 


ne 












Leu 


Leu 


i nr 




325 








Asp 


Thr 


Met 


Lys 


O A V- 

ber 


O a w 

ber 


ber 


O A •»- 

ber 










T 1 A 

He 


Leu 


T Al ■» 

Leu 


Pro 










Leu 


Lys 


rnr 


Asp 




■a or\ 






Gly 


Tyr 


TV "1 

Ala 


Asn 


405 








Ser 


Thr 


He 


He 


Ala 


Ser 


Lys 


Asp 








A A t\ 


Ser 


Phe 


Ala 


Ser 






455 




Lys 


Gin 


Phe 


Leu 




470 






Asn 


He 


Asn 


Asp 


485 








Leu 


Ala 


Asp 


Tyr 


Pro 


Tyr 


Gin 


Thr 








520 


Gin 


Asn 


Val 


Gin 






535 





Ala 


Va 1 
val 


Asp 


JjCU 






155 




Arg 


Leu 


± hi. 


Leu 




170 






Tie 


Lys 




Lys 


185 








Glu 


He 


Lys 


Glu 


irne 


i nr 




UlU 








ZZU 


Ser 


Lys 


Cys 


Val 










uin 


Asp 


X I A 

lie 






^ JU 






Tyr 


T 1 a 

1 1c 


Arg 


Arg 


265 








Thr 


Val 


Ala 


Ser 


X 1 A 

lie 


Asp 


/^i ii 

ulU 


Cys 










bin 


Met 


Ala 


Leu 






315 




wet 


Pro 


Arg 






O -JU 






Arg 


Pro 


i nr 


Leu 


345 








Asn 


Asn 


Lys 


Arg 


val 


Tyr 


Arg 


Caw 

ber 










Asp 


Gly 


GlU 


Asn 










Gly 


Lys 


T 1 A 

lie 


Ser 




/lift 






Pro 


GlU 


vai 


fc*ne 


425 








Thr 


Gly 


Ala 


Asn 


Ala 


Ser 


Leu 


Leu 








460 


Asn 


Gly 


Pro 


Cys 






475 




Asn 


Asn 


He 


Asn 




4 90 






Cys 


Pro 


Leu 


Gly 


505 








Cys 


Gly 


Asn 


Phe 


Pro 


Leu 


Tyr 


Phe 



540 



Pro Thr Trp Arg 
160 

Lys Glu Leu Cys 
175 

Glu Glu Leu Phe 
190 

Ala Ala Asp Val 
205 

Ser Leu He Asp 

Val Phe Phe Glu 
240 

Leu Val Leu Asp 
255 

Gin Thr Phe Leu 
270 

Ser Phe Ser Pro 
285 

Val Gin His Phe 

Thr Glu Arg Asp 
320 

Gin Lys Lys Val 
335 

Ser Leu Leu Pro 
350 

Lys Arg Asn Thr 
365 

Asn Phe Ser Thr 

Ala Ser Ala Cys 
400 

Pro He Arg He 
415 

Asn His Leu Leu 
430 

He Leu Phe Phe 
445 

Leu Pro Gly Leu 

Lys Trp Met Thr 
480 

Ser Ser Thr Met 
495 

Tyr Tyr Thr Gin 
510 

Thr Ser Thr Thr 
525 



<210> 80 
<211> 582 
<212> DNA 
<213> SHRIMP 



<400> 80 

atggatgact ctagcagaaa acaacatcaa cgacaacaac ataaactctt ccacgatgtg 60 

gagttacacg ctagcagatt attgtcctct gggctattac acccaagaga gccctcaacc 120 

ctatcagaca tgeggcaatt ttacttcgac tacaaacaag agactacaaa acgtgcagcc 180 
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attatacttt 
ccgtttaatc 
aaaataagtg 
gtttcttcgc 
gtaatggaat 
cgctccagaa 
agacaaatgt 



taaacactct 
tcttgcttaa 
caggtatcat 
ccaataataa 
atcctatcgg 
tggtcacctg 
ttgatataat 



tttggaatac 
tgtgatgaat 
atcgaaactc 
taataacaac 
gggtttattg 
tacaaagggc 
agaagcaaat 



tacaggacac 
aacaagtgga 
ccatggacca 
ggagactact 
cacacgcctg 
aaagaccacc 
ggacaactct 



cttcagaaga 
gtacactcat 
tgaaaacaat 
attctacatg 
ccataactaa 
agaagctata 
ga 



gtgggaaatt 240 
tccaggtgtc 300 
gtacgagatt 360 
caggcgaatg 4 20 
taagtatcca 4 80 
tgacatctct 540 
582 



<210> 81 
<211> 193 
<212> PRT 
<213> SHRIMP 



<400> 81 



Met 


Asp 


Asp 


Ser 


Ser 


Arg 


Lys 


Gin 


His 


Gin 


Arg 


Gin 


Gin 


His 


Lys 


Leu 


1 








5 










10 










15 




Phe 


His 


Asp 


Val 
20 


Glu 


Leu 


His 


Ala 


Ser 
25 


Arg 


Leu 


Leu 


Ser 


Ser 
30 


Gly 


Leu 


Leu 


His 


Pro 
35 


Arg 


Glu 


Pro 


Ser 


Thr 
40 


Leu 


Ser 


Asp 


Met 


Arg 
45 


Gin 


Phe 


Tyr 


Phe 


Asp 
50 


Tyr 


Lys 


Gin 


Glu 


Thr 

55 


Thr 


Lys 


Arg 


Ala 


Ala 
60 


He 


He 


Leu 


Leu 


Asn 


Thr 


Leu 


Leu 


Glu 


Tyr 


Tyr 


Arg 


Thr 


Pro 


Ser 


Glu 


Glu 


Trp 


Glu 


He 


65 










70 










75 










80 


Pro 


Phe 


Asn 


Leu 


Leu 
85 


Leu 


Asn 


Val 


Met 


Asn 
90 


Asn 


Lys 


Trp 


Ser 


Thr 
95 


Leu 


lie 


Pro 


Gly 


Val 
100 


Lys 


lie 


Ser 


Ala 


Gly 
105 


He 


He 


Ser 


Lys 


Leu 
110 


Pro 


Trp 


Thr 


Met 


Lys 
115 


Thr 


Met 


Tyr 


Glu 


He 
120 


Val 


Ser 


Ser 


Pro 


Asn 
125 


Asn 


Asn 


Asn 


Asn 


Asn 
130 


Gly 


Asp 


Tyr 


Tyr 


Ser 
135 


Thr 


Cys 


Arg 


Arg 


Met 
140 


Val 


Met 


Glu 


Tyr 


Pro 


He 


Gly 


Gly 


Leu 


Leu 


His 


Thr 


Pro 


Ala 


He 


Thr 


Asn 


Lys 


Tyr 


Pro 


145 










150 










155 










160 


Arg 


Ser 


Arg 


Met 


Val 
165 


Thr 


Cys 


Thr 


Lys 


Gly 
170 


Lys 


Asp 


His 


Gin 


Lys 
175 


Leu 


Tyr 


Asp 


He 


Ser 
180 


Arg 


Gin 


Met 


Phe 


Asp 
185 


He 


He 


Glu 


Ala 


Asn 
190 


Gly 


Gin 



Leu 



<210> 82 
<211> 615 
<212> DNA 
<213> SHRIMP 



<400> 82 

atggatcttt 

gctgtattta 

acagacaata 

ggatcaggct 

aagatccgca 

actcccgtgg 

acattcaagg 

ccaaagatta 

gtctctattg 

attgcagcta 

actgagaccg 



ctttcactct 
ttgtgatttt 
tcgagacaaa 
acttcaagat 
atggaaagtc 
agggccgagc 
tgtggaacaa 
acccatcaaa 
atgaggatga 
ccgccggtgg 
agtaa 



ttcggtcgtg 
taggtatcac 
catggatgaa 
gactgatgtg 
tgatgcacag 
actcgaagtg 
cacatcaaga 
ggcctttgtc 
agttggcacc 
aaatcttttc 



tcggccatcc 
aacactgtga 
aacctccgca 
tcctttgaca 
atgaaggaag 
actgtggggc 
aagatcaaca 
ggtagctcca 
tttgtgtgtg 
gacatgtacg 



tcgccatcac 
ccaagaccat 
ttcctgtgac 
gcgacacctt 
aagatgcgga 
agaatctcac 
tcactggtat 
acacctcctc 
gtaccacctt 
tgcacgtcac 



tgctgtgatt 
cgaaacccac 
tgctgaggtt 
gggcaaaatc 
tcttgtcatc 
ctttgaggga 
gcagatggtg 
cttcaccccc 
tggcgcacca 
ctactctggc 



60 

120 

180 

240 

300 

360 

420 

480 

54 0 

600 

615 



<210> 83 
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<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 83 



Met 


Asp 


Leu 


oer 


1 








Thr 


Ala 


vai 


T 1 Q 








£ V 


Val 


Thr 


Lys 


Thr 






35 




Asp 


Glu 


Asn 


Leu 




50 






Phe 


Lys 


Met 


inr 


65 








Lys 


He 


Arg 


Asn 


Asp 


Leu 


Val 


He 








100 


Gly 


Gin 


Asn 


Leu 






115 




Ser 


Arg 


Lys 


He 




130 






Pro 


Ser 


Lys 


Ala 


145 








Val 


Ser 


He 


Asp 


Phe 


Gly 


Ala 


Pro 








180 


Tyr 


Val 


His 


Val 






195 





rne 


i nr 


Leu 


Cat* 


c 
O 








Aid 


vai 


DV-i 0 

rue 


Tie 


j. ±e 


bill 


i nr 


u{ e 
J1J.5 








^ n 


Arg 


He 


Pro 


Val 






55 




Asp 


vai 


Ser 


xrlie 




70 






Gly 


Lys 


Ser 


Asp 


85 








Thr 


Pro 


Val 


Glu 


Thr 


Phe 


Glu 


Gly 








120 


Asn 


He 


Thr 


Gly 






135 




Phe 


Val 


Gly 


Ser 




150 






Glu 


Asp 


Glu 


val 


165 








He 


Ala 


Ala 


Thr 


Thr 


Tyr 


Ser 


Gly 








200 



\7-j 1 

Vdi 


Val 


OCX. 


a 1 a 

nia 




i n 






vai 


j. ±e 


rfie 


Arg 


£.0 










Asp 


Asn 


Tip 




TV 1 -1 


Glu 


Val 








60 


Asp 


Ser 


Asp 


Thr 






75 




Ala 


Gin 


Met 


Lys 




90 






Gly 


Arg 


Ala 


Leu 


105 








Thr 


Phe 


Lys 


Val 


Met 


Gin 


Met 


Val 








140 


Ser 


Asn 


Thr 


Ser 






155 




Gly 


Thr 


Phe 


Val 




170 






Ala 


Gly 


Gly Asn 


185 








Thr 


Glu 


Thr 


Glu 



He Leu Ala He 
15 

Tyr His Asn Thr 
30 

Glu Thr Asn Met 
45 

Gly Ser Gly Tyr 

Leu Gly Lys He 
80 

Glu Glu Asp Ala 
95 

Glu Val Thr Val 
110 

Trp Asn Asn Thr 
125 

Pro Lys He Asn 

Ser Phe Thr Pro 
160 

Cys Gly Thr Thr 
175 

Leu Phe Asp Met 
190 



<210> 84 
<211> 888 
<212> DNA 
<213> SHRIMP 



<400> 84 

atgaagaat t 

ttttattttt 

ttattatctc 

tcacgaccaa 

gaaaaagaag 

ateggaagag 

tttgaacaac 

cctggagaag 

cctactccta 

gaagatgatg 

gatagtggag 

agactagaaa 

caaccagatc 

gagtcgtgga 

ataaccaaca 



cgeggcagag 
ctggtgccat 
tcatactttt 
ataaaactac 
atgacaccga 
acgaggaagg 
cgagtgtaaa 
gtttgttacc 
ctcctccacc 
atgatgatat 
aagaaatgga 
caggcatgag 
ctgtagttgt 
aagttttgaa 
agattaaagc 



gtcaggcgtg 
aattgaatgt 
tgtgtgtttt 
tacaacatcc 
aggagctgta 
tgctgtagaa 
tactgggtcc 
tccaccccct 
tcccccgaca 
agacatagac 
agaagacgaa 
eggegaagaa 
caaaaacata 
cagatctgag 
catgaeggaa 



tggaggggta 
aaaaaaataa 
gtgggtgtgg 
aataaaaaaa 
ttagggcgaa 
gaeggaaaag 
aatacaggag 
cctactccta 
cgaaccccat 
tttgatgata 
gaagaggaag 
gtagattttg 
gaaaggtcag 
ggagatatta 
gatctgaagg 



actcgtgttt 
ggataataat 
tggttggtgt 
caaagaaaga 
gggaacctga 
aagaggagga 
gtggaggaac 
ctcctactcc 
ctccttcttc 
atgatataga 
atttggacac 
atgeatcate 
attatactct 
gattcttcgt 
aactgtaa 



gtacaaatca 
gatgttcctt 
gattttcatg 
taaagagaag 
aaataggccg 
agtttttgaa 
aggaactgtg 
tccacctact 
atctcttggg 
cgaattttta 
actcctttca 
tgcatatatt 
ggacccaatg 
agatcgaggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

888 



<210> 85 
<211> 295 
<212> PRT 
<213> SHRIMP 



<400> 85 
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Met 


Lys 


Asn 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly 


Val 


Trp 


Arg 


Gly 


Asn 


Ser 


Cys 


1 








5 










10 










15 




Leu 


Tyr 


Lys 


Ser 
20 


Phe 


Tyr 


Phe 


Ser 


Gly 
25 


Ala 


He 


He 


Glu 


Cys 
30 


Lys 


Lys 


He 


Arg 


He 
35 


He 


Met 


Met 


Phe 


Leu 
40 


Leu 


Leu 


Ser 


Leu 


He 
45 


Leu 


Phe 


Val 


Cys 


Phe 


Val 


Gly 


Val 


Val 


Val 


Gly Val 


He 


Phe 


Met 


Ser 


Arg 


Pro 


Asn 




50 










55 










60 










Lys 


Thr 


Thr 


Thr 


Thr 


Ser 


Asn 


Lys 


Lys 


Thr 


Lys 


Lys 


Asp 


Lys 


Glu 


Lys 


65 










70 










75 










80 


Glu 


Lys 


Glu 


Asp 


Asp 


Thr 


Glu 


Gly Ala 


Val 


Leu 


Gly Arg Arg 


Glu 


Pro 










85 










90 










95 




Glu 


Asn 


Arg 


Pro 
100 


He 


Gly 


Arg 


Asp 


Glu 
105 


Glu 


Gly 


Ala 


Val 


Glu 
110 


Asp 


Gly 


Lys 


Glu 


Glu 
115 


Glu 


Glu 


Val 


Phe 


Glu 
120 


Phe 


Glu 


Gin 


Pro 


Ser 
125 


Val 


Asn 


Thr 


Gly 


Ser 


Asn 


Thr 


Gly 


Gly 


Gly 


Gly 


Thr 


Gly Thr 


Val 


Pro 


Gly 


Glu 


Gly 




130 










135 










140 










Leu 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 


Thr 


Pro 


Thr 


Pro 


Thr 


Pro 


Pro 


Pro 


Thr 


145 










150 










155 










160 


Pro 


Thr 


Pro 


Thr 


Pro 
165 


Pro 


Pro 


Pro 


Pro 


Thr 
170 


Arg 


Thr 


Pro 


Ser 


Pro 
175 


Ser 


Ser 


Ser 


Leu 


Gly 


Glu 


Asp 


Asp Asp Asp 


Asp 


He 


Asp 


He 


Asp 


Phe 


Asp 








180 










185 










190 






Asp 


Asn 


Asp 
195 


He 


Asp 


Glu 


Phe 


Leu 
200 


Asp 


Ser 


Gly 


Glu 


Glu 
205 


Met 


Glu 


Glu 


Asp 


Glu 
210 


Glu 


Glu 


Glu 


Asp 


Leu 
215 


Asp 


Thr 


Leu 


Leu 


Ser 
220 


Arg 


Leu 


Glu 


Thr 


Gly Met 


Ser 


Gly 


Glu 


Glu 


Val 


Asp 


Phe 


Asp 


Ala 


Ser 


Ser 


Ala 


Tyr 


He 


225 










230 










235 










240 


Gin 


Pro 


Asp 


Pro 


Val 
245 


Val 


Val 


Lys 


Asn 


He 
250 


Glu 


Arg 


Ser 


Asp 


Tyr 
255 


Thr 


Leu 


Asp 


Pro 


Met 
260 


Glu 


Ser 


Trp 


Lys 


Val 
265 


Leu 


Asn 


Arg 


Ser 


Glu 
270 


Gly 


Asp 


lie 


Arg 


Phe 
275 


Phe 


Val 


Asp 


Arg 


Gly 
280 


He 


Thr 


Asn 


Lys 


He 
285 


Lys 


Ala 


Met 


Thr 


Glu 
290 


Asp 


Leu 


Lys 


Glu 


Leu 
295 





















<210> 86 
<211> 603 
<212> DNA 
<213> SHRIMP 



<400> 86 

atgaccatgt 

ggtgttttct 

tctaaacaag 

ccagtcactt 

tttatggcat 

gacgaggctg 

cgttatggga 

acagtacaag 

gatggacacg 

ggaaaaacgt 

tag 

<210> 87 
<211> 196 
<212> PRT 



ggaacaaaat 
ttattctagc 
aaaaatactc 
acttgtcaaa 
atgttgatgt 
gctgggccca 
gaggagaat t 
atatgtcgga 
gaatgcaatc 
catcagaaac 



tgtaataact 
aataactgcc 
cacttctcat 
aacgggtaaa 
gcacaataga 
cttttgcctg 
tggaccagaa 
gccatatata 
tgttttggag 
ttataatctt 



actaaacgaa 
ttagctgtcg 
atcaatgaac 
ttaaaggaca 
acaaaaactc 
ctggcttctg 
aaacattccc 
acccacatct 
aagaatagga 
agtgataagt 



tgaattggcc 
ctctctatat 
aatttactgc 
tgcacttgac 
tgaagcaccc 
ctgaagctta 
tagctgaaac 
tcaagaagaa 
ataaaatcag 
ctatatctat 



tatggttgtt 
acgacatgct 
caaacagttg 
ccattccgat 
catgtgtact 
tcgtcgcatc 
tatccaatcg 
cacggatgtt 
aatgggtgat 
tgttggtgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

603 
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<213> SHRIMP 



<400> 87 



Met 


Thr 


Met 


TrD 


Asn 


Lys 


He 


Val 


He 


Thr 


Thr 


Lys 


Ara 


Met 


Asn 


TrD 


1 








5 










10 










15 




Pro 


Met 


Val 


Val 


Gly 


Val 


Phe 


Phe 


He 


Leu 


Ala 


He 


Thr 


Ala 


Val 


Tvr* 








20 










25 










30 






lie 


Arg 


His 


Ala 


Ser 


Lys 


Gin 


Glu 


Lys 


x yi 


Ser 


Thr 


Ser 


His 


He 


Asn 






35 










4 0 










45 








Glu 


Gin 


Phe 


Thr 


Ala 


Lys 


Gin 


Leu 


Pro 


Val 


Thr 




Leu 


Sex 


Lys 


Thr 




SO 






























Glv 


Lys 


Leu 


Lys 


Asd 


Met 


His 


Leu 


Thr 


His 


Ser 




Phe 


Met 


Ala 


Tvr 


65 










70 










75 










80 


Val 


Asp 


Val 


His 


Asn 


Arg 


Thr 


Lys 


Thr 


Leu 


Lys 


His 


Pro 


Met 


Cys 


Thr 










85 










90 










95 




Asp 


Glu 


Ala 


Gly 


Trp 


Ala 


His 


Phe 


Cys 


Leu 


Leu 


Ala 


Ser 


Ala 


Glu 


Ala 








100 










105 










110 






Tyr 


Arg 


Arg 


He 


Arg 


Tyr 


Gly 


Arg 


Gly 


Glu 


Phe 


Gly 


Pro 


Glu 


Lys 


His 






115 










120 










125 








Ser 


Leu 


Ala 


Glu 


Thr 


He 


Gin 


Ser 


Thr 


Val 


Gin 


Asp 


Met 


Ser 


Glu 


Pro 




130 










135 










140 










Tyr 


He 


Thr 


His 


He 


Phe 


Lys 


Lys 


Asn 


Thr 


Asp 


Val 


Asp 


Gly 


His 


Gly 


145 










150 










155 










160 


Met 


Gin 


Ser 


Val 


Leu 


Glu 


Lys 


Asn 


Arg 


Asn 


Lys 


He 


Arg 


Met 


Gly 


Asp 










165 










170 










175 




Gly 


Lys 


Thr 


Ser 


Ser 


Glu 


Thr 


Tyr 


Asn 


Leu 


Ser 


Asp 


Lys 


Ser 


He 


Ser 



180 185 190 

He Val Gly Val 
195 



<210> 88 
<211> 861 
<212> DNA 
<213> SHRIMP 



<400> 88 

atgagctcgg 

gaaggggaag 

attactgcca 

ggactaatag 

atgaggctag 

gatactattg 

gagaaaaaca 

aaaggggagg 

gattggatgg 

ttacaaagtg 

agcttttatc 

ttatcaaaag 

aaacaggaga 

gaatatccgc 

aacttttctg 



gtagtatcaa 
aacaggattt 
tgttatctgt 
caactgcgtc 
ataaagacgt 
aagttgactc 
taaaaaccaa 
cagtcaaact 
cgaacatctg 
ggcaattaat 
acattgctat 
gatcaaagga 
ttgctgcaat 
ctgtgcctat 
agaaattatg 



<210> 89 
<211> 286 
<212> PRT 
<213> SHRIMP 



caaccatccc 
tgatgtatta 
tgcagctcct 
tgcgacagca 
gtgccaacta 
cattattcgc 
ctctactgaa 
ggctaagaat 
ccaaagcaac 
atggacaact 
aatgtggaaa 
caatcttttg 
gcttcaatct 
cactgcaatt 
a 



tcctcaaaca 
gaactggact 
cctcccaatt 
ccagcagccg 
atcgaaaggg 
caactgttat 
aaggagccag 
accccagttc 
aagatcgggg 
ttcccagctg 
cttttaggtt 
cctcgtgatg 
agacaaaata 
ctatcacgta 



tggacaccaa 
actccaaaat 
ccatactgga 
aaacaggaaa 
acatagaatt 
attttggaga 
tttacttccc 
tagatacgat 
ttgaaaattt 
ctgtatatgc 
cctttataaa 
atatacaggt 
ttctcggaag 
ctataattcc 



caaaatggag 
catccatgat 
cgcatctgat 
ctctaacagg 
agtgaagagt 
atctgcatca 
caaagaaccg 
aacaaaacta 
agcctctgcg 
ttcgttggac 
catcgaagct 
tgtccatgct 
aggaccgtcg 
cctgttgagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

861 



<400> 89 

Met Ser Ser Gly Ser He Asn Asn His Pro Ser Ser Asn Met Asp Thr 
15 10 15 
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UcL 


Glu 


Glu 


ri 1 v* 
uiy 


UlU 


Glu 


Gin 


Asp 


Phe 


Asp 


Val 


Leu 


Glu 


Leu 








20 










25 










30 






ASp 


iyr 


C pr 


Lys 


He 


Tip 

J- lc 


m o 


Asp 


lie 


Thr 


Ala 




Leu 


Ser 


Val 


Ala 






35 










40 










45 








Ala 

nlo 


Prn 

IT X (J 


riu 


Pro 


Asn 


oc x 


Tip 

lie 






Ala 


•3C x 


Asp 


Glv 


Leu 


He 


Ala 




50 




















60 










i ill 






Ala 


Thr 


nlo 


Pro 


nla 


Aid 


wl U 


X 11 X 


o x _y 


Asn 


Ser 


Asn 


Arg 


O -J 










70 










75 










80 


rJc c 


Arg 


Leu 


Asp 


Lys 


Asp 


Vol 




will 


Leu 


I le 


O X IX 


Arg 


Asp 


He 


Glu 










85 










on 










95 




Leu 


VOX 


Lys 


Ser Asp 


X XIX 


ne 


^1 n 


val 


Asp 


wel 


Tie* 
11c 


He 


Arg 


Gin 










100 










ins 

x u J 










110 






L€U 


Tyr 


DVia 


Gly Glu 




nia 


Ser 


r:i ii 

bill 


Lys 


Asn 


Tip* 
lie 


Xjys 


Thr 


Asn 


Ser 






J. i»? 










i ^ \j 










125 








inr 




Lys 


Glu 


Pro 


Val 


Tyr 


rrne 


Pro 


Lys 


ulu 


Pro 


T w« 

i»y 5 


Gly Glu 


Ala 
nla 














X J -J 










140 










\/a 1 
Vdl 


Lys 


Leu 


Ala 


Lys 


Asn 


i nr 


Pro 


Vol 


Leu 


ASp 


X III. 


Tip 

lie 


Thr 


Lys 


Leu 


"I d S 










i sn 

x v 










1 SS 

X J J 










i fin 


Asp 


Trp 




Ala 


Asn 


lie 


Cys 


V?11 1 


OCi 


Asn 


Lys 


Tin 
116 


oiy 


Val 


Glu 


Asn 










165 










1 "70 
X / u 










175 




Leu 


/\lcl 


oer 


Ala 


Leu 


oin 


Oci 


biy 


uin 


Leu 


Tip 

x ic 


i rp 


Thr 


Thr 


Phe 


Prn 








180 










185 










190 






Ala 


Ala 


Val 


Tyr 


Ala 


Ser 


Leu 


Asp 


Ser 


Phe 


Tyr 


His 


He 


Ala 


He 


Met 






195 










200 










205 








Trp 


Lys 


Leu 


Leu 


Gly 


Ser 


Phe 


He 


Asn 


He 


Glu 


Ala 


Leu 


Ser 
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<400> 90 

atggaccctg 

acaaggcatt 

gtcgaggagg 

tcttcttcct 

gaagaagaag 

ggagaggagg 

cgatcttctt 

tttatcaagt 

egtaategge 

aagaaacaga 

atgtctcctg 

acttctataa 



gagcategge 
atcagcagca 
aacatgggca 
catcctcctc 
aagaagaaga 
aggaagaaga 
caeggaatte 
acgaaaagga 
aaccccaaat 
aaaccaataa 
caagtagact 
tagtactgtc 



tgcttccagg 
attgaaccgt 
ggtaacaaca 
ttcttctccc 
atatgactct 
ttcagataca 
cacgaccaga 
tattttactt 
acataagcta 
gaagaaaact 
gaaatttttc 
gataatggee 



agggctttat 
gcactgaata 
acaaacaaag 
aegtcttegg 
gaatcagaca 
gaatccacat 
aacaggctaa 
gctgacagga 
aacaacaaaa 
tggagaagac 
tctgcttgta 
ctttag 



ggtctagcac 
agattgaaga 
aaatggcttc 
ccatcccttc 
etaaegtega 
ctgctgatgc 
taaaaaagta 
ataaaaggaa 
gactcaaaaa 
tccctaaatt 
taatatcagg 



tgtaacaaac 
agaagatgat 
tacttctaca 
aagegatgag 
ttctcttctt 
aaactttcta 
cgttgataga 
gaagaggcac 
accaacagac 
tattaaaaag 
gataaaaata 
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<210> 92 
<211> 4608 
<212> DNA 
<213> SHRIMP 

<400> 92 

atggctcaca aacttttatt tctggaggaa 
cacccagaac cttcattcgc tctttatgag 
aaaaatgtaa ccgatgccta ccccaaattt 
caggcactag cagtgegtet agecctgatt 
aacatgtcca ttatagacct tgtgaaagat 
aaaacgagtg gaggatttat categgagat 
gcagccttcg tcatgagtgt catactacaa 
gtagggecat caatttttgg tcaagattcg 
gtgtggaaaa ggcacccctt ttttatatgg 
tgtcaacagg geatgegaga agtggttaca 
tcatggagaa aactacaagt gccttgtgct 
aaaagtgggt ccatgacagt agatgtggaa 
gatattagat ttttcactct cagggatgtt 
tcaataggag atttcttaac agaaacgccc 
cgcacaaatc tggaatttgt actgaaattt 
aaggttatgc ccattgacaa ctttgtcaca 
caaaatatta gtgactcttt tagaaaggaa 
agagttttgc gcaacataca aaaaagggca 
ttcaagagtg caatgagtga cgacagaaac 
aataagggee gagttacaat gtccaactac 
egtattttet tagagatctt aaagtatgat 
ttccagagca atgctgattt gcacatgatt 
tacacgagca tacaagcett caaagaagtg 



gaagatgeta 
agtgaaacat 
cttcctcggc 
caattctaca 
aaggtcgaaa 
ggtactggtg 
gaaaaggcac 
gacaaggtaa 
ttgacgtgct 
aattctagag 
aataaaccaa 
aattctgtat 
aaggaatttc 
accattttat 
ataacaggag 
gcattattat 
ctggcaaaga 
aacccttcag 
tttaaagtga 
ttgaaactat 
aegttcaegg 
gaccaca tac 
tecagtgeta 



aagagattgg 
teeggteggt 
cgatggacat 
aagggagagg 
ggaattttaa 
taggaaaaac 
tcttagatgt 
ttaeggecat 
ctaagcctct 
gtcttcgtga 
cgtctttcaa 
cgtctgctaa 
atagtaagcg 
ttatgacgta 
gaacagacct 
gtgacgagtt 
cttgggagga 
tgtcagacct 
agegcatgaa 
tatctcaagc 
ttatggccag 
taaggaaaag 
ctccagatgc 



caccctctcc 
gggcttctgc 
taattctgtg 
gtggaagaag 
agtggacaag 
aagagaattg 
acaaaaacac 
caacagtggc 
tttcaacagc 
tcctaaattt 
gagegaeggg 
agacteggtg 
ttcttcaaga 
ctctgattta 
cgattccaac 
ccacaagact 
agaagatact 
tattaatagg 
atcatccaat 
tgatgcgttc 
cgctacgcca 
tgcgccagcc 
tatggctgag 
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cacagcgagt 
caattggtta 
gcctcccctc 
caagtactca 
atacgagaac 
gagattaatt 
gtaatggaaa 
catgatggga 
aaaaaggatg 
agcagcattg 
tccaaaggta 
tggggtgaag 
acagcaagta 
gtcaccgatg 
ctagagcaaa 
gccaaggatg 
cctgttgcta 
tcagctttca 
atgcttgtgc 
gattcaatca 
tgttcgcgta 
acaaacaaat 
cctaaaggca 
cgttaccatt 
aggactcaca 
cccgcagaga 
acatatgccg 
caatttatcg 
gccagtatga 
agaggagggg 
attgtcctcg 
tcagacgatc 
tccattgtta 
tttaaatctg 
gccctgatag 
gaaagtgatg 
atggatgttt 
gatgttcccg 
acgttagctc 
ctagtgacgg 
tgtttagatg 
gttatacaga 
ctagcacatt 
ttgtcccgtg 
gagtctaaga 
aggtatgatg 
gtcaggaaga 
catataataa 
gaagacgacg 
gaagaagaag 
gggcacgatg 
agtgtttttg 
ggttctttga 
ttttgcctgt 



acgtgacagt 
gtaggagtat 
ttcaaaaata 
tcgattctga 
accaagaagg 
ccccgtccag 
atatcgacag 
aaacaacact 
cgctttctaa 
atatggttgc 
aaactgattt 
tatttgacga 
cttctgtggc 
cagtgaagcg 
ctggggattc 
aaagtgacgc 
atacaatatt 
caa tctcctt 
cttcagtccc 
cacaatcaat 
taacgaatag 
gtattagtgc 
ataccggtct 
gcgttttaga 
gaaacggtca 
gaagattttt 
atagatacag 
atcaagggtt 
caaaggtaca 
ttgcattcgt 
cgatgcacat 
ttgggcatgc 
aatccgcatc 
attgtgacaa 
acactttgaa 
caccttcatt 
tctctaatat 
acaatgatga 
tatcaaactt 
tgagaattgt 
tacctgaat t 
gtttggctaa 
cccacagaga 
gtttcattaa 
ttatgtacga 
gcctttgtga 
agccagagtg 
atgacgaaga 
acgaagaatt 
gagaaagtgc 
tggacataat 
ttaatgaaca 
tacaaatatg 
acgactcctc 



gtttctggaa 
tagcatggct 
cgctatagac 
aaaagtagga 
aggaatattg 
aaagaggaaa 
gcgttttaag 
aagaagtata 
tggaggagga 
tcaagattta 
taacgaagat 
cgagtgtttt 
agcatgtaaa 
tttaaggaca 
ttttctcaag 
aaaatacggc 
ctcaggctat 
aaagaataaa 
tgacacggaa 
aggggaggat 
gggattgttt 
attcaacaac 
ctctttacat 
tgtaccttat 
actttcagtc 
tgatagtttg 
taataattcc 
agtcctaaag 
cttgatggaa 
agaaggtttg 
agcgctagta 
tctttctttc 
tcaattcgtt 
tcttctccca 
tacgaaaaat 
aactgctcta 
aatggcatat 
agacgaaggg 
cccccatgat 
tggtcaagga 
agatatgaca 
agaaaatcct 
agggtacgga 
ttttagacag 
tatattcttg 
tatgagaatg 
tgttttcatt 
ggaagaaacg 
tgaagacgag 
ttacgacatt 
actgtgcaat 
catagactct 
ctttgataac 
atctaaagac 



caggttatta 
ggagtcgact 
gaattggctt 
ggacatgtca 
gaagaggaag 
cgtgctgcaa 
gtggtggttg 
gtacaagatg 
ggaataataa 
tacgatgcta 
tatgacgacg 
gaaaagttga 
ggtgcactat 
acaaacgaat 
aatttgacta 
atcgtagacg 
agacttttgt 
accaatagaa 
ccgctcatgg 
agtaacgctg 
ctggttaaaa 
acgaaagaag 
gactcttcta 
aacgccattg 
cctcagttcc 
gataagagga 
atagatattg 
actatgggga 
catttttcaa 
acgttagaaa 
gttataggtg 
acgtctgtac 
ttttctaatt 
cgcgagaagc 
aacgaagtta 
atgctccctt 
aataacaata 
tgtttacctt 
tatgatagag 
gagaaagagg 
aatctcatac 
ggtttgcttt 
ggttctcatc 
ttccaaaacc 
agcgtcaaag 
aacagcatga 
acaaaattat 
agggagaggt 
gaggaggaac 
tctgtcatca 
agaaagaaac 
tttatggttg 
tgcacaggcg 
aaagacacca 



aacttttaag 
gctctactac 
cgtactgttt 
gaagggcctt 
atgttgaaaa 
atgatgatga 
tgcgcgatag 
ctataaaaac 
cgtctccgga 
tcaagaaaaa 
gtgctaatga 
ggagacaata 
tgaatatcaa 
caaaaaagat 
cacgcatctt 
tggggacatt 
gccgtgcagt 
gaacttcccc 
cgctagcagg 
aaataacaaa 
acaacactaa 
tcgacgttat 
ataactccat 
ctttcctaca 
tcatattctc 
tcaaagattc 
ctgccgcagt 
atatcgtaca 
agatgatgat 
atggtatctt 
cccaaaataa 
ttcctcataa 
tgtgtctgca 
gagtgcgtga 
cttccaagac 
ctggtcctag 
acggcatgga 
tgcaagaaga 
ccataaaaga 
gagtgattcc 
ctgtagtgac 
ttactataca 
ttctgggttt 
aacttttctc 
cgatcatggc 
tggatgcaag 
tggacaagaa 
ttggcgggga 
aagcagaaag 
atgataaaaa 
taacactcac 
gaaatttaat 
agtttgaagg 
ttccgtag 



aaatagagga 
taattgtaaa 
aaacgctaga 
cacaaagatt 
attagtagca 
tttata tgaa 
ggatgtcgca 
ctactcacag 
agttgacata 
ggaaaaacct 
agaagacgga 
cttcattaat 
ggcaacttct 
ggtcatgtca 
acaaacgatc 
cgattcatcg 
catgatggct 
agctcacgtt 
caaccctata 
caggaaattg 
aactgccaac 
aatgctggga 
gtacgctaaa 
gactatcggt 
tacagactcg 
aaaggcagga 
tatgagggaa 
gattgtcaca 
gaggacaaat 
cactgaagta 
aatcacatca 
tcaaatattg 
tttagtccac 
tgcagcatct 
aaataaaata 
gaatagaaag 
ctttgatgaa 
aaatgctaca 
tgcacatcaa 
tatcagtgaa 
ggcaactaat 
taatgctgca 
ggctaagaaa 
acctaaaaaa 
cagagatgat 
tttcttaaag 
ttttaggaga 
ggaagaagaa 
ggagtggggg 
taacacaatc 
taaagaaaat 
aggtgcggaa 
tttacccaaa 
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Ser Asn Asn Ser He Asp He Ala Ala Ala Val Met Arg Glu Gin Phe 

980 985 990 

He Asp Gin Gly Leu Val Leu Lys Thr Met Gly Asn He Val Gin He 

995 1000 1005 

Val Thr Ala Ser Met Thr Lys Val His Leu Met Glu His Phe Ser Lys 

1010 1015 1020 

Met Met Met Arg Thr Asn Arg Gly Gly Val Ala Phe Val Glu Gly Leu 
1025 1030 1035 1040 

Thr Leu Glu Asn Gly He Phe Thr Glu Val He Val Leu Ala Met His 

1045 1050 1055 

He Ala Leu Val Val He Gly Ala Gin Asn Lys He Thr Ser Ser Asp 

1060 1065 1070 

Asp Leu Gly His Ala Leu Ser Phe Thr Ser Val Leu Pro His Asn Gin 

1075 1080 1085 

He Leu Ser He Val Lys Ser Ala Ser Gin Phe Val Phe Ser Asn Leu 

1090 1095 1100 

Cys Leu His Leu Val His Phe Lys Ser Asp Cys Asp Asn Leu Leu Pro 
1105 1110 1115 1120 

Arg Glu Lys Arg Val Arg Asp Ala Ala Ser Ala Leu He Asp Thr Leu 

1125 1130 1135 

Asn Thr Lys Asn Asn Glu Val Thr Ser Lys Thr Asn Lys He Glu Ser 

1140 1145 1150 

Asp Ala Pro Ser Leu Thr Ala Leu Met Leu Pro Ser Gly Pro Arg Asn 

1155 1160 1165 

Arg Lys Met Asp Val Phe Ser Asn He Met Ala Tyr Asn Asn Asn Asn 

1170 1175 1180 

Gly Met Asp Phe Asp Glu Asp Val Pro Asp Asn Asp Glu Asp Glu Gly 
1185 1190 1195 1200 

Cys Leu Pro Leu Gin Glu Glu Asn Ala Thr Thr Leu Ala Leu Ser Asn 

1205 1210 1215 

Phe Pro His Asp Tyr Asp Arg Ala He Lys Asp Ala His Gin Leu Val 

1220 1225 1230 

Thr Val Arg He Val Gly Gin Gly Glu Lys Glu Gly Val He Pro He 

1235 1240 1245 

Ser Glu Cys Leu Asp Val Pro Glu Leu Asp Met Thr Asn Leu He Pro 

1250 1255 1260 

Val Val Thr Ala Thr Asn Val He Gin Ser Leu Ala Lys Glu Asn Pro 
1265 1270 1275 1280 

Gly Leu Leu Phe Thr He His Asn Ala Ala His Ser His Arg Glu Gly 

1285 1290 1295 

Tyr Gly Gly Ser His Leu Leu Gly Leu Ala Lys Lys Leu Ser Arg Gly 

1300 1305 1310 

Phe He Asn Phe Arg Gin Phe Gin Asn Gin Leu Phe Ser Pro Lys Lys 

1315 1320 1325 

Glu Ser Lys He Met Tyr Asp He Phe Leu Ser Val Lys Ala He Met 

1330 1335 1340 

Ala Arg Asp Asp Arg Tyr Asp Gly Leu Cys Asp Met Arg Met Asn Ser 
1345 1350 1355 1360 

Met Met Asp Ala Ser Phe Leu Lys Val Arg Lys Lys Pro Glu Cys Val 

1365 1370 1375 

Phe lie Thr Lys Leu Leu Asp Lys Asn Phe Arg Arg His He He Asn 

1380 1385 1390 

Asp Glu Glu Glu Glu Thr Arg Glu Arg Phe Gly Gly Glu Glu Glu Glu 

1395 1400 1405 

Glu Asp Asp Asp Glu Glu Phe Glu Asp Glu Glu Glu Glu Gin Ala Glu 

1410 1415 1420 

Arg Glu Trp Gly Glu Glu Glu Gly Glu Ser Ala Tyr Asp He Ser Val 
1425 1430 1435 1440 

He Asn Asp Lys Asn Asn Thr He Gly His Asp Val Asp He He Leu 

1445 1450 1455 

Cys Asn Arg Lys Lys Leu Thr Leu Thr Lys Glu Asn Ser Val Phe Val 



WO 01/38351 



178 



PCT/US00/28888 



1460 

Asn Glu His lie Asp 
1475 

Gly Ser Leu lie Gin 
1490 

Gly Leu Pro Lys Phe 
1505 

Thr lie Pro 



1465 

Ser Phe Met Val Gly 
1480 

lie Cys Phe Asp Asn 
1495 

Cys Leu Tyr Asp Ser 
1510 



1470 

Asn Leu lie Gly Ala Glu 
1485 

Cys Thr Gly Glu Phe Glu 
1500 

Ser Ser Lys Asp Lys Asp 
1515 1520 



<210> 94 
<211> 2919 
<212> DNA 
<213> SHRIMP 

<400> 94 

atggatgaac gacgaagaga tccgttactt 
gcgcaaataa cgttattcgt gaccgtgttc 
gctgcaatgg cctataatgt agccaaacca 
ttagggatga aaagtaaatt gaaagcagtt 
gaatacctag aagccagagg taggcatgac 
caccccgctc tcctggacat gatgaatttg 
tcagaaacga aggaaaggaa ttctcggttc 
tcatctcttc tcgtgcctga acaccatata 
tctaacaaat atctagatgc tgtatcgggg 
aaggataata tagaaaaaga cgatactgaa 
atccatcacc atagtgcatc aggcgtaagt 
tcagaaggac atgatgaaga tgtggatatt 
aacccagttc ctttgaaaga gggacaaact 
gttcgtgtga tgacaccatt tgttgctgga 
gatcctaaac cttattgttg gtcaggaaac 
accactggag aacgtgtagt taaagagtgt 
gatgatgggg gctggcaatg cagacctaaa 
acatcgatga ctgcttgcgc tttcaatcct 
tcttcgacac caatctatta tgatgtatta 
ttccgcaact cttcttacat ctccaaattg 
attaaatgca acgatcctga atttttatac 
aagaaggatg tactaaataa cgaccttctt 
agaggtatgt atgaatgtat ggaaaacccc 
tttgttacgt ttgatgtgag cacaatgaag 
tcaaacaggc atgcaataat aggcgatgac 
atgggaattt tcttggccga ccaatctaaa 
aaatcaagta tcgacgaaac tactgccaag 
acgcctttaa acctggatgc aacaaatact 
agcactgttt tacctcctct agaaaatatt 
ttacatagaa gttgtttagc tcccgtcctg 
ttctgtactg caccctttta cattgagcca 
caaaaaccct atgaacatag tatgttggcc 
tcgggtagtg ttcacggcgg atctgaactt 
ccctcctctt atatcagaac tccccctgga 
ggtgaccaaa ggctggaaga aataagggac 
agattgtctc agacagaata cgtaataaag 
atatatctca aatcaagttc ttgggatagt 
atcataaaaa aatctgagga tacttttgta 
ggaccttatg ctgctactgt gctagcgcga 
aaacctgcat caaaaacttc cagtacattg 
cccacatcct actctgttct tcccgaagaa 
cacaatagga tatttgacag cgaaacaatc 
cccggaagtg aaaaattgtt cggaacatca 
gacgacgatg cgtggggttt gcaatcgttt 



taccctacca accgaagtat gcgcttcacc 60 
gtgttaggct gtatttttgc gctagtctgc 120 
atgtctgtta attttcaagc tatacacgaa 180 
caaggagcta accctgaaaa gacactagaa 24 0 
ggggtggaag atgcaagcaa ttacccacct 300 
actgttaaag ggaataaatg gaacgttccg 360 
gaaagtcatg atttagctgc aaatagatcc 420 
gatagattga gtgaggccac catcgaaaaa 4 80 
aaaaagttta gacaaaggat ggtcaatttg 54 0 
ttgtacgatt ctttatttgg tgtacatgat 600 
ggtgacgctc ctcctcctcc tccatctaca 660 
ttggcctaca atactggcgg gtactgttcc 720 
tgcacatctg tgtgttatac ctcgagagca 780 
ggaacattta tcacacataa atctggagaa 84 0 
gtgcccggtg accacataga aacatcccca 900 
tctgtgcaca catctatagt cgttctgaca 960 
tacccaacat atttcggagg ttcgggagga 1020 
tcaacccaca aaggaccacc acctccttca 1080 
aagaaacaac aaatacggaa tcatacagaa 114 0 
cgtcaatctt cttccttggc ggaattcaaa 1200 
aaaaacccca tcacctgttt ctgtaataat 1260 
tctcaagatg taacaaagga tatgaaattc 1320 
tgtgttatga tgcccaacat agacccttcg 1380 
tgtgtaccgg gcgttaataa tccccaagac 14 40 
aggacaccgt tagtggggac tgttcctgcc 1500 
agaggggacc agattcacca gcaaagacct 1560 
aaaattgcac tggcccaagc gcccatcatc 1620 
tctaagaatg tgctctttgt accaatacct 1680 
ccccatgtaa tcatccgccc gtcttcattg 1740 
aacaaacctt cctcggggca gcataggcca 1800 
gcagctaatg ttttggccgg aaacatacct 1860 
acagaatgtt tgcgaaatag tcgcatggtt 1920 
ttattctcta cacttttgag tcaaaataaa 1980 
ggtactccag ctccagagta caattctaca 2040 
ttttttgaac gtaattttaa cgatgagagg 2100 
aaacacgcga gagggatgag gacaagtgaa 2160 
ttaatgaaga ggaaagaatt tctaagacat 2220 
ttgaaagagg ggttattaat gcgttcatac 2280 
gatatgtttg atttagatta cctaaaagga 2340 
aaagtgtcca atcctttgca gtacgctttc 2400 
ggagctaccg acgatatttt ttcagtagac 2460 
cccagctatt ttgattgctc caatgtaact 2520 
tcttcctcat ccgagtacag agtggatatt 2580 
cgtttggatc ataaccccaa gagtggcccg 2 640 
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gtagtacagt cagaccctcg tctagcgttt gatgcgtcaa acatttcctc aacccccgaa 2700 

ggagctacaa taacacccct ctctctattc aagaagtcac tcgtcgagtg ggggcataag 2760 

aaggcggacg tgcaagaaac ttcgtggttc agggacggtg tggatacttc tgaggcttat 2820 

agacgattac ttgtagagac tagtatggct gtacgtaact cctggttctc tttagcatgg 2880 

gaaaataaaa actattattt tgccaaaaat agcagctaa 2919 

<210> 95 
<211> 962 
<212> PRT 
<213> SHRIMP 
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620 










Ser 


Gly 


Ser 


Val 


His 


Gly 


cot 
bJ5 










CLA ft 

b4 0 


Ser 


Gin 


Asn 


Lys 


Pro 


Ser 










D33 




Pro 


Ala 


Pro 


Glu 


Tyr 


Asn 








b /O 






Arg 


Asp 


Phe 


Phe 


Glu 


Arg 






685 








Thr 


Glu 


Tyr 


Val 


He 


Lys 




i ft ft 

7 00 










Leu 


Lys 


Ser 


Ser 


Ser 


Trp 


/ 15 










no ft 


Arg 


His 


lie 


lie 


.Lys 


Lys 










/ 33 




Leu 


Leu 


Met 


Arg 


Ser 


Tyr 








/30 






Asp 


Met 


pne 


ASp 


Leu 


Asp 






/ bo 








Ser 


Ser 


Thr 


Leu 


Lys 


val 




/ O U 










Ser 


Tyr 


Ser 


Val 


Leu 


Pro 


795 










800 


Val 


Asp 


His 


Asn 


Arg 


lie 










815 




Asp 


Cys 


Ser 


Asn 


Val 


Thr 








830 






Ser 


Ser 


Ser 


Ser 


Glu 


Tyr 






845 








Leu 


Gin 


Ser 


Phe 


Arg 


Leu 




860 










Gin 


Ser 


Asp 


Pro 


Arg 


Leu 


875 










880 
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Ala 


Phe 


Asp 


Asn 


He 
885 


Ser 


Ser 


Thr 


Pro 


Glu 
890 


Gly 


Ala 


Thr 


He 


Thr 
895 


Pro 


Leu 


Ser 


Leu 


Phe 
900 


Lys 


Lys 


Ser 


Leu 


Val 
905 


Glu 


Trp 


Gly 


His 


Lys 
910 


Lys 


Ala 


Asp 


Val 


Gin 
915 


Glu 


Thr 


Ser 


Trp 


Phe 
920 


Arg 


Asp 


Gly 


Val 


Asp 
925 


Thr 


Ser 


Glu 


Ala 


Tyr 
930 


Arg 


Arg 


Leu 


Leu 


Val 
935 


Glu 


Thr 


Ser 


Met 


Ala 
940 


Val 


Arg 


Asn 


Ser 


Trp 


Phe 


Ser 


Leu 


Ala 


Trp 


Glu 


Asn 


Lys 


Asn 


Tyr 


Tyr 


Phe 


Ala 


Lys 


Asn 


945 










950 










955 










960 


Ser 


Ser 































<210> 96 
<211> 681 
<212> DNA 
<213> SHRIMP 



<400> 96 

atggcctcag 

gttcctaccc 

attatggaga 

ttggcgtatt 

cagatcaagt 

gagatcaagg 

atcttcggaa 

aagggtgtaa 

ttgtggatgt 

aatgcatcta 

gtgtcgtacg 

tctgagtttt 



tctttgaaga 
gccccaatat 
accgcctgat 
tcttcccctc 
cgaattgcct 
actgtggacc 
acagtgtttt 
cattctatat 
gtgacaaact 
agccaaaatt 
ttattgaggg 
tggtacaata 



ccctgctgat 
cctgttcttc 
ccataaaggg 
caacaatgaa 
gtctttcttt 
atactgtact 
tggacaagac 
atttgttgat 
gaagagaatg 
ttctgtatgt 
ccacggaatg 



ctctttgcca 
gaaggcctac 
aagtgtggag 
gaaaatacga 
attagggatt 
ttttcccaat 
tttactattg 
ggatctgatt 
aatgccaatg 
aaaacatgta 
ggacattcag 



<210> 97 
<211> 224 
<212> PRT 
<213> SHRIMP 



acatggactt 
tccccaattc 
catttgaagc 
agaaactcaa 
ttttgaacga 
acatggatgg 
tcgccatgga 
ctatggagaa 
tggtaaaagt 
ggtgggagtt 
atttgtcatg 



gactggcaaa 
tggcaaggag 
agatacacaa 
cattggtttc 
ctggttggag 
ggacaaagaa 
ctggatcgac 
tatggcctct 
gtttgttgat 
cccaggtcca 
tgatgagatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

681 



<400> 97 



Met 


Ala 


Ser 


Val 


Phe 


Glu 


Asp 


Pro 


1 








5 








Leu 


Thr 


Gly 


Lys 


Val 


Pro 


Trp 


Asn 








20 










Pro 


Asn 


Ser 


Gly 


Lys 


Glu 


He 


Met 






35 










40 


Lys 


Cys 


Gly 


Ala 


Phe 


Glu 


Ala 


Asp 




50 










55 




Ser 


Asn 


Asn 


Glu 


Glu 


Asn 


Thr 


Lys 


65 










70 






Lys 


Ser 


Asn 


Cys 


Leu 


Ser 


Phe 


Phe 










85 








Leu 


Glu 


Glu 


He 


Lys 


Asp 


Cys 


Gly 








100 










Met 


Asp 


Gly 


Asp 


Lys 


Glu 


He 


Phe 






115 










120 


Phe 


Thr 


He 


Val 


Ala 


Met 


Asp 


Trp 




130 










135 




He 


Phe 


Val 


Asp 


Gly 


Ser 


Asp 


Ser 


145 










150 






Met 


Cys 


Asp 


Lys 


Leu 


Lys 


Arg 


Met 



165 



Ala 


Asp 


Leu 


Phe 


Ala 


Asn 


Met 


Asp 




10 










15 




He 


Leu 


Phe 


Phe 


Glu 


Gly 


Leu 


Leu 


25 










30 






Glu 


Asn 


Arg 


Leu 


He 


His 


Lys 


Gly 










45 








Thr 


Gin 


Leu 


Ala 


Tyr 


Phe 


Phe 


Pro 








60 










Lys 


Leu 


Asn 


He 


Gly 


Phe 


Gin 


He 






75 










80 


He 


Arg 


Asp 


Phe 


Leu 


Asn 


Asp 


Trp 




90 










95 




Pro 


Tyr 


Cys 


Thr 


Phe 


Ser 


Gin 


Tyr 


105 










110 






Gly 


Asn 


Ser 


Val 


Phe 


Gly 


Gin 


Asp 










125 








He 


Asp 


Lys 


Gly 


Val 


Thr 


Phe 


Tyr 








140 










Met 


Glu 


Asn 


Met 


Ala 


Ser 


Leu 


Trp 






155 










160 


Asn 


Ala 


Asn 


Val 


Val 


Lys 


Val 


Phe 



170 175 
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Val Asp Asn Ala Ser Lys Pro Lys 
180 

Trp Glu Phe Pro Gly Pro Val Ser 

195 200 

Gly His Ser Asp Leu Ser Cys Asp 

210 215 



Phe Ser Val Cys Lys Thr Cys Arg 
185 190 
Tyr Val He Glu Gly His Gly Met 
205 

Glu lie Ser Glu Phe Leu Val Gin 
220 



<210> 98 
<211> 747 
<212> DNA 
<213> SHRIMP 



<400> 98 

atgattgcca 

gcaaagatta 

ttgatgt tea 

cccaccatat 

gccaccggtc 

gggtatattg 

gaaattaaaa 

aattgtccag 

ggacattgta 

attacceggt 

aaagagatgt 

gtatcttgta 

tggacagatg 



tcgccaatca 
atatgttgta 
atgatacege 
tgaaagctgc 
tcgatgtcaa 
catcctcaaa 
gggctctaga 
tgtgtttaga 
ttgatgaaaa 
gtcctgtatg 
tggttgaacc 
aacgtagaag 
agtccgaaac 



caaacacgat 
tggtaaatct 
attaacacag 
tttagatttc 
taaagcacct 
ttatccttgg 
agaacatgag 
agatttcagt 
atgttgggac 
cagagaacgt 
aatagtgtct 
aatagggtat 
tgtatga 



gtctctgatg 
agtactttaa 
tttgeccaga 
tctctccaag 
cgttcttggc 
gctgaacaca 
aagaatgeta 
agtaccaata 
agattggtgt 
acttccctaa 
tgtaaaagaa 
aatagatacc 



cactagttgg 
gtgtaactga 
gaggatacga 
aagaagaacc 
caatactgaa 
taatttctgg 
atgtgcgttt 
tcatcaggac 
tgagtacgea 
gacctgacgc 
tggcagtgcc 
agttcctaat 



agcccatggg 
agcagcatta 
gectagtata 
ccttgttgct 
ttgtcgcttg 
ggataaggaa 
egacagegat 
gaeaegctgt 
acgtggagaa 
tgatcaagtt 
tgatgaacaa 
aaatgatgtg 



60 

120 

180 

240 

300 

360 

4 20 

4 80 

540 

600 

660 

720 

747 



<210> 99 
<211> 248 
<212> PRT 
<213> SHRIMP 



<400> 99 



Met 


He 


Ala 


He 


Ala 


Asn 


His 


Lys 


His 


Asp 


Val 


Ser 


Asp 


Ala 


Leu 


Val 


1 








5 










10 










15 




Gly 


Ala 


His 


Gly 


Ala 


Lys 


He 


Asn 


Met 


Leu 


Tyr 


Gly 


Lys 


Ser 


Ser 


Thr 








20 










25 










30 






Leu 


Ser 


Val 


Thr 


Glu 


Ala 


Ala 


Leu 


Leu 


Met 


Phe 


Asn 


Asp 


Thr 


Ala 


Leu 






35 










40 










45 








Thr 


Gin 


Phe 


Ala 


Gin 


Arg 


Gly 


Tyr 


Glu 


Pro 


Ser 


He 


Pro 


Thr 


He 


Leu 




50 










55 










60 










Lys 


Ala 


Ala 


Leu 


Asp 


Phe 


Ser 


Leu 


Gin 


Glu 


Glu 


Glu 


Pro 


Leu 


Val 


Ala 


65 










70 










75 










80 


Ala 


Thr 


Gly 


Leu 


Asp 


Val 


Asn 


Lys 


Ala 


Pro 


Arg 


Ser 


Trp 


Pro 


He 


Leu 










85 










90 










95 




Asn 


Cys 


Arg 


Leu 


Gly 


Tyr 


He 


Ala 


Ser 


Ser 


Asn 


Tyr 


Pro 


Trp 


Ala 


Glu 








100 










105 










110 






His 


He 


He 


Ser 


Gly 


Asp 


Lys 


Glu 


Glu 


He 


Lys 


Arg 


Ala 


Leu 


Glu 


Glu 






115 










120 










125 








His 


Glu 


Lys 


Asn 


Ala 


Asn 


Val 


Arg 


Phe 


Asp 


Ser 


Asp 


Asn 


Cys 


Pro 


Val 




130 










135 










140 










Cys 


Leu 


Glu 


Asp 


Phe 


Ser 


Ser 


Thr 


Asn 


He 


lie 


Arg 


Thr 


Thr 


Arg 


Cys 


145 










150 










155 










160 


Gly 


His 


Cys 


He 


Asp 


Glu 


Lys 


Cys 


Trp 


Asp 


Arg 


Leu 


Val 


Leu 


Ser 


Thr 










165 










170 










175 




Gin 


Arg 


Gly 


Glu 


He 


Thr 


Arg 


Cys 


Pro 


Val 


Cys 


Arg 


Glu 


Arg 


Thr 


Ser 








180 










185 










190 






Leu 


Arg 


Pro 


Asp 


Ala 


Asp 


Gin 


Val 


Lys 


Glu 


Met 


Leu 


Val 


Glu 


Pro 


He 






195 










200 










205 








Val 


Ser 


Cys 


Lys 


Arg 


Met 


Ala 


Val 


Pro 


Asp 


Glu 


Gin 


Val 


Ser 


Cys 


Lys 
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210 215 220 

Arg Arg Arg He Gly Tyr Asn Arg Tyr Gin Phe Leu He Asn Asp Val 
225 230 235 240 

Trp Thr Asp Glu Ser Glu Thr Val 
245 



<210> 100 
<211> 1788 
<212> DNA 
<213> SHRIMP 



<400> 100 

atggctgaag 

ccatatatgt 

attacctcca 

gaagagttta 

cccatctgcg 

ggcctttacg 

atggataatg 

agcagaactg 

atgagaagga 

gcaagaagtg 

ttttcaagaa 

cttgaacctt 

acaaaggcaa 

ctagaaacat 

agatctatag 

attatcaagt 

actacaaaaa 

gaaacaatcc 

gccgaaatga 

ttgtgtggtg 

aattacttca 

gataatctta 

gaagttaaag 

gcttttctta 

ctattgtgtt 

aaacttaaca 

atatcttccc 

ttttccttta 

tgcaattttc 

agtatagtga 



cagctccacg 
catttctgga 
gaaattctcc 
gagatattat 
agggaatgac 
aaaacatcca 
gatttttgag 
gattcaagtt 
tcaagaagtt 
gtattgtttt 
tcgtcaatat 
ttgacatatc 
aacgactgat 
caccatcatc 
ctagattgaa 
cttttctttc 
ctgttgctga 
agcattttca 
tggacaatat 
caatactgtt 
agatctgttt 
ttgtgaaaag 
gctatgttag 
ccaagaatcc 
attgtatcat 
ttctgaagcc 
cctttaccac 
ttaatgaaaa 
ttgatgctct 
atgctagaag 



ataccgtcag 
cgttttcaca 
tcctgttcct 
ctacttcttt 
ttttatcaac 
cgacgacttt 
acgcgagacg 
tacttcaaag 
ggacggtgtt 
ctacgttgaa 
cacctctgat 
tttcggatac 
atccaatggc 
cgtgtcaact 
ttggaccaac 
ctcgatagaa 
aatggcagta 
aaccctttac 
tctaaaaaat 
gtggatgatt 
gagggagatt 
gatcgtatca 
attgtgtgta 
gtggctcaaa 
taacaaatat 
ttctgcgaaa 
aagagtaggt 
taa tgaaaac 
tgctaaaaaa 
gcgagcgatg 



gtgcttgaag 
gaaagggaat 
tcaagtagtt 
atcaacaaca 
gcattgacaa 
ttgttcgtaa 
cacggcatca 
gaacaggctg 
ttggctagcc 
aaatgttcga 
tacgtggctg 
aagtattttg 
aattttatta 
gacttcagag 
gaagaaaggg 
gaagaaatgg 
gagtttgttg 
agtatatttg 
gttgcaataa 
aacagtaggc 
atggtaagaa 
cataaaaacg 
aaagtctcgt 
cacacttatt 
tcccatacta 
ggcatgtcac 
attgaaattc 
gtattccatg 
tacaagaata 
gtgcaacgcc 



aagtacttga 
tggcccttct 
tcaagaaatt 
ataccaagtc 
ccgtctgcaa 
aatattctct 
agtttggcac 
aggaagaaag 
tgaaaaagtc 
gtgttattag 
aaatgaaaaa 
tggacgaatc 
tcgggagacc 
aagaaatgaa 
ctagtgctta 
tagaagagta 
atgtgtttat 
acacgatgcc 
acgaagcagt 
cctttgaaga 
aaaagaccga 
ttgtgataac 
gtttcatgga 
ttgatgaaaa 
gcaaactagt 
ctctaatggt 
taacaacaaa 
ctgcagctgt 
taatatacga 
cctaatga 



aaacattgaa 
gaatgacata 
agacaataaa 
tgattcttcc 
aaccttcaga 
tctggtctct 
tggtgatgac 
agagaaggtt 
tactagttct 
attcaggctc 
atctgcacca 
tccttgtgtt 
tttttcttgc 
catggacgct 
caggagtgtg 
ctgtgagact 
agagaaggct 
aaaattctct 
gggtagtggt 
gattgattac 
caaactgtgc 
agaccctcat 
agatcttgaa 
ggggaatact 
caaacaagaa 
ttgcgctgca 
cagcttggcc 
tgcaacctcg 
tttcgataga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1788 



<210> 101 
<211> 586 
<212> PRT 
<213> SHRIMP 



<400> 101 

Met Ala Glu Ala Ala Pro Arg Tyr Arg Gin Val Leu Glu Glu Val Leu 

15 10 15 

Glu Asn He Glu Pro Tyr Met Ser Phe Leu Asp Val Phe Thr Glu Arg 

20 25 30 

Glu Leu Ala Leu Leu Asn Asp He He Thr Ser Arg Asn Ser Pro Pro 

35 40 45 

Val Pro Ser Ser Ser Phe Lys Lys Leu Asp Asn Lys Glu Glu Phe Arg 

50 55 60 

Asp He lie Tyr Phe Phe He Asn Asn Asn Thr Lys Ser Asp Ser Ser 
65 70 75 80 

Pro He Cys Glu Gly Met Thr Phe He Asn Ala Leu Thr Thr Val Cys 
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Lys 


Thr 


Phe 


Arg 








100 


Tyr 


Ser 


Leu 


Leu 






115 




His 


Gly 


He 


Lys 




130 






Phe 


Thr 


Ser 


Lys 


145 








Arg 


He 


Lys 


Lys 


Ser 


Ser 


Ala 


Arg 








180 


Val 


lie 


Arg 


Phe 






195 




Tyr 


Val 


Ala 


Glu 




210 






Ser 


Phe 


Gly 


Tyr 


225 








Ala 


Lys 


Arg 


Leu 


Ser 


Cys 


Leu 


Glu 








260 


Glu 


Met 


Asn 


Met 






275 




Glu 


Glu 


Arg 


Ala 




290 






Ser 


Ser 


He 


Glu 


305 








Lys 


Thr 


Val 


Ala 


Lys 


Ala 


Glu 


Thr 








340 


Pro 


Lys 


Phe 


Ser 






355 




lie 


Asn 


Glu 


Ala 




370 






Met 


He 


Asn 


Ser 


385 








lie 


Cys 


Leu 


Arg 


Asp 


Asn 


Leu 


He 








420 


Thr 


Asp 


Phe 


Val 






435 




Phe 


Met 


Glu 


Asp 




450 






His 


Thr 


Tyr 


Pne 


465 








He 


Asn 


Lys 


Tyr 


Asn 


He 


Leu 


Lys 








500 


Ala 


Ala 


He 


Ser 






515 




Thr 


Thr 


Asn 


Ser 




530 






Val 


Phe 


His 


Ala 


545 








Lys 


Lys 


Tyr 


Lys 



1 XT 

Gly 


G1U 


Asn 


1 lc 


vai 


Ser 




Asp 








ion 


pne 


G_Ly 


Tnr 


Gly 






135 




GlU 


Cain 


Ala 


Glu 




150 






Leu 


Asp 


Gly 


Val 


165 








Ser 


Gly 


lie 


vai 


Arg 


Leu 


Pne 


Ser 








200 


Met 


Lys 


Lys 


Ser 






215 




Lys 


Tyr 


Phe 


Val 




230 






He 


Ser 


Asn 


Gly 


245 








Thr 


Ser 


Pro 


Ser 


Asp Ala 


Arg 


Ser 








280 


Ser 


Ala 


Tyr 


Arg 






295 




Glu 


Glu 


Met 


Val 




310 






Glu 


Met 


Ala 


Val 


325 








He 


Gin 


His 


Phe 


Ala 


Glu 


Met 


Met 








360 


Val 


Gly 


Ser 


Gly 






375 




Arg 


Pro 


Phe 


Glu 




390 






Glu 


He 


Met 


Val 


4 05 








Val 


Lys 


Arg 


He 


Lys 


Gly Tyr 


Val 








440 


Leu 


Glu 


Ala 


Phe 






455 




Asp 


Glu 


Lys 


Gly 




470 






Ser 


His 


Thr 


Ser 


485 








Pro 


Ser 


Ala 


Lys 


Ser 


Pro 


Phe 


Thr 








520 


Leu 


Ala 


Phe 


Ser 






535 




Ala 


Ala 


Val 


Ala 




550 






Asn 


He 


He 


Tyr 



565 











HIS 


Asp 


Asp 


Phe 


105 








Asn 


Gly 


Phe 


Leu 


Asp 


Asp 


Ser Arg 








140 


GlU 


GlU 


Arg 


Glu 






155 




Leu 


Ala 


Ser 


Leu 




1 /U 






Phe 


Tyr 


Val 


Glu 


185 








Arg 


lie 


Val 


Asn 


Ala 


Pro 


Leu 


Glu 








220 


Asp 


Glu 


Ser 


Pro 






235 




Asn 


Pne 


He 


He 




OCA 






Ser 


val 


Ser 


Thr 


265 








He 


Ala 


Arg 


Leu 


Ser 


Val 


He 


He 








300 


Glu 


Glu 


Tyr 


Cys 






315 




Glu 


Phe 


Val 


Asp 




330 






Gin 


Thr 


Lys 


He 


345 








Asp 


Asn 


He 


Leu 


Leu 


Cys 


Gly Ala 








380 


Glu 


He 


Asp 


Tyr 






395 




Arg 


Lys 


Lys 


Thr 




410 






Val 


Ser 


His 


Lys 


425 








Arg 


Leu 


Cys 


Val 


Leu 


Thr 


Lys 


Asn 








460 


Asn 


Thr 


Leu 


Leu 






475 




Lys 


Leu 


val 


Lys 




4 90 






Gly 


Met 


Ser 


Pro 


505 








Thr 


Arg 


Val 


Gly 


Phe 


He 


Asn 


Glu 








540 


Thr 


Ser 


Cys 


Asn 






555 




Asp 


Phe 


Asp 


Arg 




570 







Leu 


I: ilC 


Wa 1 
v ax 


Lys 




1 1 n 

J. -L \J 






Arg 


Arg 


Glu 


Thr 


125 








fh t- 
± 11 X. 


rii \/ 


Pho 




Lys 


Val 


ncu 


Arg 








lou 


Lys 


Lys 


oer 


T)-i T* 

inr 






1 "7 C 
1 iO 




Lys 


Cys 




jer 




190 






He 


Thr 


Ser 


Asp 


205 








Pro 


Phe 


Asp 


Ti- 
ne 


Cys 


Val 


i nr 


Lys 










Gly Arg 


Pro 


irne 










Asp 


Phe 


Arg 


ulU 




270 






Asn 


Trp 


Thr 


Asn 


285 








Lys 


Ser 


.rne 


Leu 


Glu 


Thr 


inr 


inr 








—} f\ 


Val 


Phe 


He 


Glu 






335 




Phe 


Asp 


rr» W v 

Tnr 


Met 




350 






Lys 


Asn 


Val 


Ala 


365 








He 


Leu 


lieu 


Trp 


Asn 


Tyr 


Phe 


Lys 








A f\f\ 
H UU 


Asp 


Lys 


Leu 


Cys 










Asn 


val 


vai 


lie 




430 






Lys 


Val 


Ser 


Cys 


445 








Pro 


Trp 


Leu 


i>ys 


Cys 


Tyr 


Cys 


Tin 

lie 








4 80 


Gin 


Glu 


Lys 


Leu 






495 




Leu 


Met 


Val 


Cys 




510 






He 


Glu 


He 


Leu 


525 








Asn 


Asn 


Glu 


Asn 


Phe 


Leu 


Asp 


Ala 








560 


Ser 


He 


Val 


Asn 






575 
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Ala Arg Arg Arg Ala Met Val Gin Arg Pro 
580 585 



<210> 102 
<211> 1197 
<212> DNA 
<213> SHRIMP 



<400> 102 

atggccggaa 

gggtgtatcg 

tcaatcaaaa 

aaggatatgg 

gaaaatgatg 

gaaataaagt 

ggatgtcttc 

tttgataaga 

actaaaatag 

gagggcaaaa 

actcctttac 

aatgtaaaaa 

tgctgggttc 

catcaggcag 

aatgaagcag 

ttggagttgc 

agaacaagga 

cttcacaccc 

gatagggtca 

aggattaatc 



ctgacatcat 
tcagcaagaa 
cttcctcttc 
ttaagaagtg 
ctctgagagc 
atggggaagg 
ttgcaaagaa 
atgggttcat 
ctaagaaggt 
taccttgcca 
cccccactcc 
ataagccagt 
acaaactctt 
gaaggcgtga 
aagacactga 
tagacacggc 
ttgtgaggga 
cctttgatgc 
gagacggcag 
agctccagca 



cagcagcagc 
aggaaagaca 
atcagaaatg 
tgccagctgt 
atctatagaa 
ggagataggg 
tgaaaagtcc 
gatgttgtct 
ttttggaggc 
ggagttccat 
tgctcctgtc 
ttttgtgcct 
ctctgatgac 
cgccctcatg 
gaggtttgtg 
taatatgccc 
ctatgatgct 
accaaatgtt 
acgcaacact 
ccagtttctg 



agcagcggca 
ataaaaggga 
atgaagaaac 
aagaaggttg 
agtacctgct 
gatgttttga 
ttttataagt 
gaaggtatga 
tgtaaggccg 
aaacctagtt 
aaggtgttgc 
gacttggctg 
cctgaggaaa 
gaagattatg 
tctaacgcac 
cctcccgcat 
agtcctgtac 
aatttgaacc 
tcacggagga 
tactatagct 



gcagcagcag 
agaatatcgt 
ataagaaaag 
attatagctc 
cagcactcaa 
gtgcaattag 
ttttcttgag 
aaaggattga 
cccctctaaa 
catataaggg 
cccctcttct 
taggggaggc 
gaaaacggct 
gtgttattcc 
ttgaatacca 
ccacacctgt 
cttctcctta 
caggttcagg 
catctgccgt 
ctgatagtga 



caagaaggga 
ctttaagaca 
aatggaaata 
atcaaccttg 
taggtttcct 
gctaatggcg 
gggttttcag 
gaaaatgcat 
agaggacagg 
tgaatatacc 
tccctataag 
taagaaacca 
ctttgagagg 
taataatgac 
ggcacagatg 
tagaagggga 
ttcttcacct 
taggatggtt 
catggccagg 
tttttaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1197 



<210> 103 
<211> 396 
<212> PRT 
<213> SHRIMP 



<400> 103 



Met 


Ala 


Gly 


Thr 


Asp 


He 


He 


Ser 


Ser 


Ser 


Ser 


Ser 


Gly 


Ser 


Ser 


Ser 


1 








5 










10 










15 




Ser 


Lys 


Lys 


Gly 
20 


Gly 


Cys 


He 


Val 


Ser 
25 


Lys 


Lys 


Gly 


Lys 


Thr 
30 


He 


Lys 


Gly 


Lys 


Asn 
35 


He 


Val 


Phe 


Lys 


Thr 
40 


Ser 


He 


Lys 


Thr 


Ser 
45 


Ser 


Ser 


Ser 


Glu 


Met 
50 


Met 


Lys 


Lys 


His 


Lys 
55 


Lys 


Arg 


Met 


Glu 


He 
60 


Lys 


Asp 


Met 


Val 


Lys 


Lys 


Cys 


Ala 


Ser 


Cys 


Lys 


Lys 


Val 


Asp 


Tyr 


Ser 


Ser 


Ser 


Thr 


Leu 


65 










70 










75 










80 


Glu 


Asn 


Asp 


Ala 


Leu 
85 


Arg 


Ala 


Ser 


He 


Glu 
90 


Ser 


Thr 


Cys 


Ser 


Ala 
95 


Leu 


Asn 


Arg 


Phe 


Pro 
100 


Glu 


He 


Lys 


Tyr 


Gly 
105 


Glu 


Gly 


Glu 


He 


Gly 
110 


Asp 


Val 


Leu 


Ser 


Ala 
115 


He 


Arg 


Leu 


Met 


Ala 
120 


Gly 


Cys 


Leu 


Leu 


Ala 
125 


Lys 


Asn 


Glu 


Lys 


Ser 
130 


Phe 


Tyr 


Lys 


Phe 


Phe 
135 


Leu 


Arg 


Gly 


Phe 


Gin 
140 


Phe 


Asp 


Lys 


Asn 


Gly 


Phe 


Met 


Met 


Leu 


Ser 


Glu 


Gly 


Met 


Lys 


Arg 


He 


Glu 


Lys 


Met 


His 


145 










150 










155 










160 


Thr 


Lys 


He 


Ala 


Lys 
165 


Lys 


Val 


Phe 


Gly 


Gly 
170 


Cys 


Lys 


Ala 


Ala 


Pro 
175 


Leu 


Lys 


Glu 


Asp 


Arg 
180 


Glu 


Gly 


Lys 


He 


Pro 
185 


Cys 


Gin 


Glu 


Phe 


His 
190 


Lys 


Pro 
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oer 


Ser 


Tyr 
195 


Lya 


uiy 


V3 JL U 


1 y - L 


Thr 
200 


Thr 


Pro 


Leu 


Pro 


Pro 
205 


Thr 


Pro 


Ala 


Pro 


vdl 


Lys 


vet J. 


Leu 




Pro 


Leu 


Leu 


Pro 




Lys 


Asn 


Val 


Lys 


Asn 












£. X J 










220 












rXU 


Vdl 




vdl 




Asp 


Leu 


Al a. 


Val 


Gly 


Glu 


Ala 


Lys 


Lys 


Pro 


225 










230 










235 










240 


Cys 


Trp 


va j. 


HIS 


Lys 


Leu 


OK a 

trie 


oer 


Asp 


Asp 


fro 


UiU 


r;i n 

UXU 


A t~ n 


Lys 


Arg 








245 










250 










255 




Xj6U 




bill 


Arg 
260 


Hie 
nx o 


-LI 1 


a 1 a 
AXo 




Arg 
265 


Arg 


Asp 


Ala 


Leu 


Met 
270 


Glu 


Asp 


Tyr 




val 


He 


Pro 


Asn 


Asn 


Asp 


Asn 




Ala 


Glu 


Asp 


Thr 


Glu 


Arg 




275 










280 










285 








nu _ 

rue 


Vd 1 

290 


Ser 


Asn 


Ala 


Leu 


295 


Tyr 




AXa 


Gin 


Met 

300 


Leu 


Glu 


Leu 


Leu 


Asp 


Thr 


Ala 


Asn 


Met 


Pro 


Pro 


Pro 


Ala 


Ser 


Thr 


Pro 


Val 


Arg 


Arg 


Gly 


305 










310 










315 










320 


Arg 


Thr 


Arg 


He 


Val 
325 


Arg 


Asp 


Tyr 


Asp 


Asp 
330 


Val 


Pro 


Ser 


Pro 


Tyr 
335 


Ser 


Ser 


Pro 


Leu 


His 
340 


Thr 


Pro 


Phe 


Asp 


Ala 
345 


Pro 


Asn 


Val 


Asn 


Leu 
350 


Asn 


Pro 


Gly 


Ser 


Gly Arg 


Met 


Val 


Asp 


Arg 


Val 


Arg 


Asp 


Gly Arg 


Arg 


Asn 


Thr 






355 










360 










365 








Ser 


Arg 
370 


Arg 


Thr 


Ser 


Ala 


Val 
375 


Met 


Ala 


Arg 


Arg 


He 
380 


Asn 


Gin 


Leu 


Gin 


His 


Gin 


Phe 


Leu 


Tyr 


Tyr 


Ser 


Ser 


Asp 


Ser 


Asp 


Phe 











385 390 395 



<210> 104 

<211> 3796 

<212> DNA 

<213> SHRIMP 



<400> 104 

atggcgtcct 

caagaaaaac 

ccatctcaag 

cctcctccac 

gaacggctaa 

acaaacgagc 

aaggataatt 

ataggegcac 

ctcgatataa 

agggaccatg 

accaaa tgcg 

aaactggtgg 

agaggegcta 

aatgaaaaca 

atategcacg 

ggcaacactt 

atagggtgga 

gacgacaatt 

aaatttaaaa 

aagactcaag 

tttgcagaaa 

acaaaaagca 

gecttttatg 

ttctttaacc 

ttgtccacat 

tccgtctcag 

gtgggtgtga 



eggggggatt 
aagagaaaaa 
ctccagatcc 
ctaatcctct 
agaegtcaaa 
gtgttgatgc 
acttccaggt 
aegtgaaaca 
aaaataagtg 
agaaccttat 
tgggggcaga 
taaaaaaact 
ccagaaattt 
aggacattcc 
tcttttccac 
tttatcaacg 
cagaaagtga 
tattegtget 
acgtcctcta 
agaegctaat 
tcgacaagtg 
ctgtcataca 
ctctaaataa 
ttttctcctt 
tgacgttcgt 
ccaagcggct 
gggattttgg 



ctttacagga 
taaaccaact 
agtcccagac 
ccctcctcct 
aatacgttta 
gttgaaagag 
tttaaagtgc 
ggtggccaaa 
ctctttagtg 
actacaaatt 
aatatgtgtt 
ccccaatgca 
actggagaat 
tccttcagaa 
tctacacaga 
aaaaccaaca 
agcatcaaaa 
accccattct 
taaaaatagt 
aaatccccag 
tttggatgtt 
atatagagga 
ggcaaaacat 
catgtattgt 
ctatcaacac 
catggatatc 
ttcaccttca 



atagatgacc 
caagcaccag 
ccagttccta 
cctcctcctc 
aacaaagctc 
aaccaagcat 
tcgataacac 
agtagcgaaa 
tacaacgaaa 
gccgtccagt 
aaaggcaacg 
ccatcctcat 
aatttcaaca 
cgagccaacc 
ctggacacta 
ttcgataata 
caaaccacta 
ttcaataatt 
accgcacatc 
attgattegg 
ettttggeca 
aagtttagaa 
tctcgcgcag 
catggtccgt 
atgtttttcc 
gattccgccc 
aaaacaagtc 



ttttcaagac 
aaacagaacc 
aaacaccaac 
ctcctaagcc 
ttagtgatat 
taaatacaga 
cttctgtacc 
tcgaactggc. 
atgagtcgtt 
tattctctag 
aaaaaaacaa 
cttcaactgt 
agggagaaaa 
tggacacgac 
aaaggaagct 
aattcaggtg 
aategctaga 
tggcagacca 
ccggcaaacg 
cgaaagagta 
tagggaagaa 
ggtatttaat 
tatcccctct 
ttctccattc 
ccatgggcac 
taatgaaagg 
tccatacaag 



agtgattcaa 
aaaaccaggg 
caatttctgt 
ctcaagagaa 
tgttgaagcc 
atatgacaag 
aacagctatt 
cgtgaacgaa 
aaaatttttc 
gcacgataac 
gtttgttaac 
gctagaaatt 
taacactgtc 
caaggcagaa 
tttctttaaa 
gacagaagtt 
caagccaacg 
cttacgtttg 
aaattactac 
caagatggtc 
tgacaaatac 
attctgetae 
accatttaat 
cgccagtttt 
agccgcccca 
aggaaagggg 
aacattggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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tctttcttag 
gaagtgcgtg 
gattcagtca 
aaagtatcac 
aaggtgaggg 
gaagaggcgg 
aaagccctcg 
tacaatttag 
gccgaaggag 
gacaaatact 
gaacaaaaca 
tacgaagaag 
ttgaatagag 
atactcgata 
gccgtgtggt 
ggcaactacg 
aattttggaa 
tttcacagag 
acatccctat 
cgaacccacg 
acaagaggtg 
gggcacacga 
caagagaatg 
ttgggaagag 
gctgcagcca 
gactcttcta 
tcggccgcca 
gcaccagcag 
ttcttttctg 
tgtcccaata 
gaaagtgaaa 
gaaatggtat 
aatcaagacg 
gatatggttg 
attgcagccc 
gaagaccctg 
tactgacgag 



gttttgctga 
tatctccagc 
tctttatata 
tcgaatcggc 
ccgccaagag 
gcctgacact 
gattacctgt 
gggattttgt 
aaaccgcagg 
acggcgaagg 
caccgtatga 
aaaaggatgt 
acgtggagga 
aaactaacca 
gcaaaaggaa 
aatggcttgg 
tattaggtga 
acagtgatag 
ctgatcgaaa 
taggaagagt 
ataaagacat 
attcagctaa 
gattgtgggg 
gacgtttagg 
ttgcaatttc 
aagtgagaaa 
gagaaaaggc 
ctggttcatc 
acccagacaa 
caggatttct 
cctctatagt 
gcccttcaac 
ctgtactgac 
aagacgagga 
gtatcagtaa 
aattgcaaag 
atctaa 



aatggctatg 
tctccaacaa 
tttcaccttt 
gcttcgcctc 
atgccgaata 
ctcttatgcc 
ccctaagata 
aggcgtggaa 
atttctcggc 
agccttttca 
cacatcttca 
tccacagcag 
atatttaatg 
aaaagaacgt 
cgtactgaaa 
tgctaaaatg 
cttgaaaata 
attaccctca 
acaattggcc 
gactgcaaca 
gtttgccgcc 
ttttgttcca 
gtatactcgc 
gggcctgaat 
ttctgcctta 
aatagcctcc 
gagagaagct 
cagaaacagt 
gcgccagaag 
tcatgctgct 
gagattacgt 
agctcccgaa 
gttggatgat 
agcgcgtgaa 
agtgtgtgag 
tgtggatgaa 



ggaacaatga 
aggatatcta 
gttttattcc 
atcatggggc 
gaagcagcgg 
catctattgg 
aaccctctca 
caacttctaa 
atgtttgata 
gacgtggttg 
gcgttgatga 
gaagaaaatt 
gcttctccta 
ttcatgtctg 
aaggattgga 
tgcaaccatt 
accaataaac 
gttgcagatc 
cttgttcact 
tcatgggctg 
ctatcttcat 
tattttagta 
agaacctctg 
aaggtagggg 
gatatggggg 
acctgcttaa 
agtattaaac 
aacaggtttc 
cttataaagg 
gttcctgatt 
ttgagactga 
gctaataaga 
gaagataaca 
agattacgcc 
cggaaaaatc 
caattgatac 



cggctctctt 
aatccctaga 
acagattcag 
agacgcacgc 
aaatggaagg 
gtcttcctta 
tgacagcatc 
aggctaagag 
atctagtgaa 
aaaatgtaaa 
cacctatccc 
ctacacaaca 
tgaagatggt 
ttggggatat 
acgaatacgc 
tacttttagc 
ttgacacaaa 
agaaaaaatt 
cgtgcgttaa 
tcgatgcgct 
cgctggatat 
gaaattacct 
aaaaattggc 
tggctaaaac 
aagtagaagc 
atgttaatgc 
gtcttcttct 
tcctcaaaga 
gtgaagcagt 
ttgttattga 
ttaaacctga 
agaggaaatt 
tcgttaaata 
accaggacaa 
caaagaaaaa 
gggaactggc 



atctggtgta 
aagatggtgt 
tggtgcgaaa 
ccacacaaat 
tgtggaagaa 
ctctatacaa 
ttcttctcaa 
agagtttcca 
agattctatt 
acaaggcatg 
taaagcattc 
aagatataga 
gtttgtgtct 
tgcccttctg 
tatcgctaaa 
tgatttagtg 
taccgacacc 
tataaaaaac 
cgtgagcacc 
tcgtacctat 
gtaccatctt 
atgtaacgaa 
caaagaagaa 
agaactggct 
tgtaatggac 
agccaaggtc 
ggccactaat 
tttgtggggg 
ttctgtacta 
gtattccttc 
aaaacaagac 
agtaaggaat 
caacaaatat 
acaatcggtt 
acgtcgttta 
tgccattgcc 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3796 



<210> 105 
<211> 1255 
<212> PRT 
<213> SHRIMP 



<400> 105 



Met 


Ala 


Ser 


Ser 


Gly 


Gly 


Phe 


Phe 


Thr 


Gly 


He 


Asp 


Asp 


Leu 


Phe 


Lys 


1 








5 










10 










15 




Thr 


Val 


He 


Gin 


Gin 


Glu 


Lys 


Gin 


Glu 


Lys 


Asn 


Lys 


Pro 


Thr 


Gin 


Ala 








20 










25 










30 






Pro 


Glu 


Thr 


Glu 


Pro 


Lys 


Pro 


Gly 


Pro 


Ser 


Gin 


Ala 


Pro 


Asp 


Pro 


Val 






35 










40 










45 








Pro 


Asp 


Pro 


Val 


Pro 


Lys 


Thr 


Pro 


Thr 


Asn 


Phe 


Cys 


Pro 


Pro 


Pro 


Pro 




50 










55 










60 










Asn 


Pro 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Lys 


Pro 


Ser 


Arg 


Glu 


65 










70 










75 










80 


Glu 


Arg 


Leu 


Lys 


Thr 


Ser 


Lys 


He 


Arg 


Leu 


Asn 


Lys 


Ala 


Leu 


Ser 


Asp 










85 










90 










95 




lie 


Val 


Glu 


Ala 


Thr 


Asn 


Glu 


Arg 


Val 


Asp 


Ala 


Leu 


Lys 


Glu 


Asn 


Gin 








100 










105 










110 






Ala 


Leu 


Asn 


Thr 


Glu 


Tyr 


Asp 


Lys 


Lys 


Asp 


Asn 


Tyr 


Phe 


Gin 


Val 


Leu 






115 










120 










125 








Lys 


Cys 


Ser 


He 


Thr 


Pro 


Ser 


Val 


Pro 


Thr 


Ala 


He 


He 


Gly 


Ala 


His 
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1 jU 






Val 


Lys 


Gin 


val 


145 








Leu 


Asp 


He 


Lys 


Leu 


Lys 


Phe 


Phe 








loU 


Gin 


Leu 


Phe 


Ser 






195 




Cys 


Val 


Lys 


Gly 




210 






Lys 


Lys 


Leu 


Pro 


225 








Arg 


Gly 


Ala 


Thr 


Asn 


Asn 


Thr 


Val 








260 


Asp 


Thr 


Thr 


Lys 






275 




Leu 


Asp 


Thr 


Lys 




290 






Arg 


Lys 


Pro 


Thr 


305 








Trp 


Thr 


Glu 


Ser 


Pro 


Thr 


Asp 


Asp 








340 


Ala 


Asp 


His 


Leu 






355 




Thr 


Ala 


His 


Pro 




370 






lie 


Asn 


Pro 


Gin 


385 








Glu 


lie 


Asp 


Lys 


Lys 


Tyr 


Thr 


Lys 








420 


Tyr 


Leu 


He 


Phe 






435 




Ser 


Arg 


Ala 


Val 




450 






Phe 


Met 


Tyr 


Cys 


4 65 








Thr 


Leu 


Thr 


Phe 


Ala 


Pro 


Ser 


Val 








500 


Met 


Lys 


Gly 


Gly 






515 




Lys 


Thr 


Ser 


Leu 




530 






Glu 


Met 


Ala 


Met 


545 








Arg 


Val 


Ser 


Pro 


Trp 


Cys 


Asp 


Ser 








580 


Arg 


Phe 


Ser 


Gly 






595 




He 


Met 


Gly 


Gin 



610 







1 It: 




Ala 


Lys 




oer 




1DU 






Asn 


Lys 


Cys 


oer 


165 








Arg 


Asp 


His 


Glu 


Arg 


HIS 


Asp 


Asn 










Asn 


Glu 


Lys 


Asn 






1 D 




Asn 


TV 1 

Ala 


Pro 


ber 










Arg 


Asn 


Leu 


lieu 


245 








Asn 


Glu 


Asn 


Lys 


Ala 


GlU 


lie 


Ser 










Arg 


Lys 


Leu 


Pne 










Pne 


Asp 


Asn 


Lys 










Glu 


Ala 


Ser 


Lys 


325 








Asn 


Leu 


Phe 


Val 


Arg 


Leu 


Lys 


Phe 








360 


Gly 


Lys 


Arg 


Asn 






375 




He 


Asp 


Ser 


Ala 










Cys 


Leu 


Asp 


Val 


405 








Ser 


Thr 


Val 


He 


Cys 


Tyr 


Ala 


Phe 








4 40 


Ser 


Pro 


Leu 


Pro 










His 


Gly 


Pro 


Pne 




470 






Val 


Tyr 


Gin 


His 


485 








Ser 


Ala 


Lys 


Arg 


Lys 


Gly 


Val 


Gly 








520 


His 


Thr 


Arg 


Thr 






535 




Gly 


Thr 


Met 


Thr 




550 






Ala 


Leu 


Gin 


Gin 


565 








Val 


He 


Phe 


He 


Ala 


Lys 


Lys 


Val 








600 


Thr 


His 


Ala 


His 






615 





140 

Glu He Glu Leu 
155 

Leu Val Tyr Asn 
170 

Asn Leu He Leu 
185 

Thr Lys Cys Val 

Lys Phe Val Asn 
220 

Ser Ser Ser Thr 
235 

Glu Asn Asn Phe 
250 

Asp He Pro Pro 
265 

His Val Phe Ser 

Phe Lys Gly Asn 
300 

Phe Arg Trp Thr 
315 

Gin Thr Thr Lys 
330 

Leu Pro His Ser 
345 

Lys Asn Val Leu 

Tyr Tyr Lys Thr 
380 

Lys Glu Tyr Lys 
395 

Leu Leu Ala He 
410 

Gin Tyr Arg Gly 
425 

Tyr Ala Leu Asn 

Phe Asn Phe Phe 
460 

Leu His Ser Ala 
475 

Met Phe Phe Pro 
490 

Leu Met Asp He 
505 

Val Arg Asp Phe 

Leu Val Ser Phe 
540 

Ala Leu Leu Ser 
555 

Arg He Ser Lys 
570 

Tyr Phe Thr Phe 
585 

Ser Leu Glu Ser 

Thr Asn Lys Val 
620 



Ala 


Val 


Asn 


Glu 








i fin 


UlU 


Asn 


\J A. U 


dcl 










V3J.I1 


Tip 
X J.C 


Ala 


v a J- 




i on 






Gly 


Ala 


Glu 


He 


205 








Lys 


Leu 


vai 


vai 


v ax 


Leu 


blu 


1 1c 








o ^ n 

4t ^ u 


Asn 


Lys 


oiy 


bill 










Car 

OCX. 


nla 


Asn 


Leu 




OTft 
£. / U 






Thr 


Leu 


His 


Arg 


285 








Tnr 


Pne 


Tyr 


bin 


blu 


Val 


1 1€ 


bi.y 








ion 


Car 


Leu 


H en 

/AO £J 


T i/Q 










rne 


Asn 


Asn 


Leu 










Tyr 


Lys 


Asn 


Ser 


365 








Gin 


GlU 


Thr 


Leu 


Met 


val 


Pne 


Ala 








H UU 


Gly 


Lys 


Asn 


Asp 






H lO 




Lys 


Pne 


TV 

Arg 


Arg 




A *S f\ 

4 JU 






Lys 


Ala 


Lys 


His 


445 








Asn 


Leu 


Oka 

Pne 




ber 


rne 


Leu 










vt Q n 


Met 


uiy 


Tnr 


Ala 










Asp 


Ser 


Aia 


Leu 




Dili 






Gly 


Ser 


Pro 


Ser 


525 








Leu 


Gly 


Phe 


Ala 


Gly 


Val 


Glu 


Val 








560 


Ser 


Leu 


Glu 


Arg 






575 




Val 


Leu 


Phe 


His 




590 






Ala 


Leu 


Arg 


Leu 


605 








Arg 


Ala 


Ala 


Lys 
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Arg 




Arg 


xxe 


Glu 


Ala 


Ala 


Glu 


Met 


Glu 


Gly 


Val 


Glu 


Glu 


Glu 


Glu 


625 








630 










635 










640 


Ala 


biy 


Leu 


Thr 


Leu 


Ser 


Tyr 


r\ x a 


His 


Leu 


Leu 


Gly 


Leu 


Pro 


Tvr 


Ser 








645 










650 










655 




lie 


d"\ t-» 
uin 


Lys 


Ala 


Leu 


Gly 


L6U 




Val 


Pro 


Lys 


He 


Asn 


Pro 


Leu 


Met 






660 










665 










670 






1 11 X 


ax a 


Ser 


Ser 


Ser 


Gin 


iyr 


Asn 


Leu 


Gly Asp 


Phe 


Val 


Glv 


Val 


Glu 






675 










680 










685 










Leu 


Leu 


Lys 


Ala 


Lys 




VJ1U 




Pro 


Ala 


Glu 


Gly 


Glu 


Thr 


Ala 




690 




















700 












rue 


Leu 


Gly 


Met 


Phe 


Asp 


Asn 


Leu 


vai 


Lys 


Asp 


Ser 


He 


Asp 


Lys 


/ Uj 










710 










715 










720 


Tyr 


Tyr 


Gly Glu 


Gly Ala 




o^?x 


ASp 


vai 


Val 


Glu 


Asn 


Val 


Lys 


Gin 








725 










Tin 










735 




CaXy 


Met 


Glu 


Gin 


Asn 


Thr 


Prn 


Tyr 


ASp 


i nr 


Ser 


Ser 


Ala 


Leu 


Met 


Thr 








740 










745 










750 






Pro 


T 1 A 

xxe 


Pro 


Lys 


Ala 


Phe 


Tyr 


UXU 


fZl ii 


G1U 


Lys 


Asp 


Val 


Pro 


Gin 


Gin 






755 








760 










765 








olu 


r*i »i 
olu 


Asn 


Ser 


Thr 


Gin 


Olll 


Arg 


Tyr 


Arg 


Leu 


Asn 


ftt y 


Asp 


Val 


Glu 




Tin 










77S 










780 










(j-LU 


Tyr 


Leu 


Met 


Asp 


Met 


Lys 


Mot* 

Fid- 


vox 


rne 


Val 


Ser 


He 


Leu 


Asp 


Lys 


not 
/ OD 








790 










795 










800 


Tnr 


TV 

Asn 


Gin 


Lys 


Glu 


Arg 


rne 


Mai- 
ne l_ 


OCX 


Val 


Gly Asp 


He 


Ala 


Leu 


Leu 










805 










BIO 










815 




ax a 


vax 


Trp 


Cys 


Lys 


Arg 


Asn 


Val 


Leu 


Lys 


Lys 


Asp 


xrp 


Asn 


Glu 


xyx 








820 










O ^ ~J 










830 






Aid 


xxe 


Ala 


Lys 


Gly 


Asn 


Tyr 


V2.L Li 


Trp 


Leu 


Gly 


Ala 


T \/Q 


Met 


Cys 


Asn 






835 










84 0 










845 








His 


Leu 


Leu 


Leu 


Ala 


Asp 


Leu 


Val 


Asn 


Phe 


Gly 


He 




Gly Asp 


Leu 




OCA 

ODU 










pec 
ODD 










860 










Lys 


ixe 


Thr 


Asn 


Lys 


Leu 


Asp 


inr 


Asn 


Thr 


Asp 


Thr 


Phe 


His 


Arg 


Asp 


O £. C 
ODD 










870 










875 










880 


Ser 


Asp 


Arg 


Leu 


Pro 


Ser 


vax 


Aia 


Asp 


Gin 


Lys 


Lys 


Ph o 
t lies 


He 


Lys 


Asn 








885 










890 










895 




Thr 


Ser 


Leu 


Ser 


Asp 


Arg 


Lys 


Vain 


Leu 


Ala 


Leu 


Val 


n x ^ 


Ser 


Cys 


vox 








900 










905 










910 






Asn 


Val 


Ser 


Thr 


Arg 


Thr 


His 


Val 


Gly 


Arg 


val 


Thr 


Ala 


Thr 


Ser 


irp 






915 










920 










925 








Ala 


Val 


Asp 


Ala 


Leu 


Arg 


Thr 


Tyr 


Thr 


Arg 


Gly 


Asp 


Lys 


Asp 


Met 


Phe 




930 










935 










94 0 










Ala 


Ala 


Leu 


Ser 


Ser 


Ser 


Leu 


Asp 


Met 


Tyr 


His 


Leu 


Gly 


His 


Thr 


Asn 


945 










950 










955 










960 


Ser 


Ala 


Asn 


Phe 


Val 


Pro 


Tyr 


Phe 


Ser 


Arg 


Asn 


Tyr 


Leu 


Cys 


Asn 


Glu 










965 










970 










975 




Gin 


Glu 


Asn 


Gly 


Leu 


Trp 


Gly 


Tyr 


Thr 


Arg 


Arg 


Thr 


Ser 


Glu 


Lys 


Leu 








980 










985 










990 






Ala 


Lys 


Glu 


Glu 


Leu 


Gly Arg 


Gly Arg 


Leu 


Gly 


Gin 


Lys 


Val 


Gly 


Val 




995 










1000 








1005 






Ala 


Lys 


Thr 


Glu 


Leu 


Ala 


Ala 


Ala 


Ala 


lie 


Ala 


He 


Ser 


Ser 


Ala 


Leu 




1010 








1015 








1020 








Asp 


Met 


Gly 


Glu 


Val 


Glu 


Ala 


Val 


Met 


Asp 


Asp 


Ser 


Ser 


Lys 


Val 


Arg 


1025 








1030 








1035 








1041 


Lys 


He 


Ala 


Ser 


Thr 


Cys 


Leu 


Asn 


Val 


Asn 


Ala 


Ala 


Lys 


Val 


Ser 


Ala 








1045 








1050 








1055 


Ala 


Arg 


Glu 


Lys 


Ala 


Arg 


Glu 


Ala 


Ser 


He 


Lys 


Arg 


Leu 


Leu 


Leu 


Ala 








1060 








1065 








1070 




Thr 


Asn 


Ala 


Pro 


Ala 


Ala 


Gly 


Ser 


Ser 


Arg 


Asn 


Ser 


Asn 


Arg 


Phe 


Leu 






1075 








1080 








1085 






Leu 


Lys 


Asp 


Leu 


Trp 


Gly 


Phe 


Phe 


Ser 


Asp 


Pro 


Asp 


Lys 


Arg 


Gin 


Lys 




1090 








1095 








1100 








Leu 


He 


Lys 


Gly 


Glu 


Ala 


Val 


Ser 


Val 


Leu 


Cys 


Pro 


Asn 


Thr 


Gly 


Phe 
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1105 1110 1115 1120 

Leu His Ala Ala Val Pro Asp Phe Val He Glu Tyr Ser Phe Glu Ser 

1125 1130 1135 

Glu Thr Ser He Val Arg Leu Arg Leu Arg Leu He Lys Pro Glu Lys 

1140 1145 1150 

Gin Asp Glu Met Val Cys Pro Ser Thr Ala Pro Glu Ala Asn Lys Lys 

1155 1160 1165 

Arg Lys Leu Val Arg Asn Asn Gin Asp Ala Val Leu Thr Leu Asp Asp 

1170 1175 1180 

Glu Asp Asn He Val Lys Tyr Asn Lys Tyr Asp Met Val Glu Asp Glu 
1185 1190 1195 1200 

Glu Ala Arg Glu Arg Leu Arg His Gin Asp Lys Gin Ser Val He Ala 

1205 1210 1215 

Ala Arg He Ser Lys Val Cys Glu Arg Lys Asn Pro Lys Lys Lys Arg 

1220 1225 1230 

Arg Leu Glu Asp Pro Glu Leu Gin Ser Val Asp Glu Gin Leu He Arg 

1235 1240 1245 

Glu Leu Ala Ala He Ala Tyr 
1250 1255 



<210> 106 
<211> 1824 
<212> DNA 
<213> SHRIMP 



<400> 106 

atgagagacg 

tacgattgtt 

ggcgtggacg 

gataaatgca 

ctaactaaac 

agttgtaata 

aataatagta 

ggtgccgaat 

gagaaagaag 

ggctcggcag 

ggctcttgga 

gaagttttag 

tccgatctgt 

gcaagaaat t 

gaagagaatg 

gtttctgaga 

gattgttgtg 

caacccaaga 

aaaaccaatc 

cggaaaatga 

tccgctgctg 

aacaacccct 

tgccctcaaa 

gcagaaagtg 

actcaacaca 

gtccagcatg 

agaattgaaa 

cattcggcta 

aggactgctt 

atacctttaa 

atgctagctg 



atacttttaa 
gttgcccatt 
ccacgtctgt 
taccctttag 
ccgatttagc 
acactcaagg 
aaaagtatga 
ctattcacaa 
ccttcgccga 
ccggaggagg 
cgtcgttgtt 
tcagtaacaa 
tatatgccct 
tgtgtgttat 
gacctttcga 
caaattcatc 
atgatgatga 
aacaaaccaa 
acgacaacat 
tggattcttt 
ctggtggtgg 
cttcagtgag 
tagaagaagt 
tagtcaaaca 
gaaaaaagta 
taggatttaa 
ttgttttttc 
ttctaaggct 
cggtgaaagg 
gcaacaagga 
gtttgggaaa 



ccaagaaact 
ggttaaccgc 
gcgaagccga 
gataacgtcc 
tgctgcagct 
caattttgat 
agttacatgc 
gaaacccatg 
aagaattgaa 
aggtatgtac 
gttccacaca 
gataaaacac 
atgctccacc 
tgaaggtggg 
ctccaaaaca 
ggcactggcc 
cgatgatgat 
gaaacaaaaa 
gttgatgaat 
gtcaggaaaa 
tgcttcatcc 
tcctactaac 
cactattttt 
ttctgttgtt 
tatcctgcct 
caaatttcgc 
cgaccaatct 
attgtcatac 
aatcgatttt 
aataagagaa 
atga 



gcggtgaaac 
gtggagcgcc 
ccggctcttt 
cttattgagg 
tttgatgtat 
gtctcttcaa 
ccgtcatgca 
tctcttcttg 
ctcaagaaat 
aaggacagct 
tctaaaaagg 
acatcaagat 
actaacaact 
gaatttttat 
gaccttcaat 
gcttcttctt 
gatgatgaag 
acaacaacat 
gtacttaaaa 
aagggccaac 
gacgttgttg 
aatagggata 
tcacaacaca 
attaatggaa 
cacgaaaata 
attttgactg 
gactctgtgg 
ataagagaaa 
gtcgtgaaat 
cggcaattat 



ttgtacgatg 
ttctaggatc 
atgaagaaga 
gtatactttt 
cagaaaagct 
tgaccatatg 
ctgtcgagaa 
ccttctttaa 
tgtacctctc 
cccaacaatc 
acaagactcg 
tgcagcctag 
ctgcatctta 
attttaaata 
cattagttaa 
cctctttaga 
acgaaaaaac 
caacacttcc 
aaggagctgt 
actctaaaaa 
caggagaaaa 
gaaaagacta 
ggatgaacaa 
attgtttaaa 
ttcttttttg 
gcgtttcttg 
tgttgagtaa 
actcattgaa 
cacaggacac 
gctcagcttc 



gtatacagag 
gttcggagga 
taagaaggga 
ggaaagggct 
ggtgtattgt 
gattgatggc 
aattagtgga 
caatctggta 
cttactaacg 
ttccttcaac 
tttagaggct 
gtgcgtctgt 
cgcgtacaag 
cacaatcttt 
taacgagcct 
agacgacgat 
taagaagaaa 
acctatcagc 
taatggaaaa 
attgaaaacc 
tgaggaagag 
tgtgcttcca 
taacaagttg 
cttgtttgtt 
cccaccttta 
cttttttgat 
taatgctgcc 
gcgaagtgtg 
taatataggc 
aaccctgagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1824 



<210> 107 
<211> 603 
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<212> PRT 












<213> SHRIMP 










<400> 107 












Met 


Arg 


Asp 


Asp 


Thr 


Phe 


Asn 


Gin 


1 








5 








Trn 


Tvr 

xyx 


Thr 


Glu 


i yx 


Asp 


Cys 


Cys 








20 










Arg 


Leu 


Leu 


Gly 


Ser 


Phe 


Gly 


Gly 






35 










40 


Ser 


Arg 


Pro 


Ala 


Leu 


±yx 


Glu 


Glu 




50 










55 




Pro 


Phe 


Arg 


lie 


Thr 


Ser 


Leu 


Tip 
XXV 


65 










70 






Leu. 


Thr 


Lys 


Pro 


ASp 


Leu 


Al a 

<"lXCi 


Ala 










85 








Leu 


Val 


x yx 


Cys 


Cp r 
<JC1 




A QD 


A 








100 










Ser 


Met 


Thr 


He 




lie 


Asp 


Gly 






115 










120 


Thr 


Cys 


Pro 


Ser 


Cys 


Thr 


Val 


Glu 




130 










135 




Tip 

x x c 


Hie; 


ys 


Lys 


Pro 


Met 


Ser 


Leu 
eu 


X "a J 










150 








Xjy o 


OX U 


Ala 


Phe 


Ala 


V31U 


Arg 










165 










L6U 


Leu 
eu 


Thr 


ri "I xr 
uiy 


Ser 


AT a 


Ala 








180 










•JCi 




Gin 


Gin 


OCX 






Asn 






195 












m & 


Thr 


Cor 
Oct 


Lys 


Lys 


Asp 


Lys 


j. nr 




^ X Is 














o@r 


Asn 


T \/o 


He 


Lys 


nx a 


x 1 1 X 














?"^n 

A. o u 






OCX 


Asp 


L6U 


Leu 


Tyr 


Ala 
Ala 


Leu 


Cys 










t *i j 








xyr 


Ala 
nXa 


i yx 


Lys 


Aid 


Arg 


Asn 


Leu 








260 










x«eu 


iyr 


nit- 


Lys 


Tyr 


TK t- 
J. liX 


11c 


rne 






275 










^ O w 


• 

x*y s 


X I IX 


& ^ 


Leu 


rz 1 rv 

Ull 1 


Cor 

Ot£X 


x«eu 


v a x 




4- 27 1/ 










295 




Asn 


Ocr x 


Ser 


Ala 


Ala 


otsx 


Ser 


Ser 


305 










310 










ASp 


Asp 


ASp 


Asp 


ASp 


ASp 


















Lys 


Lys 


Gin 


Pro 


Lys 






X I IX 








340 










Thr 


Leu 


Pro 


Pro 


xie 


Ser 


T 

ijys 


inr 






355 










360 


Val 


Leu 


Lys 


Lys 


Gly 


Ala 


Val 


Asn 




370 










375 




Leu 


Ser 


Gly 


Lys 


Lys 


Gly 


Gin 


His 


385 










390 






Ala 


Ala 


Gly 


Gly 


Gly 


Ala 


Ser 


Ser 










405 








Glu 


Glu 


Asn 


Asn 


Pro 


Ser 


Ser 


Val 








420 










Lys 


Asp 


Tyr 


Val 


Leu 


Pro 


Cys 


Pro 






435 










440 


Ser 


Gin 


His 


Arg 


Met 


Asn 


Asn 


Asn 



Glu 


Thr 


Ala 


Val 


Lys 


Leu 


Val 


Arg 




10 










15 




Cys 


Pro 


Leu 


Val 


Asn 


Arg 


Val 


Glu 


25 










30 






Glv 
oxy 


Val 


Asp 


Ala 


Thr 


Ser 


Val 


Arg 










45 








A Qn 


T t/ o 


Lys 


Gly Asp 


Lys 


Cys 


Tip 

X JLXS 








60 










Glu 


Gly 


He 


Leu 


Leu 


Glu 


Arg 


Ala 






75 










80 


Ala 


Ala 


Phe 


Asp 


Val 


Ser 


Glu 


Lys 




90 










95 




Thr 


Gin 


Gly Asn 


Phe 


Asp 


Val 


Ser 


105 










110 






Asn 


Asn 


Ser 


Lys 


Lys 


Tyr 


Glu 


Val 










125 








Lys 


lie 


Ser 


Gly Gly Ala 


Glu 


Ser 








140 










Lieu 


Ala 


Phe 


Phe 


Asn 


Asn 


Lieu 


val 






155 










i fin 


Tip 
X XC 


Glu 


Leu 


Lys 


Lys 


Leu 


Tyr 


Leu 




170 










i it 

X / J 




vjj. y 


Gly Gly 


Gly Met 


Tyr 


Lys 


ASp 


185 










1 OA 

i yu 






vjx y 


Ser 


Trp 


Thr 


Ser 


Leu 


Leu 


IT lie? 










205 








Arg 


Leu 


Glu 


Ala 


Glu 


Val 


Leu 


\/a 1 

v a 1 








220 










Arg 


Leu 


Gin 


Pro 


Arg 


Cys 


\7a 1 
vol 


Cys 






235 












Oc X 


Thr 


Thr 


Asn 


Asn 


Ser 


Ala 


OCX 




250 










255 






Val 


He 


Glu 


Gly 


Gly 




rile 


265 










270 






ox u 


Glu 


Asn 


Gly 


Pro 


Phe 


- 

ASp 


Car 










285 








Asn 


Asn 


Glu 


Pro 


Val 


Ser 


O X LI 


l fix 








300 












Leu 


Glu 


Asp 


Asp 


Asp 


Asp 


Cys 






315 












ASp 


Asp 


Glu 


Asp 


Glu 


Lys 


Th r 
1 fix 


Lys 




330 










335 




Lys 


Lys 


Gin 


Lys 


Thr 


Thr 


Thr 


Ser 


345 










350 






Asn 


His 


Asp 


Asn 


Met 


Leu 


Met 


Asn 










365 








Gly 


Lys 


Arg 


Lys 


Met 


Met 


Asp 


Ser 








380 










Ser 


Lys 


Lys 


Leu 


Lys 


Thr 


Ser 


Ala 






395 










400 


Asp 


Val 


Val 


Ala 


Gly 


Glu 


Asn 


Glu 




410 










415 




Ser 


Pro 


Thr 


Asn 


Asn 


Arg 


Asp 


Arg 


425 










430 






Gin 


He 


Glu 


Glu 


Val 


Thr 


He 


Phe 










445 








Lys 


Leu 


Ala 


Glu 


Ser 


Val 


Val 


Lys 
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4 SO 

"i O \J 


































VOX 




± 1c 


Asn 




Asn 


Cys 


Leu 


Asn 


Leu 


rne 




i nr 


bin 


465 










470 










475 










480 


Hi c 

nis 


Arg 


Lys 


Lys 


Tyr 
485 




Leu 


rne 


Asn 


i JLe 
490 


Leu 


OK a 

rrne 


Cys 


Pro 


fro 
495 


Leu 


Vol 


-Li 1 




Veil 

500 


tjj.y 


.rne 


Asn 


Lys 


rne 
505 


Arg 


Ti- 
ne 


Leu 


inr 


biy 
510 


va jl 


e a v 

5er 


Cys 


rue 


rfie 
515 


Asp 


Arg 


lie 


CalU 


Ti- 
ne 

520 


vax 


Pne 


Ser 


Asp 


Gin 
525 


O A W 

Ser 


Asp 


Ser 


Val 


Val 
530 


Leu 


Ser 


Asn 


Asn 


Ala 
535 


Ala 


His 


Ser 


Ala 


lie 
540 


Leu 


Arg 


Leu 


Leu 


Ser 


Tyr 


lie 


Arg 


Glu 


Asn 


Ser 


Leu 


Lys 


Arg 


Ser 


Val 


Arg 


Thr 


Ala 


Ser 


545 










550 










555 










560 


Val 


Lys 


Gly 


lie 


Asp 
565 


Phe 


Val 


Val 


Lys 


Ser 
570 


Gin 


Asp 


Thr 


Asn 


He 
575 


Gly 


lie 


Pro 


Leu 


Ser 
580 


Asn 


Lys 


Glu 


lie 


Arg 
585 


Glu 


Arg 


Gin 


Leu 


Cys 
590 


Ser 


Ala 


Ser 


Thr 


Leu 
595 


Ser 


Met 


Leu 


Ala 


Gly 
600 


Leu 


Gly 


Lys 













<210> 108 
<211> 2403 
<212> DNA 
<213> SHRIMP 



<400> 108 

atgctcagct 

gtcattttac 

ggaataatga 

gtggaagaaa 

aaggaactaa 

accggttcgg 

attagatccc 

ttgcccacca 

acaagagaga 

atgcacctca 

atgtcagaga 

tggaaaatta 

agagtagaaa 

ggcatggatt 

tcttctactt 

acgggcattg 

ctatcggaat 

gtagaacctt 

tggttacctc 

gaagaagaaa 

actgattttt 

tgcaaaataa 

aaatatgatt 

gttgccctca 

caaacccctt 

actgacgtgt 

gaaaatgaaa 

ggtaattctg 

cagtacttta 

gccccgtcat 

agacctagaa 

aggtgcatat 

acagaacacg 

aaccaatacc 



tcaaccctga 
ccgtttccaa 
cactggaccc 
gactagctca 
ataatagtag 
caatcttcga 
ccgctttgaa 
taaccaaatc 
agatagtggt 
cagacggaga 
ccaatgcttc 
tgcctaacaa 
cattagtgga 
tagtggatat 
ccaataatac 
gatccttgtc 
ataggactgg 
caatagacgg 
ctctagttaa 
acggggacaa 
ggtctaatgt 
acgatcgggc 
ggacaaataa 
tgacctttag 
ctcagggaag 
acggttttga 
ccgggattga 
tcagggtcgt 
cagaaatggc 
ctgtaaagga 
aagataacaa 
accacactga 
tatctttcaa 
aaacaaacta 



gtatgctagc 
ggatgttgta 
tcacgatttg 
atacaatact 
aaatatgaaa 
agacaacatg 
ttctgctgta 
tctaaggaga 
ggattctttc 
gatgacgtac 
gtccatctac 
tttcaataaa 
gtttgcgtgc 
gcaaagaact 
tcctagagta 
cattcttgca 
attatccatt 
cgttcaagca 
aaggccagaa 
taaaccttct 
ccccgtcaca 
aatgattagt 
aactacaaag 
aaaattccaa 
tgaatacgca 
atcgaatgga 
aagcatatac 
cagactggaa 
catacctcct 
gtacaagtat 
cgacggcggt 
acgttgttta 
caaggatttg 
caaaagcata 



tggtttggga 
ctctttggat 
gatataaaaa 
ctacctatgg 
gagtcaatat 
ttcaacggag 
ttttcaagca 
agtcaagcta 
agcatactta 
gtccccgatg 
ttgatcataa 
acacttgaaa 
aagtgcgtcg 
ataaggtcta 
gcgataatga 
gaagatggat 
actgaaaata 
gagcatcctc 
gtggtagcag 
gataaagata 
cctctaatta 
agttggaaaa 
gttgattatt 
gacatactag 
gtgaccatgt 
aataagccat 
gtcctaaata 
aaggaaatga 
attaatgaaa 
ttttgtgatc 
gtggagcatt 
gtccatcttt 
aacatattag 
ttcgaaattg 



aaatgatcac 
cacgaggaca 
tcacatctaa 
attttacacg 
tcacgggaat 
gaggttcagc 
agaactacat 
gagataagca 
gtgctatagc 
ggcactgcgt 
acgaccccac 
tgagagacgg 
catcatcctt 
tggatttcct 
catctggaag 
caacacacca 
atagagaggt 
tatccccttc 
cagcagcagc 
acgaagacaa 
cacccaagaa 
ataatctagt 
ttgataagat 
cggataacta 
ctaacgtggc 
tgtttgccct 
tcataggaaa 
gtttcctctt 
aatgcaaatg 
taacagcacc 
ctgcgttgac 
actctgagcc 
aaattggaaa 
tggacgttcc 



cgacccgggg 
atcagatgtg 
acgcataggt 
ggcaatggaa 
atttttagac 
tttgcgctta 
catcaaacaa 
agtggataaa 
tgctcaagta 
taatgttgtc 
tggttcggga 
tgtaatagat 
gattaaaagg 
ccctccagct 
tagtactact 
ccaaatcaag 
gtcttttacg 
tattcttcag 
agcagtagta 
gtacagtgat 
atggagagcg 
gaaactccac 
ggctgccttt 
tgttcctcct 
tacactcttt 
agaacagcta 
ttcccctgat 
gaaggcgaag 
gacagataag 
catttcaaag 
ttatacacct 
agaaaaaata 
aaatattacc 
cataattgtc 
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gcctctatgt 
gccacgacca 
gaggtcagaa 
aagcgcaaca 
ggtgggcgtt 
aagcgcgcac 
taa 



catcaacaaa 
agtttgttca 
actttaaact 
caactctttg 
tttcatgtcc 
tggaagaaat 



aacaatgact 
ggatccgcca 
caaatctgtt 
ttcacaaata 
atcagagtca 
tgccccagag 



gtaaacaact 
aaaacaggga 
cttgttcctc 
actgaacaaa 
cttattctca 
actgagacta 



acataatttc 
aacaacttct 
ctccttattt 
attgcccgtc 
agtactctaa 
gcattttgtc 



aacaccttct 
ggcagttgaa 
tagggacaat 
atcttctgaa 
tctctctaaa 
actagccatg 
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<210> 109 
<211> 794 
<212> PRT 
<213> SHRIMP 
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atgggagtac 
gttgttacac 
tcttcctatt 
agtggaacca 
aattccatat 
cctttgggct 
catgaaacaa 
attctaccta 
cagaagtgtg 
caaggggaat 
gcctaccttt 
gaagataata 
tattttgtca 
tcggacatcc 
gacctcctaa 
aatcccggtg 
gcgctacctt 
acaacaacaa 
gattttgatt 
ttcattagat 
tccatcgcca 
tgggttgggg 
tttacttctg 
gaaactctta 
aagaaaattg 
ggtattgttt 
ataaaaatgc 



agaagaatat 
ttctaggaac 
cctttactcc 
agaaggacac 
tcagacatga 
tgaaccttat 
cagaacctga 
aaaaatggtg 
ggtgcgggca 
ggcttgctgc 
tcataagcct 
tgatgatggg 
acgatttttc 
tgaacaagag 
ctaaacatca 
ttgaggagga 
acattaagaa 
ccactactac 
acgagtcttt 
ttcttgatca 
actttaaaat 
atgaacatac 
cacctattat 
gaagcaaaat 
gggaagattg 
gtataacagt 
ctaaattcta 



cctggtcggt 
agtaacagaa 
tgaatcgagt 
cttaatcaac 
ctgttctgaa 
tttctgttca 
cgatttattg 
gtctgatgtt 
ggcatttgtt 
tcacactaac 
tcaacgaact 
aagaatgtca 
agacgatata 
ggaagaatgg 
actgggggag 
agaagaagag 
aagtggaaaa 
tactactact 
taatgaacca 
aaaaaattgg 
tgaacaagag 
ccctaaattc 
tgtccagaat 
tgttgatcgg 
ggctaataaa 
aacagttatt 
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ggtggtggtg 
ggagcaccgg 
gttttctggg 
gttctgggga 
actcgttcta 
tgtgcgagtc 
tgggacggat 
gtgtggacat 
tcttccttta 
ggaaagacat 
acactcaagc 
ggcacaccca 
ggaagtactc 
atagctgttt 
cgggaatatg 
agagtggaag 
cttatcggac 
actaatccta 
gaaatctttg 
agacttggta 
tcatcaaaag 
agacttagtg 
gtaggttatt 
tcaagggacc 
aaacaaactg 
tctatatttt 



gtgtatcatt 
cagtcccacc 
tcgaaggaaa 
agaagattcc 
ttcaatggcc 
atgaacatca 
caagaaaaac 
ctttatggag 
cttctacgca 
ctgagggtga 
ctatcatcac 
tgaaccctaa 
ctcagtgtct 
ggggtgttgc 
ggagcgaagg 
aagaagaaga 
ctcgtagaag 
ttgttagaga 
gcagtaactc 
tcatgagcag 
ccttattttg 
tatggaagaa 
cctctgatgt 
tgatagaaac 
tagttgctat 
caattgtaat 



gttattgggt 
tttttcatcg 
tcgtgtttta 
ttattatgct 
agaaacttcc 
acaccgtact 
taccaccata 
ggataacgac 
gaaagaagtc 
tactaattca 
tgacgtcaca 
ggatatgaca 
tgtatcgaat 
agactctaaa 
aagaagaaga 
agtagaagtt 
acctttgaca 
ggttgtggaa 
aaaacttccc 
agtttcttct 
tttggcagtc 
ctggaagcct 
tttctggcat 
aaaagtgaca 
gtttatttca 
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490 

Val He Ser He 
505 

Lys Phe 



Met 


Thr 


Tvr 


Phe 








240 


Gin 


Cys 


Leu 


Val 






255 




Tip 
X Ic 


Ala 


Val 






270 






Gin 


Leu 


Gly 


Glu 


285 








tj-Ly 


val 


/Jin 


OX Li 


wX U 


Val 


Pro 


Tyr 










Arg 


Pro 


Leu 


Thr 






jJj 




rtO 


Tip 
X X G 


V Ol 












Glu 


Pro 


Glu 


He 


365 








Leu 




Gin 

O XI 1 


Lys 


oer 




rt Xcl 


Asn 








& no 


Cys 


Leu 


rtXa 


VOX 






A 1 ^ 




Ser 


Val 


Trp 


Lys 




430 






Gin 


Asn 


Val 


Gly 


445 








Ser 


Lys 


He 


Val 


Lys 


Lys 


He 


Gly 








480 


Met 


Phe 


He 


Ser 






495 




Phe 


Ser 


He 


Val 




510 







<210> 112 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 112 

atgtatatct 

atagatacag 

tactacaact 

agagtggtgg 

gagcaaggaa 

tacatcgcag 

gaagctgtca 

tggatggctg 

agagtagtct 

tataactttg 

aggcatagga 



tcgtcgaagg 
gatcatgtgg 
ggcctgccga 
atggaatgtt 
aagagagttt 
aatggttgag 
agagtggatt 
aaattattag 
tcaatggcaa 
gaacaaacta 
ttgtacattt 



ttcccccctc 
aatgtctttc 
aatcgggaca 
tgaacctgtc 
gaaggaatat 
acaaaggcca 
cttgaacgag 
agagaagagg 
tgtttgtagt 
taacaatgtt 
cttataa 



acagggaaga 
ctcaattttc 
gaacatctcc 
ctaaagacct 
ctggactaca 
ctagccttcc 
gaggatctag 
ggcaatattc 
gcatgtttct 
gtacattgtg 



gttcatggat 
ttcgtatgaa 
agttaggttt 
ttgtcgactc 
acggccaagt 
acgtgtttac 
atatggatac 
aagaaataaa 
ctaacactaa 
atttgttgtg 



gtccaagttg 
cacttctgac 
cagagaaacc 
gtggaagaaa 
catggagatc 
ctatacagat 
tgcaaccaag 
agtgacccct 
gagaaacttg 
cccttttgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 
<211> 



113 
208 
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<212> PRT 
<213> SHRIMP 



<400> 1X3 





Tyr 


lie 


rile 


Vol 


UlU 


uiy 


Ser 


Mr X W 




Thr 


Gly 


Lys 


Ser 


Ser 


X X7XJ 


1 








5 










10 










15 




Mot* 
net 


Ser 


Lys 


Leu 


x xe 


Asp 


XIII 


n ^ \j 
vjt j. y 


Q»l- 


Cys 


Gly Met 


Cay 
OCX 


Php 

rue 


Leu 


noli 




























30 






PV.£J 


L6U 


35 


net 


Asn 


1 XIX 




ASp 
40 


Tyr 


Tyr 


Asn 


Trp 


tr x. \j 

45 


nld 


OIU 


T 1** 
x ie 




inr 

50 






Leu 


uin 


Leu 
55 


«iy 


fne 


Arg 


Glu 


Thr 

DU 


Arg 


V a.1 


val 


Asp 


/*i 
uiy 




rile 




fro 


Vol 


Leu 


Lys 


X 11 X 




Val 


Asp 




Trp 


Lys 


Lys 


65 










70 










75 










80 


Vj±U 


Gin 


tiJ.y 


Lys 


f*i ii 
85 


oer 


Leu 


Lys 


IjX U 


Tyr 
90 


Leu 


Asp 


Tyr 


Asn 


biy 
95 


bin 


Val 


Met 


Glu 


He 
100 


Tyr 


He 


Ala 


Glu 


Trp 
105 


Leu 


Arg 


Gin 


Arg 


Pro 
110 


Leu 


Ala 


Phe 


His 


Val 
115 


Phe 


Thr 


Tyr 


Thr 


Asp 
120 


Glu 


Ala 


Val 


Lys 


Ser 
125 


Gly 


Phe 


Leu 


Asn 


Glu 
130 


Glu 


Asp 


Leu 


Asp 


Met 
135 


Asp 


Thr 


Ala 


Thr 


Lys 
140 


Trp 


Met 


Ala 


Glu 


lie 


lie 


Arg 


Glu 


Lys 


Arg 


Gly 


Asn 


He 


Gin 


Glu 


He 


Lys 


Val 


Thr 


Pro 


145 










150 










155 










160 


Arg 


Val 


Val 


Phe 


Asn 
165 


Gly 


Asn 


Val 


Cys 


Ser 
170 


Ala 


Cys 


Phe 


Ser 


Asn 
175 


Thr 


Lys 


Arg 


Asn 


Leu 
180 


Tyr 


Asn 


Phe 


Gly 


Thr 
185 


Asn 


Tyr 


Asn 


Asn 


Val 
190 


Val 


His 


Cys 


Asp 


Leu 
195 


Leu 


Cys 


Pro 


Phe 


Ala 
200 


Arg 


His 


Arg 


He 


Val 
205 


His 


Phe 


Leu 



<210> 114 
<211> 375 
<212> DNA 
<213> SHRIMP 

<400> 114 

atgtggagat cttgtattag caacattaga gaaatggggg acaataaaga ttatgaaaca 60 

agactgattc aaagaattaa tgatttggaa tcagaaattg aaaataaaac tgaattatgt 120 

gaaaaaatca atgagcagat gaaaaataca caactaaaat atgataaatg tttegtagag 180 

gaggagaegg aaaaattccg caagatggag gaaagagtta aatacctcaa agagcaggga 240 

atccctctag acccagaaga aagaegtaca atgttggctg aaattgacaa gagtaacaaa 300 

gagttagatg cccttcttga ggaaaatgaa cgtataataa agctcattga tgaagagttg 360 

gaaagtatga aataa 375 

<210> 115 
<211> 124 
<212> PRT 
<213> SHRIMP 

<400> 115 
Met Trp Arg Ser 
1 

Asp Tyr Glu Thr 
20 

He Glu Asn Lys 
35 

Asn Thr Gin Leu 
50 

Lys Phe Arg Lys 



Cys He 
5 

Arg Leu 
Thr Glu 
Lys Tyr 
Met Glu 



Ser Asn 

He Gin 

Leu Cys 
40 

Asp Lys 
55 

Glu Arg 



He Arg 

10 
Arg He 
25 

Glu Lys 
Cys Phe 
Val Lys 



Glu Met 

Asn Asp 

He Asn 

Val Glu 
60 

Tyr Leu 



Gly Asp 

Leu Glu 

30 
Glu Gin 
45 

Glu Glu 
Lys Glu 



Asn Lys 
15 

Ser Glu 
Met Lys 
Thr Glu 
Gin Gly 
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65 

lie Pro Leu Asp Pro 
85 

Lys Ser Asn Lys Glu 
100 

lie Lys Leu lie Asp 
115 



70 

Glu Glu Arg Arg Thr 
90 

Leu Asp Ala Leu Leu 
105 

Glu Glu Leu Glu Ser 
120 



75 80 
Met Leu Ala Glu lie Asp 
95 

Glu Glu Asn Glu Arg lie 
110 

Met Lys 



<210> 116 
<211> 252 
<212> DNA 
<213> SHRIMP 



<400> 116 

atgcaaaaaa aatatgataa attatttgaa 
cgaatccttc aacaaaaaga gaagggaaac 
tcggctgata ttgataggag tatgaaagag 
atagaattat ccattgatac attattggat 
acaactaaat aa 



gatgataaaa ggttccgaga aatagaggaa 60 
cctctagacc cagaagaaag acttgtattg 120 
attgatgatt gtctcgagga aataaaccat 180 
gaatgtgaaa acttgcatta tggtcttcaa 240 

252 



<210> 117 
<211> 83 
<212> PRT 
<213> SHRIMP 



<400> 117 












Met 


Gin 


Lys 


Lys 


Tyr 


Asp 


Lys 


Leu 


1 








5 








Glu 


He 


Glu 


Glu 
20 


Arg 


He 


Leu 


Gin 


Asp 


Pro 


Glu 


Glu Arg 


Leu 


val 


Leu 






35 










40 


Lys 


Glu 
50 


He 


Asp 


Asp 


Cys 


Leu 
55 


Glu 


lie 


Asp 


Thr 


Leu 


Leu 


Asp 


Glu Cys 


65 










70 






Thr 


Thr 


Lys 













Phe Glu Asp Asp Lys Arg Phe Arg 

10 15 
Gin Lys Glu Lys Gly Asn Pro Leu 
25 30 
Ser Ala Asp He Asp Arg Ser Met 
45 

Glu lie Asn His He Glu Leu Ser 
60 

Glu Asn Leu His Tyr Gly Leu Gin 
75 80 



<210> 118 
<211> 2253 
<212> DNA 
<213> SHRIMP 



<400> 118 

atggaaaaaa 

aaaaggtcca 

tacctttctt 

aggtttgagg 

aggttagtag 

tggccagata 

aagaaattga 

gtccaagaaa 

atggatttca 

ctacgcaccc 

gggttcgata 

agtgattggg 

actcttcata 

tcacctgaag 



agactgagac 
gaaataaaga 
ccataatcaa 
agaggtgtaa 
tggacgagtg 
tgacgaagga 
aagaaggatc 
cattcctatc 
ataccatcac 
atttccggac 
aaaagtatac 
tggacagtgg 
ggaaacattt 
agaaatatcc 



ggctgcaaca 
acccaaaaca 
gagtgaaagt 
atccgtaagc 
tctgagacgt 
taacttttac 
tacacatcct 
ccagcaagga 
ctacgccgcc 
acgacaacag 
gaaactcgta 
tgatttggaa 
atgtgttatt 
gataccctca 



acagaaaaag 
acttctactg 
agtagaagta 
aagatgatgg 
tacaaccatc 
gtccaattgt 
gttgtagaag 
gaaggaaata 
aaacaactaa 
agggccatat 
caacactgga 
agggtaaaag 
agtgataaga 
atactaaatt 



acccagaacc 
tttacacttc 
atgtcacctc 
tcaaaggttc 
tagaagacga 
tgaggaaggg 
atgtttggaa 
atcccataaa 
aaacttgctt 
ctggatggtt 
taattggatg 
aaggaacgaa 
agaatggtac 
attacaagtt 



gtctgtcagt 
tgtaaagtgt 
aaccaaagaa 
actgtttttg 
aatcgataaa 
tttagacaag 
ttcccccatc 
gagacatctc 
cgaaacaaac 
agctgaaaac 
tacctacaag 
aaatttcgtg 
aatttcctat 
tctacaaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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gagtatccac 
atacactatt 
tggaaggata 
tacctattat 
atccagacgg 
actgtttcca 
tacccgccca 
tcctgttcgg 
aaacaatact 
aagaacaata 
gaaattataa 
ttgaaggtag 
aggaaaaatg 
gaattaatcc 
tttgcctgca 
ataaaggaaa 
aaagtatgcc 
aataagaaaa 
gacgaagaaa 
agatgctcgt 
gatatctcta 
tccattacat 
attgttcata 
ccgttgaagt 



aaaacaagaa 
gtacgtttga 
tggaagaacg 
tcgacgttaa 
acggcgaagg 
agaaaagtaa 
ctaatgccaa 
tatttgatac 
atcaagaatc 
aagagtacaa 
atgatggtaa 
tgaacgagca 
ccatgaaagt 
ctaaaaggga 
cgggaagagg 
gagtcggagg 
atcgctgcaa 
gaaaaccgcc 
atggaaaatg 
ccgaaaagac 
tggaaacgcc 
gtgtcttggg 
aatttctagg 
atcactttat 



aatacagaaa 
ccaaacgacc 
acacaaacaa 
aaagattaag 
cgtctctgta 
gaaaaataaa 
gtacgtagtg 
ccgtcaaaaa 
ttggatgaca 
agaggcgtta 
cggtgatact 
ctacaggtta 
atactctaga 
taaaattgaa 
tgagcaatac 
tgataagagg 
tcaaccttta 
gacaatagta 
gaagaaggct 
gcaattcggt 
agtaccttct 
aggaaaattc 
gttttgggat 
tcgagttgcc 



atgatagttg 
attcaaggaa 
tcagaagata 
aaattgcgtc 
ctattttcca 
aaacctagag 
ggtgtagatc 
agggtagtga 
gatagaagaa 
gaggaaataa 
tctacaccca 
ctgtggaacg 
aaacaaaagt 
gattaccaca 
gcatcacctg 
tttactttcg 
aatatgctgg 
acaaccacaa 
acacctctca 
tacagttcaa 
tccacttcct 
gtcgacaggg 
aaaaagttaa 
tga 



tcccaaaaca 
tttgtaaaga 
tactttacaa 
caaactggaa 
gagaagtgga 
gagatgagga 
cgggaagaac 
gaaaacacag 
aggcaaacga 
ctaggtacga 
ctaaaaaatt 
aaaagggaaa 
gcatatctaa 
ttgcttttgg 
ccaggatttt 
tggacgagaa 
agaaggattg 
caacaacaac 
gagaaaatag 
accgaaaagt 
cttccttttg 
acttcaatgc 
tggaaaagaa 



caaactaaag 
tttgggagtg 
gcaaggatgg 
ctttcactct 
agaagtagag 
taggagaaat 
taatgtcgtt 
aatgactgcg 
aacgtacaag 
taatggcgaa 
cgaagcttac 
gaaaaagtac 
ctttatagat 
ggatgcgaaa 
cgccaagaag 
atatacgtca 
tttttcaccg 
aacagaagaa 
agataccaga 
atcgacagga 
tactcctact 
aagcaccaat 
agacaagatg 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2253 



<210> 119 
<211> 746 
<212> PRT 
<213> SHRIMP 



<400> 119 



Met 


Glu 


Lys 


Lys 


Thr 


Glu 


Thr 


Ala 


Ala 


Thr 


Thr 


Glu 


Lys 


Asp 


Pro 


Glu 


1 








5 










10 










15 




Pro 


Ser 


Val 


Ser 


Lys 


Arg 


Ser 


Arg 


Asn 


Lys 


Glu 


Pro 


Lys 


Thr 


Thr 


Ser 








20 










25 










30 






Thr 


Val 


Tyr 


Thr 


Ser 


Val 


Lys 


Cys 


Tyr 


Leu 


Ser 


Ser 


lie 


He 


Lys 


Ser 






35 










40 










45 








Glu 


Ser 


Ser 


Arg 


Ser 


Asn 


Val 


Thr 


Ser 


Thr 


Lys 


Glu 


Arg 


Phe 


Glu 


Glu 




50 










55 










60 










Arg 


Cys 


Lys 


Ser 


Val 


Ser 


Lys 


Met 


Met 


Val 


Lys 


Gly 


Ser 


Leu 


Phe 


Leu 


65 










70 










75 










80 


Arg 


Leu 


Val 


Val 


Asp 


Glu 


Cys 


Leu 


Arg 


Arg 


Tyr 


Asn 


His 


Leu 


Glu 


Asp 










85 










90 










95 




Glu 


lie 


Asp 


Lys 


Trp 


Pro 


Asp 


Met 


Thr 


Lys 


Asp 


Asn 


Phe 


Tyr 


Val 


Gin 








100 










105 










110 






Leu 


Leu 


Arg 


Lys 


Gly 


Leu 


Asp 


Lys 


Lys 


Lys 


Leu 


Lys 


Glu 


Gly 


Ser 


Thr 






115 










120 










125 








His 


Pro 


Val 


Val 


Glu 


Asp 


Val 


Trp 


Asn 


Ser 


Pro 


He 


Val 


Gin 


Glu 


Thr 




130 










135 










140 










Phe 


Leu 


Ser 


Gin 


Gin 


Gly 


Glu 


Gly 


Asn 


Asn 


Pro 


He 


Lys 


Arg 


His 


Leu 


145 










150 










155 










160 


Met 


Asp 


Phe 


Asn 


Thr 


lie 


Thr 


Tyr 


Ala 


Ala 


Lys 


Gin 


Leu 


Lys 


Thr 


Cys 








165 










170 










175 




Phe 


Glu 


Thr 


Asn 


Leu 


Arg 


Thr 


His 


Phe 


Arg 


Thr 


Arg 


Gin 


Gin 


Arg 


Ala 








180 










185 










190 






lie 


Ser 


Gly 


Trp 


Leu 


Ala 


Glu 


Asn 


Gly 


Phe 


Asp 


Lys 


Lys 


Tyr 


Thr 


Lys 






195 










200 










205 








Leu 


Val 


Gin 


His 


Trp 


lie 


He 


Gly 


Cys 


Thr 


Tyr 


Lys 


Ser 


Asp 


Trp 


Val 




210 










215 










220 










Asp 


Ser 


Gly 


Asp 


Leu 


Glu 


Arg 


Val 


Lys 


Glu 


Gly 


Thr 


Lys 


Asn 


Phe 


Val 


225 










230 










235 










240 
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Thr 


Leu 


His 


Arg 


Lys 


His 


Leu 


Cys 










1 A *n 








Thr 


lie 


Ser 


Tyr 


Ser 


Pro 


Glu 


Glu 








2 oU 










Asn 


Tyr 


Tyr 


Lys 


Phe 


Leu 


Gin 


Thr 






275 










o o t\ 


Gin 


Lys 


Met 


lie 


Val 


Val 


Pro 


Lys 




O fi A 










one 




Thr 


Phe 


Asp 


Gin 


Thr 


Thr 


lie 


Gin 


305 










310 






Trp 


Lys 


Asp 


Met 


Glu 


Glu 


Arg 


His 










325 








Lys 


Gin 


Gly 


Trp 


Tyr 


Leu 


Leu 


Phe 








34 0 










Arg 


Pro 


Asn 


Trp 


Asn 


Phe 


His 


Ser 






355 










■a c rt 


Ser 


Val 


Leu 


Phe 


Ser 


Arg 


Glu 


Val 




370 










375 




Lys 


Ser 


Lys 


Lys 


Asn 


Lys 


Lys 


Pro 


385 










"i r\ r\ 






Tyr 


Pro 


Pro 


Thr 


Asn 


Ala 


Lys 


Tyr 










405 








Thr 


Asn 


Val 


Val 


Ser 


Cys 


Ser 


Val 








420 










Val 


Arg 


Lys 


His 


Arg 


Met 


Thr 


Ala 






435 










440 


Met 


Thr 


Asp 


Arg 


Arg 


Lys 


Ala 


Asn 




450 










45b 




Glu 


Tyr 


Lys 


Glu 


Ala 


Leu 


Glu 


Glu 


465 










470 






Glu 


lie 


lie 


Asn 


Asp 


Gly 


Asn 


Gly 










485 








Phe 


Glu 


Ala 


Tyr 


Leu 


Lys 


Val 


Val 








500 










Asn 


Glu 


Lys 


Gly 


Lys 


Lys 


Lys 


Tyr 






515 










520 


Ser 


Arg 


Lys 


Gin 


Lys 


Cys 


lie 


Ser 




530 










535 




Lys 


Arg 


Asp 


Lys 


lie 


Glu 


Asp 


Tyr 


545 










550 






Phe 


Ala 


Cys 


Thr 


Gly 


Arg 


Gly 


Glu 










565 








Lys 


Lys 


lie 


Lys 


Glu 


Arg 


Val 


Gly 








580 










Asp 


Glu 


Lys 


Tyr 


Thr 


Ser 


Lys 


Val 






595 










600 


Asn 


Met 


Leu 


Glu 


Lys 


Asp 


Cys 


Phe 




610 










615 




Pro 


Thr 


lie 


Val 


Thr 


Thr 


Thr 


Thr 


c o c 

625 










/Tin 






Glu 


Asn 


Gly 


Lys 


Trp 


Lys 


Lys 


Ala 










645 








Thr 


Arg 


Arg 


Cys 


Ser 


Ser 


Glu 


Lys 








660 










Arg 


Lys 


Val 


Ser 


Thr 


Gly 


Asp 


He 






675 










680 


Ser 


Ser 


Ser 


Phe 


Cys 


Thr 


Pro 


Thr 




690 










695 




Lys 


Phe 


Val 


Asp 


Arg 


Asp 


Phe 


Asn 


705 










710 






Phe 


Leu 


Gly 


Phe 


Trp 


Asp 


Lys 


Lys 



vai 


xie 


oer 


Asp 


Lys 


T \7 c 

Lys 


Asn 


oi y 














c, U ~J 




Lys 


Tyr 


Pro 


lie 


Fro 


Ser 


i le 


Leu 


o c c 
ZOO 










on rt 
d * u 






Glu 


Tyr 


Pro 


Gin 


Asn 


Lys 


Lys 


ne 










Zoo 








His 


Lys 


Leu 


Lys 


lie 


His 


Tyr 


Cys 


















Gly 


He 


Cys 


Lys 


Asp 


Leu 


Gly 


vai 






315 












Lys 


Gin 


Ser 


Glu 


Asp 


lie 


Leu 


Tyr 




330 










Tic 
jjj 




Asp 


Val 


Lys 


Lys 


lie 


Lys 


Lys 


Leu 


*3 A C 
















He 


Gin 


Thr 


Asp 


Gly 


Glu 


Gly 


Val 










ICC 








Glu 


Glu 


Val 


Glu 


Thr 


Val 


Ser 


Lys 








380 










Arg 


Gly Asp 


Glu 


Asp 


Arg 


Arg 


Asn 






395 












Val 


Val 


Gly Val 


Asp 


Pro 


Gly 


Arg 




410 










4 id 




Phe 


Asp 


Thr 


Arg 


Gin 


Lys 


Arg 


Val 


425 










4oU 






Lys 


Gin 


Tyr 


Tyr 


Gin 


Glu 


Ser 


Trp 










44 5 








Glu 


Thr 


Tyr 


Lys 


Lys 


Asn 


Asn 


Lys 








4 60 










lie 


Thr 


Arg 


Tyr 


Asp 


Asn 


Gly 


Glu 






475 










4o0 


Asp 


Thr 


Ser 


Thr 


Pro 


Thr 


Lys 


Lys 




490 










495 




Asn 


Glu 


His 


Tyr 


Arg 


Leu 


Leu 


Trp 


505 










510 






Arg 


Lys 


Asn 


Ala 


Met 


Lys 


Val 


Tyr 










525 








Asn 


Phe 


He 


Asp 


Glu 


Leu 


He 


Pro 








540 










His 


He 


Ala 


Phe 


Gly 


Asp 


Ala 


Lys 






555 










C C A 


Gin 


Tyr 


Asp 


Ala 


Arg 


He 


Phe 


Ala 




570 










C "7 C 




Gly 


Asp 


Lys 


Arg 


Phe 


Thr 


Phe 


Val 


585 










590 






Cys 


His 


Arg 


Cys 


Asn 


Gin 


Pro 


Leu 










605 








Ser 


Pro 


Asn 


Lys 


Lys 


Arg 


Lys 


Pro 








620 










Thr 


Thr 


Thr 


Thr 


Glu 


Glu 


Asp 


Glu 






635 










On U 


Thr 


Pro 


Leu 


Arg 


Glu 


Asn 


Arg 


Asp 




650 










655 




Thr 


Gin 


Phe 


Gly 


Tyr 


Ser 


Ser 


Asn 


665 










670 






Ser 


Met 


Pro 


Val 


Pro 


Ser 


Ser 


Thr 










685 








Ser 


He 


Thr 


Cys 


Val 


Leu 


Gly 


Gly 








700 










Ala 


Ser 


Thr 


Asn 


He 


Val 


His 


Lys 






715 










720 


Leu 


Met 


Glu 


Lys 


Lys 


Asp 


Lys 


Met 
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725 730 
Pro Leu Lys Tyr His Phe lie Arg Val Ala 
740 745 



735 



<210> 120 
<211> 411 
<212> DNA 
<213> SHRIMP 



<400> 120 

atggggaata 

aaaaggtctc 

aagcagatcg 

ctcgacaaaa 

gacccagaag 

ggtttcatcg 

gaagaaaagg 



gtgaatctcg 
ataaaactct 
aagagttaca 
acgaagagcg 
aaagacgtgc 
aggaaagtga 
aggttgaaga 



ttctagtggt 
ttacttgtca 
tcacaaaaca 
aaagaagcat 
aatattggct 
acgtctaggt 
ggaacatctt 



attgaaattg 
aatagaactg 
aataagcaat 
cagcaacagc 
gaaatcgata 
ttacttgtag 
ttgaaacaaa 



tacataaaaa 
aaagacatgc 
ttgaacaggc 
aacaaataat 
aacacatgaa 
atgcagaaat 
aagaagacta 



tggtgcacca 60 
ccagatacaa 120 
tcaaaaggtc 180 
aattcctcta 240 
agagattgat 300 
caataacttg 360 
a 411 



<210> 121 
<211> 134 
<212> PRT 
<213> SHRIMP 

<400> 121 



Met 


Gly 


Asn 


Ser 


Glu 


Ser 


Arg 


Ser 


Ser 


Gly 


He 


Glu 


He 


Val 


His 


Lys 


1 








5 










10 










15 




Asn 


Gly 


Ala 


Pro 
20 


Lys 


Arg 


Ser 


His 


Lys 
25 


Thr 


Leu 


Tyr 


Leu 


Ser 
30 


Asn 


Arg 


Thr 


Glu 


Arg 
35 


His 


Ala 


Gin 


He 


Gin 
40 


Lys 


Gin 


He 


Glu 


Glu 
45 


Leu 


His 


His 


Lys 


Thr 


Asn 


Lys 


Gin 


Phe 


Glu 


Gin 


Ala 


Gin 


Lys 


Val 


Leu 


Asp 


Lys 


Asn 




50 










55 










60 






Glu 


Glu 


Arg 


Lys 


Lys 


His 


Gin 


Gin 


Gin 


Gin 


Gin 


He 


He 


He 


Pro 


Leu 


65 










70 










75 










80 


Asp 


Pro 


Glu 


Glu 


Arg 

85 


Arg 


Ala 


He 


Leu 


Ala 
90 


Glu 


He 


Asp 


Lys 


His 
95 


Met 


Lys 


Glu 


lie 


Asp 
100 


Gly 


Phe 


He 


Glu 


Glu 
105 


Ser 


Leu 


Gly 


Leu 


Leu 
110 


Val 


Asp 


Ala 


Glu 


He 
115 


Asn 


Asn 


Leu 


Glu 


Glu 
120 


Lys 


Glu 


Val 


Glu 


Glu 
125 


Glu 


His 


Leu 


Leu 


Lys 
130 


Gin 


Lys 


Glu 


Asp 























<210> 122 
<211> 2772 
<212> DNA 
<213> SHRIMP 



<400> 122 

atgtcctcct 

aaggctctgg 

agaaaccaga 

agaaatggag 

agaatagtgg 

tattttgtca 

cagggcgttc 

gctcccctga 

cttcccccca 

aatgaattca 



cctcgtcgtc 
ctcacccaga 
agtgccccat 
gaaatctgtt 
gaagtcttca 
aggatctgta 
tggacaagaa 
tcccaataac 
gatctaaagc 
gtaacaactg 



gtcgttctcg 
tctggtggac 
ccagt ttctg 
ccctttgcac 
cgggagaaca 
taatggagta 
aatatctgga 
tgatgtggat 
gtaccgagtc 
gcctcctaca 



ttccgcatct 
aagattaccc 
gccgacatct 
ccgttcaaga 
ttggacaatg 
ttttcctatg 
tctggatctg 
ttgctgtaca 
atcactgaag 
aatatcaaag 



ccacctacca 
agaaatgtga 
cgcacctgat 
accaaccaca 
acattgaaga 
tgaacggcgt 
gagaatcctc 
ttttcggtac 
ctgttctagc 
gagcatacgt 



gacctttctc 
cgagacagga 
ccaaggagaa 
tctggaacca 
atcatactgt 
caaggagtta 
ctcttctaga 
tttggtagtt 
actccccttc 
gtctagagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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ttcaggatgt ttaatctgtt 
agtgaatggg aatccataca 
aaagctgaga agaaacctcc 
gttaatgatg cagtgacaaa 
ctcgccggta ttctccagaa 
tcttcaaatt ctatagacat 
gcaagaaaat tgggattcta 
caagatgttg ccgactgcat 
gaaaaaatac gacagtgcat 
tgtgaaattg acaagacgcc 
ttccccgtcc acgcattctc 
tccgaatgta tattcccgta 
gtaaaagttg tagatttgtt 
ggaagatgtt acgagttttc 
cattgttctt ctctctttga 
gagtttctcc agtatgattt 
ccccatgttt acaataaggt 
tactgttaca aatgcgtatc 
gaaatggtgg atgtaactcc 
tatgatgatg gtcccggaaa 
ggttccagtg gagaaacacc 
agagcaccga ggaagaaaat 
atagcgctag ccaactggag 
gaaggggtaa atgccgtact 
ccctgtgtta ttggtagatc 
aaaattgctt taataaaggc 
aagttacttg aatgggcaga 
acgccagaat gtgtcataca 
gacgaattgg ttcacatgcg 
aatgcggcca gacagaatgc 
gaatttgtcc cctgcaaaga 
gttagagttg taaattgtcc 
aatgtgtttt gtgaatgtgg 
gctgattcta atcattgtgt 
aaatatgcct tagaaagtat 
ggagttaaaa attattatgt 
gctaagaaat aa 



ggccggttta gatcatatag 
cgcatctgtc gtcaagcgaa 
atcaacatct agaattttta 
gatccctata cgtgtactca 
agtgtactct tattcaatgc 
caagcaagga gtaaaaggaa 
tttactccaa aaggatacaa 
agtttcaatc aacgctggta 
tgaggagaaa aacaagccag 
cgatttttct tatagtgaac 
ctcatcacat gatgacaagt 
catcatttcc ctgtatgaga 
ccagtgccct ggttgtaaaa 
aaatttgtgt aagagaatga 
tgctaccata aatcgcgcag 
tgaaactgcg aggagaattg 
agtaaagaat gtaaaagatt 
tcctgtcgta tgtgatgagc 
ccctctaatt aatcttaccg 
tcatatgtgg ccagcaaaat 
caccattagt acgtgcagag 
ggcaggatgg gatgatcaat 
aaagagtggg gaattgccca 
ttatagaggc gacagtttct 
catgagccct agtctggaac 
cttcttccac gagaaaaggg 
actattagtc aaaagttatc 
ccgcgcccat tcatttgtag 
tccagatgat gccacaagaa 
ggctgctgca gcctcatttt 
aaggacgatt gaatggatgt 
ttcatgtaaa aaggctatcc 
aacaaacatg tgctggatat 
ggagaaacac aggattgttt 
gtacgggttt gagatttgta 
aatggagaat ggatttttct 



aaggagaagt tgggggagaa 660 
tggtcaccat tatgcgcaac 720 
gagtgtatgt ggctgaacca 780 
gtaaattatt cggttcaaga 840 
taaatcttcc atatctcctg 900 
ttacattatc tataccatct 960 
cattacaatc ttccctatca 1020 
ttattggtga tgatttctct 1080 
aaaactgttg tatgtgcttt 1140 
atgtggcaag gcacaatttc 1200 
gttgtggagc aaagatttgt 1260 
aaatgactgg tgtggcaggt 1320 
gtggcatgct caatctaaag 1380 
tactaccata cacatcgact 1440 
aggcttgttt ttactcccta 1500 
cgcatggagc taaagacatt 1560 
tggatagact gtgtgccttg 1620 
cgaatgaaag tacggactat 1680 
agattgttga ttcggaagag 1740 
ttacctgtaa ctttattgcg 1800 
atgctgtaac ttttctagga 1860 
cggcagtggg gcaagccatt 1920 
aaaatatgtt tgatttacta 1980 
tgttacgtgc gataaactac 2040 
tcgttaaaag aaaggtgaat 2100 
tgcgtccaga cgcatctaaa 2160 
tcatggaagt tttacttcag 2220 
gcaaaactct cctcattact 2280 
acgcctatat ccagaaccta 2340 
ctgggtcgct ccccaaacct 2400 
atgaaaagga caatgatgat 24 60 
agaaatatgg aggttgtgtg 2520 
gtgaagagaa ggtttctcct 2580 
atagtaactg tgttagggtt 2640 
ccatgaaaaa tgtagaagaa 27 00 
ttgatgtaca agaaatggtt 27 60 

2772 



<210> 123 
<211> 919 
<212> PRT 
<213> SHRIMP 



<400> 123 



Met 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


1 








5 








Gin 


Thr 


Phe 


Leu 
20 


Lys 


Ala 


His 


Pro 


Lys 


Cys 


Asp 
35 


Glu 


Thr 


Gly 


Arg 


Asn 
40 


Ala 


Asp 
50 


He 


Ser 


His 


Leu 


He 
55 


Gin 


Phe 


Pro 


Leu 


His 


Pro 


Phe 


Lys 


Asn 


65 










70 






Val 


Gly 


Ser 


Leu 


His 
85 


Gly 


Arg 


Thr 


Tyr 


Cys 


Tyr 


Phe 
100 


Val 


Lys 


Asp 


Leu 


Asn 


Gly 


Val 
115 


Lys 


Glu 


Leu 


Gin 


Gly 
120 


Ser 


Gly 


Ser 


Gly 


Glu 


Ser 


Ser 


Ser 



Phe 


Ser 


Phe 


Arg 


He 


Ser 


Thr 


Tyr 




10 










15 




Asp 


Leu 


Val 


Asp 


Lys 


He 


Thr 


Gin 


25 










30 






Gin 


Lys 


Cys 


Pro 


He 


Gin 


Phe 


Leu 










45 








Gly 


Glu 


Arg 


Asn 


Gly 


Gly 


Asn 


Leu 








60 










Gin 


Pro 


His 


Leu 


Glu 


Pro 


Arg 


He 






75 










80 


Leu 


Asp 


Asn 


Asp 


He 


Glu 


Glu 


Ser 




90 










95 




Tyr 


Asn 


Gly 


val 


Phe 


Ser 


Tyr 


Val 


105 










110 






Val 


Leu 


Asp 


Lys 


Lys 


He 


Ser 


Gly 










125 








Ser 


Arg 


Ala 


Pro 


Leu 


He 


Pro 


He 
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1 **** 
x j 




inr 


Asp 


V Cll 


Asp 


Leu 


Leu 


iyr 


Tie* 


1 H O 










1 so 






Pro 


Arg 


Car 


Lys 


Ala 


Tyr 


Arg 


Vol 










1 £ R 
lOO 








Pro 


rfic 


Asn 


pin 

ulU 


trllKZ 




Asn 


Asn 


















711 - 

AJ.3 


Tyr 


vai 


Ser 


Arg 


Asp 


rnc 


Axg 
















onn 


Asp 


His 


1 ie 


Glu 


taiy 


blU 


vai 


Giy 




9i a 










^ 1 ZJ 




HlS 


Ala 


sei 


Val 


Val 


Lys 


Arg 


net 


















G1U 


Lys 


Lys 


Pro 


Pro 


Ser 


inr 


Cat* 

Der 










245 








blU 


Pro 


vai 


Asn 


Asp 


Aid 


vai 


inr 








260 










iiys 


Leu 


rfic 


Gly 


Ser 


Arg 


Leu 


nla 






£. / 0 










*5Q A 


Tyr 


oer 


wet 


Leu 


Asn 


Leu 


Pro 


Tyr 


















1 ie 


Lys 


bin 


Gly Val 


Lys 


Giy 


1 ie 


















Lys 


Leu 


Gly 


Phe 


Tyr 


Leu 


Leu 


Gin 










325 








Leu 


Ser 


Gin 


Asp 


Val 


Ala 


ASp 


Cys 








340 










lie 


Gly 


Asp 


Asp 


Phe 


Ser 


Glu 


Lys 






JJJ 












Asn 


Lys 


Pro 


Glu 


Asn 


Cys 


Cys 


Met 




0 / U 










375 




Pro 


Asp 


Phe 


Ser 


Tyr 


Ser 


Glu 


His 


385 










390 






Val 


His 


Ala 


Phe 


Ser 


Ser 


Ser 


His 










405 








He 


Cys 


Ser 


Glu 


Cys 


He 


Phe 


Pro 








420 










Met 


Thr 


Gly 


Val 


Ala 


Gly Val 


Lys 






A *3 <"» 
4 










4 4 n 

1 H U 


Gly 


Cys 


Lys 


Ser 


Gly Met 


Leu 


Asn 














455 




Ser 


Asn 


Leu 


Cys 


Lys 


Arg 


Met 


lie 


4 fob 










470 






Ser 


ber 


Leu 


Phe 


Asp 


Ala 


Thr 


lie 










485 








ser 


Leu 


Glu 


Phe 


Leu 


Gin 


Tyr 


Asp 








500 










His 


Gly 


Ala 


Lys 


Asp 


He 


Pro 


His 






JlD 












Val 


Lys 


Asp 


Leu 


Asp 


Arg 


Leu 


Cys 




530 










535 




Ser 


Pro 


Val 


Val 


Cys 


Asp 


Glu 


Pro 


54 5 










550 






Val 


Asp 


Val 


Thr 


Pro 


Pro 


Leu 


He 










565 








Glu 


Glu 


Tyr 


Asp 


Asp 


Gly 


Pro 


Gly 








580 










Thr 


Cys 


Asn 


Phe 


He 


Ala 


Gly 


Ser 






595 










600 


Thr 


Cys 


Arg 


Asp 


Ala 


Val 


Thr 


Phe 




610 










615 





Phe 


Gly 


Thr 


Leu 


Val 


Val 


Leu 


Pro 






1 -J J 










160 


Tip 
lie 


Thr 


Gin 
0 1 u 


Ala 


Val 


Leu 


Ala 


Leu 




1 7 A 

1 / IS 










1 / — * 




Trp 


Pro 


Jr J. U 


X 111 


7i O T~> 

nail 


l^C 




Gly 


IOC 

loo 










1 -? u 






wet 


fne 


Asn 


Leu 


Leu 


nla 


Vjl _y 


Leu 


















tjiy 


V?1U 


oer 


ulu 


Trp 


ulu 




T 1 0 
11c 


















vai 


inr 


1 ie 


wet 


Arg 


Asn 


Lys 


Ala 
Ala 






*> *"tC 
iJO 










"5^ A 


Arg 


T 1 A 

1 ie 


irne 


Arg 


Val 


Tyr 


Val 


Ala 


















Lys 


116 


Pro 


1 16 


Arg 


veil 


Leu 


OCl 


j£ DO 










9*7 n 






oi_y 


Tip 
11c 


JjCU 


win 


Lys 


Val 


fur 


Ser 


















Leu 


Leu 


Ser 


oer 


Asn 


Ser 


Tic* 

xie 


Asp 








1 A A 










1 ni 


Leu 


Ser 


Tip 

11c 


pro 


Ser 


Ala 


Arg 






010 










^ 2 A 


Lys 


Asp 


1 1 IX 


1 111 


Leu 


V3 1 11 


OC1 
















*^*^'^ 




lie 


vai 


Ser 


ne 


Asn 


Ml a 


fXI \/ 

oiy 


Tip 

lie 


"3>1 

J4D 










A 






lie 


Arg 


fil n 
Vjlfl 


Cys 


Tie* 
lie; 


Ul LI 


01 u 


T \/ c 

lijO 










ice 








Cys 


rfie 


cys 


Glu 


1 ie 


Asp 


Lys 


inr 








■3 0 










XT-. I 

vai 


ft! a 

Ala 


Arg 


MIS 


Asn 


rne 


xrilc 


Pro 






one 










/l AA 
4 UU 


Asp 


Asp 


Lys 


Cys 


Cys 




Aia 


Lys 




A 1 A 

4 1 U 










**3 IO 




Tyr 


lie 


T 1 Q 

1 ie 


oer 


Leu 


Tyr 


blU 


Lys 


/toe 
42D 










4 jU 






vai 


*U» 1 

vai 


Asp 


Leu 


trne 


bin 


Cys 


Pro 










A A^x 








Leu 


Lys 


/— -I 
biy 


Arg 


Cys 


Tyr 


VslU 


OK a 

rne 








A df\ 

4 bu 










Leu 


Pro 


Tyr 


1 nr 


Ser 


inr 


Hie 

n 1 0 


Cys 






A *7 ^ 
I/O 










4 ft A 


Asn 


Arg 


Aia 


LilU 


Aia 


Cys 


it 1 1 tr 


Tyr 




4 Q A 










J 




-rs v« a 

Pne 


GlU 


1 fir 


Aia 


Arg 


Arg 


x ie 


Aia 


DUO 
















Val 


1 J J - 


Asn 


Lys 


Val 


Val 


Lys 


Asn 










525 








Ala 


Leu 


Tyr 


Cys 


Tyr 


Lys 


Cys 


val 








540 










Asn 


Glu 


Ser 


Thr 


Asp 


Tyr 


Glu 


Met 






555 










560 


Asn 


Leu 


Thr 


Glu 


He 


Val 


Asp 


Ser 




570 










575 




Asn 


His 


Met 


Trp 


Pro 


Ala 


Lys 


Phe 


585 










590 






Ser 


Gly 


Glu 


Thr 


Pro 


Thr 


He 


Ser 










605 








Leu 


Gly 


Arg 


Ala 


Pro 


Arg 


Lys 


Lys 



620 
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Mcl 


r\Xa 


«x y 


i rp 


Asp 


Asp 
















dju 






Asn 


Trp 


Arg 


Lys 


OCX 


tjj.y 


fin 


Leu 










D1 J 










taiy 


va i 


Asn 


Aid 


vax 


Leu 


Tyr 








DDV 










Ala 


lie 


Asn 


Tyr 


Pro 


Cys 


vax 


1 ie 






O ' D 










con 


Glu 


Leu 


val 


Lys 


Arg 


Lys 


vax 


Asn 




^ on 










roc 




pne 


HIS 


CjIU 


Lys 


Arg 


vai 


Arg 


Pro 


70^ 










7*1 n 






Trp 


Ala 


ri„ 
blU 


Leu 


Leu 


vax 


Tire? 

Lys 


Ser 


















Thr 


Pro 


GlU 


Cys 


val 


lie 


nlS 


Arg 








*? 4 n 










Lieu 


lieu 


i ie 


i nr 


Asp 


blU 


Leu 


vai 






» jj 










7 £fi 


Arg 


Asn 


Ala 


Tyr 


lie 


uin 


Asn 


Leu 




/ / u 










77 ^ 




Ala 


Aia 


Aid 


oer 


rne 




(jxy 


oer 


*7 O C 










*7 on 






Cys 


Lys 


GlU 


Arg 


1 nr 


lie 


blU 


Trp 










one 
OUj 








vai 


Arg 


val 


va l 


Asn 


Cys 


Fro 


ber 








820 










Gly 


Gly 


Cys 


Val 


Asn 


Val 


Phe 


Cys 






835 










840 


He 


Cys 


Glu 


Glu 


Lys 


Val 


Ser 


Pro 




850 










855 




Lys 


His 


Arg 


He 


Val 


Tyr 


Ser 


Asn 


865 










870 






Glu 


Ser 


Met 


Tyr 


Gly 


Phe 


Glu 


He 










885 








Gly 


Val 


Lys 


Asn 


Tyr 


Tyr 


Val 


Met 








900 










Gin 


Glu 


Met 


Val 


Ala 


Lys 


Lys 





915 



nla 


val 


uiy 


oi.11 


nla 


Tip 

lie 


T 1 
1. ie 


Ala 
met 






DO O 










640 


Pro 


Lys 


Asn 




.nit: 


Asp 


Leu 


Leu 














0 — » -J 




Arg 


oiy 


Asp 


Ser 




Leu 


Leu 


Arg 












670 






biy 


Arg 


Del 




oer 


Pro 


oer 


Leu 










cot 
000 








T we? 


Tic 

x xe 


ax a 


Leu 


x xe 


Lys 


a 1 a 

nXa 


Pho 
rilfc: 








"7 r\n 
/ uu 










Asp 


Axa 


Ser 


Lys 


Lys 


Leu 


Leu 


















7?0 


Tyr 


Leu 


Met 


blU 


vai 


Leu 


Leu 






7 

/ J Lr 










7 




Aid 


nl S 


oer 


rne 


vai 


biy 


x»ys 


1 nr 


1 A *k 










Ten 
/jU 






nlS 


wet 


Arg 


Pro 


Asp 


Asp 


Axa 


1 nr 










"7 ^ 








Asn 


Axa 


Ala 


Arg 


bin 


Asn 


Axa 


A10 








7ft n 










Leu 


riO 


Lys 


Pro 


ulu 


fne 


vax 


Pro 
















0 uu 


Met. 


Tyr 


ulU 


Lys 


Asp 


Asn 


Asp 


Asp 




OIL) 










OiJ 




Cys 


Lys 


Lys 


Ala 


1 le 


Gin 


Lys 


Tyr 


825 










830 






Glu 


Cys 


Gly 


Thr 


Asn 


Met 


Cys 


Trp 










845 








Ala 


Asp 


Ser 


Asn 


His 


Cys 


Val 


Glu 








860 










Cys 


Val 


Arg 


Val 


Lys 


Tyr 


Ala 


Leu 






875 










880 


Cys 


Thr 


Met 


Lys 


Asn 


Val 


Glu 


Glu 




890 










895 




Glu 


Asn 


Gly 


Phe 


Phe 


Phe 


Asp 


Val 


905 










910 







<210> 124 
<211> 1536 
<212> DNA 
<213> SHRIMP 



<400> 124 

atggctgaaa 

gatteggaca 

gtccacctgg 

ctgttatgtc 

gaagaggtgt 

gatgaaggtg 

gtccccaatg 

aagttgagtt 

acctttgaag 

gaatatgaag 

ttgagtaatg 

ggtgtgacat 

gaggtagatg 

aacaatatta 

ga tgatgatg 



ccgtcgccgt 
eggattgeta 
agtgtctttt 
ccatgtgtag 
acaagtatgc 
taaaggatac 
ccaaaggtgt 
teatgettga 
ataatcatga 
gtgaatggct 
atgatgatga 
ccttcaattt 
ataatgatga 
cctttgatac 
atga tgacac 



tgatgaggtg 
taactggtca 
cacctggaga 
ggectatata 
ctcttttcac 
ccttaataag 
taatgagaat 
aactctatct 
ggcagcttta 
taagtatgta 
tgatgaaggt 
cattgaagat 
ctcagagtca 
tattagtggt 
aatgtctttg 



cctacctgtc 
aatggaggga 
tttgaagagc 
ccccctgtgt 
tcatttttgt 
atgtcaacta 
gaggatgttt 
agacaggaaa 
atgggtaaat 
gctcccaata 
gataataatg 
gatgaggata 
cttcctgact 
attagtagtt 
cctgacttga 



ccatctgtat 
tgccgtgttg 
atatggtgaa 
ggttcttccg 
tgtctgctga 
ttctagcacc 
atatggagag 
tgeatgeatt 
tcaaggatat 
ctattgacat 
tctcccctag 
ctgtagtgtt 
tgactgttcc 
cactttatga 
atatgectag 



gggtgattac 
caggaaaagt 
tgaaaatcac 
taaagtgtat 
ctatgttaat 
tactttcttt 
ggcttatacc 
cagtgaagag 
ccccccttat 
tactcaatgc 
tttgttgtct 
tgtcccccca 
ccctagaagt 
tgttaatgat 
tgcttctacg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tcatccgccc ctacctcatc cgcccctacc tcaacctctc ttaacattaa tgttaacctt 960 

tgttttaatg ttgattcaga ctctgatgac gaagaagtaa taccttcttc gtcatcagtg 1020 

"aatcagccct ctacttcctc aggaagtagt agtagtagta gtaacagtag aaagaggcca 1080 

aggtatgggc gtgacgagga caggatgtct aatatttcct ctgagagtaa gagactgtgt 1140 

gtagatgtca agaggtatat gtgtagactc gataatattg atgaggagta taatgagatt 1200 

gccaataggt atctggctga actttctgct cttagagaaa ggagacagga aactgagaat 1260 

aagcttggag attgtatatc tagaggtaac ctgttccata caactgtcaa tgatgtaatt 1320 

ggtaagagtt tgtgctctaa gaaattgaag gtgaagcgta agtacgcttc aaagtggagt 1380 

gctaataagc agctaattgg ttcctgtctc attaagtcag cctctaataa tgctaggttg 144 0 

gatgacgaaa ttgcacatgt acacagttca ttgttgaatg ggtttgatac tgacccttcg 1500 

gaagctgatc aaatatcttc cctgcccaat ctataa 1536 

<210> 125 

<211> 509 

<212> PRT 

<213> SHRIMP 

<400> 125 

Met Ala Glu Thr Val Ala Val Asp Glu Val Pro Thr Cys Pro lie Cys 

1 5 10 15 . 

Met Gly Asp Tyr Asp Ser Asp Thr Asp Cys Tyr Asn Trp Ser Asn Gly 

20 25 30 

Gly Met Pro Cys Cys Arg Lys Ser Val His Leu Glu Cys Leu Phe Thr 

35 40 45 

Trp Arg Phe Glu Glu His Met Val Asn Glu Asn His Leu Leu Cys Pro 

50 55 60 

Met Cys Arg Ala Tyr lie Pro Pro Val Trp Phe Phe Arg Lys Val Tyr 
65 70 75 80 

Glu Glu Val Tyr Lys Tyr Ala Ser Phe His Ser Phe Leu Leu Ser Ala 

85 90 95 

Asp Tyr Val Asn Asp Glu Gly Val Lys Asp Thr Leu Asn Lys Met Ser 

100 105 110 

Thr lie Leu Ala Pro Thr Phe Phe Val Pro Asn Ala Lys Gly Val Asn 

115 120 125 

Glu Asn Glu Asp Val Tyr Met Glu Arg Ala Tyr Thr Lys Leu Ser Phe 

130 135 140 

Met Leu Glu Thr Leu Ser Arg Gin Glu Met His Ala Phe Ser Glu Glu 
145 150 155 160 

Thr Phe Glu Asp Asn His Glu Ala Ala Leu Met Gly Lys Phe Lys Asp 

165 170 175 

lie Pro Pro Tyr Glu Tyr Glu Gly Glu Trp Leu Lys Tyr Val Ala Pro 

180 185 190 

Asn Thr He Asp He Thr Gin Cys Leu Ser Asn Asp Asp Asp Asp Asp 

195 200 205 

Glu Gly Asp Asn Asn Val Ser Pro Ser Leu Leu Ser Gly Val Thr Ser 

210 215 220 

Phe Asn Phe He Glu Asp Asp Glu Asp Thr Val Val Phe Val Pro Pro 
225 230 235 240 

Glu Val Asp Asp Asn Asp Asp Ser Glu Ser Leu Pro Asp Leu Thr Val 

245 250 255 

Pro Pro Arg Ser Asn Asn He Thr Phe Asp Thr He Ser Gly He Ser 

260 265 270 

Ser Ser Leu Tyr Asp Val Asn Asp Asp Asp Asp Asp Asp Asp Thr Met 

275 280 285 

Ser Leu Pro Asp Leu Asn Met Pro Ser Ala Ser Thr Ser Ser Ala Pro 

290 295 300 

Thr Ser Ser Ala Pro Thr Ser Thr Ser Leu Asn He Asn Val Asn Leu 
305 310 315 320 

Cys Phe Asn Val Asp Ser Asp Ser Asp Asp Glu Glu Val He Pro Ser 

325 330 335 

Ser Ser Ser Val Asn Gin Pro Ser Thr Ser Ser Gly Ser Ser Ser Ser 
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340 

Ser Ser Asn Ser 
355 

Met Ser Asn lie 
370 

Arg Tyr Met Cys 
385 

Ala Asn Arg Tyr 

Glu Thr Glu Asn 
420 

His Thr Thr Val 
435 

Leu Lys Val Lys 
450 

Leu lie Gly Ser 
465 

Glu lie Ala His 

Pro Ser Glu Ala 
500" 



Arg 


Lys 


Arg 


Pro 








360 


Ser 


Ser 


Glu 


Ser 






375 




Arg 


Leu 


Asp 


Asn 




390 






Leu 


Ala 


Glu 


Leu 


405 








Lys 


Leu 


Gly 


Asp 


Asn 


Asp 


Val 


He 








440 


Arg 


Lys 


Tyr 


Ala 






455 




Cys 


Leu 


He 


Lys 




470 






Val 


His 


Ser 


Ser 


485 








Asp 


Gin 


He 


Ser 



345 








Arg 


Tyr 


Gly 


Arg 


Lys 


Arg 


Leu 


Cys 








380 


He 


Asp 


Glu 


Glu 






395 




Ser 


Ala 


Leu 


Arg 




410 






Cys 


He 


Ser 


Arg 


425 








Gly 


Lys 


Ser 


Leu 


Ser 


Lys 


Trp 


Ser 








460 


Ser 


Asn 


Asn 


Ala 






475 




Leu 


Leu 


Asn 


Gly 




490 






Ser 


Leu 


Pro 


Asn 



505 





350 






Asp 


Glu 


Asp 


Arg 


365 








Val 


Asp 


Val 


Lys 


Tyr 


Asn 


Glu 


He 








400 


Glu 


Arg 


Arg 


Gin 






415 




Gly Asn 


Leu 


Phe 




430 






Cys 


Ser 


Lys 


Lys 


445 








Ala 


Asn 


Lys 


Gin 


Arg 


Leu 


Asp 


Asp 








480 


Phe 


Asp 


Thr 


Asp 






4 95 




Leu 









<210> 126 
<211> 1746 
<212> DNA 
<213> SHRIMP 



<400> 126 

atggggggac 

cagtatgttt 

tacgcgtctg 

aacatcacta 

aaacgtttca 

gtgtgtgcgt 

gattcccctt 

acctgtcaac 

gatacctgtt 

gatgaagaga 

attccgtcct 

tcaagtgcct 

cagtgtgaga 

gaagtgcctg 

gagattaaat 

gatatgaact 

aatattaagg 

ccatttgcag 

ggggagtatt 

atgggacgtt 

attgcccgaa 

ttggctgaga 

ataaaggctg 

ggactctcat 

ctaatcaggc 

tattcaatga 

tgttctaaca 

ttgcaaagcg 

cttgccatga 

aaatga 



ccactgtaat 
accatcaggg 
gttcaacctc 
tcaagacttg 
agaagtctga 
ccaaggtgat 
actactttaa 
agttgagaat 
tctatggaaa 
tgagattttt 
cccagattat 
gtgagcgaat 
actgggaaaa 
aagatgtgaa 
atggggtcta 
atgaagagtt 
gggtagttgg 
gtatcagtat 
atgagtcaag 
taccagaggt 
aggttgtaag 
cttgtaatgt 
ataactttgt 
accctctagg 
agtacattca 
cgtacagttt 
tggaaggccc 
gaacagacac 
agcacatatt 



tactactacc 
gaataaaaaa 
cgaagccaca 
ggatgacgtc 
agttgctccc 
taggcagtgt 
aggcattgag 
tggccatagg 
ggaccctaaa 
tgacactaga 
ggacataatg 
tagacaaaag 
cttccgtaag 
ggactttaac 
ttacattgtg 
tgtttttgag 
ggtgactggt 
tgatgatgtc 
ggacgcacgc 
aaagggggtt 
aagtatgttt 
aattacaaga 
tatagatagc 
ttattgttac 
cacacctccc 
cagtgtaatg 
tgcctttaat 
tgatttctat 
cccttttaag 



atcaatactg 
cggcctgtgg 
actgttcccg 
atcaacctta 
tctcctccca 
aagaggcagt 
aagagttgta 
tcaattgtta 
gttggcttct 
cacattctta 
tatatggctg 
actagcacgc 
actgctcgtg 
actttcatct 
aataggggga 
tgtgttaatg 
gtgtgtcctc 
attaggtgtg 
ttgttgtatg 
gatacagtcg 
gaaaaactaa 
gttgcaaatg 
atatctggcc 
aacgatgaat 
gagttcttta 
gcttccagta 
ttgatgtcaa 
caaaatcgcc 
gggactgtaa 



gtggagacca 
aagaatataa 
cttacaacaa 
gcatcacgcc 
ctactcgcac 
ataatgagtg 
gtcttgagga 
agtctagcaa 
attggcccac 
aggagttgtc 
tagaggtgtt 
taattaagga 
gttgtttgtc 
gtccctggga 
ctgttgtcaa 
gcctttctgt 

aggggttatg 

tcaaggatag 
gggttgtcat 
caccaataac 
aggtgaacat 
agggaattat 
aacctaaaat 
attttaggaa 
ggggacactg 
tactggaaga 
acatgcactt 
cttcaatcac 
tgaacctgtt 



ccaccaccag 
caacaacaac 
caacaacaac 
ccctccccct 
cttttcaaac 
gattgaacgt 
caattatgat 
gtatgtccat 
ctcttcttgc 
tagtcgtaat 
ccaattgcct 
agtttctgac 
tgatttggtc 
gacctttttt 
gtttatgaag 
atacagaaag 
tttagagatg 
tttagatggt 
gcttcaaagg 
agactctttc 
gccttttgtt 
taatgtcgat 
gattgacttg 
cacggaagaa 
tctaggtgcc 
tgttgttgct 
tttgatgttg 
agaatatgcc 
taaagtaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 
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<211> 575 
<212> PRT 
<213> SHRIMP 



<400> 127 



Met 


Gly 


Gly 


Pro 


Thr 


Val 


He 


Thr 


1 








5 








His 


His 


His 


Gin 


Gin 


Tyr 


Val 


Tyr 








20 










Val 


Glu 


Glu 


Tyr 


Asn 


Asn 


Asn 


Asn 






35 










40 


Ala 


Thr 


Thr 


Val 


Pro 


Ala 


Tyr 


Asn 




50 










55 




Lys 


Thr 


Trp 


Asp 


Asp 


Val 


He 


Asn 


65 










70 






Lys 


Arg 


Phe 


Lys 


Lys 


Ser 


Glu 


Val 










85 








Thr 


Phe 


Ser 


Asn 


Val 


Cys 


Ala 


Ser 








100 










Gin 


Tyr 


Asn 


Glu 


Trp 


He 


Glu 


Arg 






115 










120 


lie 


Glu 


Lys 


Ser 


Cys 


Ser 


Leu 


Glu 




130 










135 




Leu 


Arg 


He 


Gly 


His 


Arg 


Ser 


He 


145 










150 






Asp 


Thr 


Cys 


Phe 


Tyr 


Gly 


Lys 


Asp 










165 








Thr 


Ser 


Ser 


Cys 


Asp 


Glu 


Glu 


Met 








180 










Leu 


Lys 


Glu 


Leu 


Ser 


Ser 


Arg 


Asn 






195 










200 


lie 


Met 


Tyr 


Met 


Ala 


Val 


Glu 


Val 




210 










215 




Glu 


Arg 


He 


Arg 


Gin 


Lys 


Thr 


Ser 


225 










230 






Gin 


Cys 


Glu 


Asn 


Trp 


Glu 


Asn 


Phe 










245 








Leu 


Val 


Glu 


Val 


Pro 


Glu 


Asp 


Val 








260 










Pro 


Trp 


Glu 


Thr 


Phe 


Phe 


Glu 


He 






275 










280 


Asn 


Arg 


Gly 


Thr 


Val 


Val 


Lys 


Phe 




290 










295 




Phe 


Val 


Phe 


Glu 


Cys 


Val 


Asn 


Gly 


305 










310 






Lys 


Gly 


Val 


Val 


Gly 


Val 


Thr 


Gly 










325 








Glu 


Met 


Pro 


Phe 


Ala 


Gly 


He 


Ser 








340 










Lys 


Asp 


Ser 


Leu 


Asp Gly 


Gly 


Glu 






355 










360 


Leu 


Leu 


Tyr 


Gly 


Val 


Val 


Met 


Leu 




370 










375 




Val 


Lys 


Gly 


Val 


Asp 


Thr 


Val 


Ala 


385 










390 






Arg 


Lys 


Val 


Val 


Arg 


Ser 


Met 


Phe 










405 








Phe 


Val 


Leu 


Ala 


Glu 


Thr 


Cys 


Asn 








420 










Gly 


lie 


He 


Asn 


Val 


Asp 


He 


Lys 






435 










440 



Thr 


Thr 


He 


Asn 


Thr 


Gly 


Gly Asp 




10 










15 




His 


Gin 


Gly 


Asn 


Lys 


Lys 


Arg 


Pro 


25 










30 






Tyr 


Ala 


Ser 


Gly 


Ser 


Thr 


Ser 


Glu 










45 








Asn 


Asn 


Asn 


Asn 


Asn 


He 


Thr 


He 








60 










Leu 


Ser 


He 


Thr 


Pro 


Pro 


Pro 


Pro 






75 










80 


Ala 


Pro 


Ser 


Pro 


Pro 


Thr 


Thr 


Arg 




90 










95 




Lys 


Val 


He 


Arg 


Gin 


Cys 


Lys 


Arg 


105 










110 






Asp 


Ser 


Pro 


Tyr 


Tyr 


Phe 


Lys 


Gly 










125 








Asp 


Asn 


Tyr 


Asp 


Thr 


Cys 


Gin 


Gin 








140 










Val 


Lys 


Ser 


Ser 


Lys 


Tyr 


Val 


His 






155 










160 


Pro 


Lys 


Val 


Gly 


Phe 


Tyr 


Trp 


Pro 




170 










175 




Arg 


Phe 


Phe 


Asp 


Thr 


Arg 


His 


He 


185 










190 






He 


Pro 


Ser 


Ser 


Gin 


He 


Met 


Asp 










205 








Phe 


Gin 


Leu 


Pro 


Ser 


Ser 


Ala 


Cys 








220 










Thr 


Leu 


He 


Lys 


Glu 


Val 


Ser 


Asp 






235 










240 


Arg 


Lys 


Thr 


Arg 


Cys 


Leu 


Ser 


Asp 




250 










255 




Lys 


Asp 


Phe 


Asn 


Thr 


Phe 


He 


Cys 


265 










270 






Lys. 


Tyr 


Gly 


Val 


Tyr 


Tyr 


He 


Val 










285 








Met 


Lys 


Asp 


Met 


Asn 


Tyr 


Glu 


Glu 








300 










Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Asn 


He 






315 










320 


Val 


Cys 


Pro 


Gin 


Gly 


Leu 


Cys 


Leu 




330 










335 




He 


Asp 


Asp 


Val 


He 


Arg 


Cys 


Val 


345 










350 






Tyr 


Tyr 


Glu 


Ser 


Arg 


Asp 


Ala 


Arg 










365 








Gin 


Arg 


Met 


Gly 


Arg 


Leu 


Pro 


Glu 








380 










Pro 


He 


Thr 


Asp 


Ser 


Phe 


He 


Ala 






395 










400 


Glu 


Lys 


Leu 


Lys 


Val 


Asn 


Met 


Pro 




410 










415 




Val 


He 


Thr 


Arg 


Val 


Ala 


Asn 


Glu 


425 










430 






Ala 


Asp 


Asn 


Phe 


Val 


He 


Asp 


Ser 



445 
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lie 


Ser 


Gly Gin 


Pro 


Lys 


Met 


He 


Asp 


Leu 


Gly 


Leu 


Ser 


Tyr 


Pro 


Leu 




450 










455 










4 60 










Gly 


Tyr 


Cys 


Tyr 


Asn 


Asp 


Glu 


Tyr 


Phe 


Arg 


Asn 


Thr 


Glu 


Glu 


Leu 


He 


4 65 










470 






















Arg 


Gin 


Tyr 


He 


His 


Thr 


Pro 


Pro 


Glu 


Phe 


Phe 


Arg 


Gly 


His 


Cys 


Leu 








485 










490 










4 95 




Gly 


Ala 


Tyr 


Ser 


Met 


Thr 


Xyr 


Ser 


Phe 


Ser 


Val 


Met 


Ala 


Ser 


Ser 


lie 






500 










505 










510 






Asp 


Val 


Val 


Ala 


Cys 


Ser 


Asn 


Met 


Glu 


Gly 


Pro 


Ala 


Phe 


Asn 


Leu 


Met 




515 










520 










525 








Ser 


Asn 
530 


Met 


His 


Phe 


Leu 


Met 
535 


Leu 


Leu 


Gin 


Ser 


Gly 
540 


Thr 


Asp 


Thr 


Asp 


Phe 


Tyr 


Gin 


Asn 


Arg 


Pro 


Ser 


He 


Thr 


Glu 


Tyr 


Ala 


Met 


Lys 


His 


He 


545 








550 










555 










560 


Phe 


Pro 


Phe 


Lys 


Gly 


Thr 


Val 


Met 


Asn 


Leu 


Phe 


Lys 


Val 


Lys 


Lys 





565 570 575 



<210> 128 
<211> 1200 
<212> DNA 
<213> SHRIMP 



<400> 128 

atgcttagca 

attcactttt 

gaagaattaa 

ccaatcactg 

aagagtagtc 

aaaggaggac 

ttaattgaaa 

gcgaattgcc 

gatgaaaagt 

acggctggtg 

tctgatacta 

tctgatatta 

aaaatgagac 

aaagaaggag 

tcgggggaag 

tctctagtga 

tcacctatgc 

ccttttgccc 

cctggctcag 

ggggcaagag 



cgtgtgatct 
ctgaatcaat 
acaaggattt 
aaacgttgaa 
tcctgactct 
aggaaatggc 
ctgcaaagga 
gggatagtaa 
atgttaagca 
caaataaacc 
attatacagc 
catatgaagt 
tagcaacagg 
gaagagtaga 
aaatgcagat 
cagagggtgt 
cagctagtcc 
cttcagctga 
aatgggttca 
ggtctggtat 



aaagcaccct 
tccagctaat 
actggatagt 
aaattgtata 
acaagaaaaa 
actgtgtgcg 
ggcctcaatg 
gtgttcggcc 
aaactcttca 
acccaaagag 
catgagtgat 
aacaaagcaa 
cgtggaaatg 
agtagaggta 
ggaaattatg 
ggatgattat 
acttccttct 
tattgttgtt 
ccagagagac 
aaatactgga 



tcttcaacag 
gatataattt 
gtaagaaatc 
actactcaaa 
tttaatgaat 
agccttaaaa 
gataataaaa 
agtttagtta 
tcagcctcgg 
tttataacaa 
aaccttattt 
gctaaagaac 
gtaatgaagg 
gaaggagtag 
ttgcctacac 
cctgtgtttt 
aatgggaata 
gataaaacat 
aggaatagta 
agatatagaa 



acggaaatgt 
ctttcccttc 
agatcaaatt 
cacttttaca 
ggggatcaat 
tcatgggcca 
agaagaataa 
cattatttaa 
ctcttctggc 
aagataatgc 
gtccaggcaa 
gcataaaaaa 
aactagaagc 
aacaacagca 
ctcctccccc 
caccactccc 
gtgcactgga 
cagaaattat 
aaatggagat 
gaaataacac 



acttaaaaat 
ctctgacacg 
cgggttcgac 
ctcttttctc 
tcaactagag 
aatatcagct 
taatgcatgt 
taaaactatc 
aaatactttc 
acatggcaat 
atactactca 
taacaataag 
agaaaataat 
accgtccacg 
ggatctagaa 
ttcattattg 
agatgggggg 
gggcagaaca 
acgaaactat 
tgttctatag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



<210> 129 
<211> 395 
<212> PRT 
<213> SHRIMP 

<400> 129 

Met Leu Ser Thr Cys Asp Leu Lys 

1 5 
Val Leu Lys Asn He His Phe Ser 
20 

He Ser Phe Pro Ser Ser Asp Thr 

35 40 
Asp Ser Val Arg Asn Gin He Lys 

50 55 
Thr Leu Lys Asn Cys He Thr Thr 



His Pro Ser Ser Thr Asp Gly Asn 

10 15 
Glu Ser He Pro Ala Asn Asp He 
25 30 
Glu Glu Leu Asn Lys Asp Leu Leu 
45 

Phe Gly Phe Asp Pro He Thr Glu 
60 

Gin Thr Leu Leu His Ser Phe Leu 
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Lys 


Ser 


Ser 


Leu 


lie 


Gin 


Leu 


Glu 








100 


Lys 


He 


Met 


Gly 






115 




Ser 


Met 


Asp 


Asn 




130 






Asp 


Ser 


Lys 


Cvs 


145 








Asp 


Glu 


Lys 


Tyr 


Ala 


Asn 


Thr 


Phe 








180 


Thr 


Lys 


Asp 


Asn 






195 




Ser 


Asp 


Asn 


Leu 




210 






* y x 


Glu 


Val 


Thr 


225 








Lys 


Met 


Arg 


Leu 


Ala 


Glu 


Asn 


Asn 








260 


Val 


Glu 


Gin 


Gin 






275 




He 


Met 


Leu 


Pro 




290 






Glu 


Gly 


Val 


Asp 


305 








Ser 


Pro 


Met 


Pro 


Gly 


Gly 


Pro 


Phe 








340 


Glu 


He 


Met 


Gly 






355 




Arg 


Asn 


Ser 


Lys 




370 






Thr 


Gly 


Arg 


Tyr 


385 









70 

Leu Thr Leu Gin 
85 

Lys Gly Gly Gin 

Gin He Ser Ala 
120 

Lys Lys Lys Asn 
135 

Ser Ala Ser Leu 
150 

Val Lys Gin Asn 
165 

Thr Ala Gly Ala 

Ala His Gly Asn 
200 

He Cys Pro Gly 
215 

Lys Gin Ala Lys 
230 

Ala Thr Gly Val 
245 

Lys Glu Gly Gly 

Gin Pro Ser Thr 
280 

Thr Pro Pro Pro 
295 

Asp Tyr Pro Val 
310 

Asp Leu Pro Ser 
325 

Ala Pro Ser Ala 

Arg Thr Pro Gly 
360 

Met Glu lie Arg 
375 

Arg Arg Asn Asn 
390 







75 




Glu 


Lys 


Phe 


Asn 




90 






Glu 


Met 


Ala 


Leu 


105 








Leu 


He 


Glu 


Thr 


Asn 


Asn 


Ala 


Cys 








140 


Val 


Thr 


Leu 


Phe 






155 




Ser 


Ser 


Ser 


Ala 




170 






Asn 


Lys 


Pro 


Pro 


185 








Ser 


Asp 


Thr 


Asn 


Lys 


Tyr 


Tyr 


Ser 








220 


Glu 


Arg 


lie 


Lys 






235 




Glu 


Met 


Val 


Met 




250 






Arg 


Val 


Glu 


Val 


265 








Ser 


Gly 


Glu 


Glu 


Pro 


Asp 


Leu 


Glu 








300 


Phe 


Ser 


Pro 


Leu 






315 




Asn 


Gly 


Asn 


Ser 




330 






Asp 


He 


Val 


Val 


345 








Ser 


Glu 


Trp 


Val 


Asn 


Tyr 


Gly 


Arg 








380 


Thr 


Val 


Leu 








395 





80 

Glu Trp Gly Ser 

95 ' 

Cys Ala Ser Leu 
110 

Ala Lys Glu Ala 
125 

Ala Asn Cys Arg 

Asn Lys Thr He 
160 

Ser Ala Leu Leu 
175 

Lys Glu Phe He 
190 

Tyr Thr Ala Met 
205 

Ser Asp He Thr 

Asn Asn Asn Lys 
240 

Lys Glu Leu Glu 
255 

Glu Val Glu Gly 

270 

Met Gin Met Glu 
285 

Ser Leu Val Thr 

Pro Ser Leu Leu 
320 

Ala Leu Glu Asp 
335 

Asp Lys Thr Ser 
350 

His Gin Arg Asp 
365 

Ser Gly He Asn 



<210> 130 
<211> 1233 
<212> DNA 
<213> SHRIMP 

<400> 130 

atggaagaac atctatcctt caacaaaccc tctccagaaa atggagtagt cttctttgac 60 

tttagtgata atacttctat gtctaacatg gtagacaata tccggcacag acttcctatg 120 

gacaagaaat tttcctccaa ggctctactc ttggcctcta cgcccattcc cagtgacgaa 180 

caactctcca ccaaagtaaa caaggcaata ttttctcata gagaaactat tgttttatct 240 

aaagcgttaa agattgttgt tactggactt tatgtagatg gggaatatgt ggatgatgtt 300 

atttgcctct atccagaaaa acacaccttg aatgggattc tgagatatgt tgtgcatcta 360 

aatatgatgt taatggacaa ggcagaagac gcagacgaaa ttcgttgtgg acttattcct 420 

ctagggagag ggtttaatag agaggctttc aagtttgtgg acccagttat cccttgtgcc 480 

ggctacaaca tattgaatgg ataccaccca gataatggcc accaaatctc cccttcatct 54 0 

actcaaccac aggtgcaaag aagatgcgca gtcaagcaaa tgtacaaaca aatcaatggc 600 

atgtttgaag tggtgaaaca attttcaatc aaacacaaca atagaatctt cactataaat 660 

caagttgatt ttaaaggtga agaaatgaaa atgttctttg ctctctacag cgaagagtta 720 
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cttcctttct 
tctcagttgc 
aatactctga 
ggggataatt 
ggagatgatg 
agaaatgttg 
gagggtgttg 
agggaagaaa 
tatgtaataa 



attcagaaac 
ctccacatgt 
tgaaaacaga 
ttgagttgtt 
cactcgattt 
aactcgctga 
ttggtgaatg 
ttgtattcgg 
aacaaccagt 



gggaaaatta 
gaccatttct 
ttttgggagt 
tgggatgaac 
gcgtagacga 
caacagactg 
ggtgagcaga 
aggtacaaaa 
tgattatgta 



ttatctgaga 
gttttctatt 
tgttttgcac 
aacaacattc 
ataatggaac 
aatcctcaca 
ttcgctccgt 
ttcatttttg 
tga 



aacacgtctc 
tgcgaaatat 
cagctatcaa 
tcgtatccaa 
acatttcaga 
tcactcatgg 
gcaatttcct 
gaagggtcag 



caaatcattt 
ggaagaatac 
aatcgacact 
agtatgtgtt 
tgcaataggg 
gaagattaat 
ttgtaaaccg 
taatggaaac 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1233 



<210> 131 
<211> 404 
<212> PRT 
<213> SHRIMP 



<400> 131 



Met 


Glu 


Glu 


His 


Leu 


Ser 


Phe 


Asn 


1 








5 








Val 


Phe 


Phe 


Asp 


Phe 


Ser 


Asp 


Asn 








20 










Asn 


He 


Arg 


His 


Arg 


Leu 


Pro 


Met 






35 










40 


Leu 


Leu 


Leu 


Ala 


Ser 


Thr 


Pro 


He 




50 










55 




Lys 


Val 


Asn 


Lys 


Ala 


He 


Phe 


Ser 


65 










70 






Lys 


Ala 


Leu 


Lys 


He 


Val 


Val 


Thr 










85 








Asp 


Val 


He 


Cys 


Leu 


Tyr 


Pro 


Glu 








100 










Arg 


Tyr 


Val 


Val 


His 


Leu 


Asn 


Met 






115 










120 


Ala 


Asp 


Glu 


He 


Arg 


Cys 


Gly 


Leu 




130 










135 




Arg 


Glu 


Ala 


Phe 


Lys 


Phe 


Val 


Asp 


145 










150 






Asn 


He 


Leu 


Asn 


Gly 


Tyr 


His 


Pro 










165 








Ser 


Ser 


Thr 


Gin 


Pro 


Gin 


Val 


Gin 








180 










Tyr 


Lys 


Gin 


He 


Asn 


Gly 


Met 


Phe 






195 










200 


Lys 


His 


Asn 


Asn 


Arg 


He 


Phe 


Thr 




210 










215 




Glu 


Glu 


Met 


Lys 


Met 


Phe 


Phe 


Ala 


225 










230 






Ser 


Glu 


Thr 


Gly 


Lys 


Leu 


Leu 


Ser 










245 








Ser 


Gin 


Leu 


Pro 


Pro 


His 


Val 


Thr 








260 










Met 


Glu 


Glu 


Tyr 


Asn 


Thr 


Leu 


Met 






275 










280 


Ala 


Pro 


Ala 


He 


Lys 


He 


Asp 


Thr 




290 










295 




Met 


Asn 


Asn 


Asn 


He 


Leu 


Val 


Ser 


305 










310 






Leu 


Asp 


Leu 


Arg 


Arg 


Arg 


He 


Met 










325 








Val 


Glu 


Leu 


Ala 


Asp 


Asn 


Arg 


Leu 








340 










He 


Asn 


Glu 


Gly 


Val 


val 


Gly 


Glu 



Lys 


Pro 


Ser 


Pro 


Glu 


Asn 


Gly 


Val 




10 










15 




Thr 


Ser 


Met 


Ser 


Asn 


Met 


Val 


Asp 


25 










30 






Asp 


Lys 


Lys 


Phe 


Ser 


Ser 


Lys 


Ala 










45 








Pro 


Ser 


Asp 


Glu 


Gin 


Leu 


Ser 


Thr 








60 










His 


Arg 


Glu 


Thr 


He 


Val 


Leu 


Ser 






75 










80 


Gly 


Val 


Asp 


Gly 


Glu 


Tyr 


Val 


Asp 




90 










95 




Lys 


His 


Thr 


Leu 


Asn 


Gly 


He 


Leu 


105 










110 






Met 


Leu 


Met 


Asp 


Lys 


Ala 


Glu 


Asp 










125 








He 


Pro 


Leu 


Gly 


Arg 


Gly 


Phe 


Asn 








140 










Pro 


Val 


He 


Pro 


Cys 


Ala 


Gly 


Tyr 






155 










160 


Asp 


Asn 


Gly 


His 


Gin 


He 


Ser 


Pro 




170 










175 




Arg 


Arg 


Cys 


Ala 


Val 


Lys 


Gin 


Met 


185 










190 






Glu 


Val 


Val 


Lys 


Gin 


Phe 


Ser 


He 










205 








He 


Asn 


Gin 


Val 


Asp 


Phe 


Lys 


Gly 








220 










Lys 


Glu 


Glu 


Leu 


Leu 


Pro 


Phe 


Tyr 






235 










240 


Glu 


Lys 


His 


Val 


Ser 


Lys 


Ser 


Phe 




250 










255 




He 


Ser 


Val 


Phe 


Tyr 


Leu 


Arg 


Asn 


265 










270 






Lys 


Thr 


Asp 


Phe 


Gly 


Ser 


Cys 


Phe 










285 








Gly 


Asp 


Asn 


Phe 


Glu 


Leu 


Phe 


Gly 








300 










Lys 


Val 


Cys 


Val 


Gly 


Asp 


Asp 


Ala 






315 










320 


Glu 


His 


Asp 


Ala 


He 


Gly 


Arg 


Asn 




330 










335 




Asn 


Pro 


His 


He 


Thr 


His 


Gly 


Lys 


345 










350 






Trp 


Val 


Ser 


Arg 


Phe 


Ala 


Pro 


Cys 
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355 360 365 

Asn Phe Leu Cys Lys Pro Arg Glu Glu lie Val Phe Gly Gly Thr Lys 

370 375 380 

Phe He Phe Gly Arg Val Ser Asn Gly Asn Tyr Val He Lys Gin Pro 
385 390 395 400 

Val Asp Tyr Val 



<210> 132 
<211> 2839 
<212> DNA 
<213> SHRIMP 



<400> 132 

atggattcaa 

aatgaaaacc 

agaactgaca 

agtttccttc 

caatcagccc 

aggatagact 

tactttcttg 

aagaaaatat 

tacaatgaag 

ggtttggtgc 

atgacggatt 

aacaagttgt 

gtgcctcgta 

tttagattcc 

tttggaggtg 

agtgcatgta 

aacgacaaca 

ccctgtgaac 

attgttaaag 

aggacaccta 

atttctaaac 

tgggttgctc 

gagatccttc 

atggttgttc 

gcagttcatt 

gacatccatg 

ctgtttaaga 

gaggacaatg 

aggagaagaa 

acgttcagat 

acccaagtgt 

gctgcggtta 

aaaagacatc 

atatctctag 

ataatttcaa 

atgaccaagg 

tactcgttgt 

ctacgtatat 

tcaaaatcat 

cctatacctt 

actgcaaata 

tttaaagagg 

gttggagaag 

ctagtgtgta 

aagcctagaa 

agcacgactg 



cttctactac 
tgacatggga 
tgagactgaa 
aagatttctt 
tgtaccacat 
actacaacaa 
tcgatgtgaa 
ggcaacgttt 
tgagcaaatt 
atctttcgga 
ttgactttta 
cgggtatgtt 
gattggaaag 
ctagcaacca 
ccttttcatg 
gagaaaacgt 
aaactaaaaa 
tccaatatgc 
cagtgaaatt 
tcgccgaaca 
ttttgggacg 
acaagtataa 
tgaagaaatt 
cagtaatagg 
gcattgagaa 
aagaccctaa 
acaacgacgt 
tacaagactt 
aaatccacga 
ttggcgtctg 
ctcttcagct 
attggaccag 
aaaattcact 
cctctaaatt 
aattcaatgc 
ctggtttggt 
gtggaggatt 
cagctgaggc 
acaacgaaga 
tatttggcgt 
aatgtaatga 
cattagtttc 
atgaggatta 
cactgtatga 
agcatgatcc 
atttttcccc 



aactatagaa 
attcgtagac 
aacatcatcc 
cactgctcgt 
gttggatgga 
gaagatggag 
gacaatagaa 
atatgctgaa 
attgcccgat 
tactcttaaa 
tgaaaaatac 
tagtgagaat 
gtactttaac 
catcagggga 
tgtacaactt 
actggatact 
ttcatcctcc 
aagaataaac 
tgtagcagat 
cgaatcagat 
aaaggtgtct 
acagagtttt 
gacagttgct 
ctttggttca 
tgttataggg 
aaagttggaa 
gatgaaacga 
cttggaagaa 
agaggaatac 
tggctttcaa 
actaaaacaa 
attacttcaa 
tccttggaat 
cataaagaga 
agatattctt 
catagaggat 
taataaccag 
attaaaaaat 
caggccattt 
ggaaccttta 
tagtgtatca 
tcatacaatt 
ttgtgataac 
tatgatgggg 
tacaccatgg 
caactctgtt 



gcagaaaagg 
agagttatac 
agaagactgt 
gatggagtaa 
gttgcttcca 
agattggcct 
tctagacacg 
aaaattgctc 
gaagcaatgg 
aatgcaaaaa 
ataaggtcag 
tttgaaatcc 
gttgaaacga 
ttaatatttg 
tacctgctcg 
cccttctcaa 
aatgaagata 
tctaatgaca 
agagtggaga 
ggctacatgg 
gcttcttatg 
acgaacgatg 
tgtgggctca 
gggatgacaa 
tcttttatct 
gaaatgtccc 
gggcaagatg 
ttaaagacaa 
acaaaatctc 
caccctcttc 
cgtcaaacat 
tttttgttcc 
aggctaggtt 
tctgtgcact 
cctctcggta 
catgcgaggg 
acaacagagc 
gcaagaaatt 
ttacctcgca 
caccctttac 
gatttctggt 
ctaacagatt 
aataagagtg 
agagccaact 
agcagtaaaa 
attgttctat 



cactcctaaa 
ggcacgagaa 
tttcgttcat 
atagtgacga 
taatatcttg 
gcaccagtat 
ttgaactact 
cagaaaaagt 
caaactacaa 
agcccccaac 
atattgtatt 
ttccagacat 
attatagtct 
cttatttcat 
gtttcactct 
aattgaaaca 
atgacgggga 
aaaatgcgtg 
aagcatccgt 
cagactggct 
ccttgttatt 
ttaatggaag 
catacaacca 
acagaaaatt 
cttcggggaa 
tgatgcagct 
ggaaggtaac 
aggagtttgt 
tacacaccaa 
cagctagtag 
tcgtccagag 
cttctgacga 
ccaatttaaa 
gtgaacgtgt 
aagacccaga 
agaatatcga 
aaaagttaaa 
gtgttcttgc 
cggatgaggc 
tcaattcatt 
tagaagagag 
ccagtgtgta 
gaaaacgcat 
ataatggtct 
atactggcca 
tagacactga 



agaatacgta 
acttatgcag 
ctctatatac 
gtggtgtacc 
cttcaggaag 
acgtgaagga 
agatcctgat 
tgtcgatgca 
ctacagaact 
tgatttaacg 
gggaaaatca 
aaacattaaa 
ggagcacaac 
tggtaatatt 
ttctgcggct 
gtatataaag 
agaatattac 
cagaaaaagt 
cacaatgatg 
ctcccttcaa 
tatagtgaac 
tgaaaaatac 
caagtgcggc 
gagacagtat 
gaggaagaag 
ttctgcgcgc 
gttcgcgaat 
gcccaatgaa 
tctaaagatg 
tgacaagcca 
agaaactgca 
gagggataat 
tagacatttc 
tgtcaatgat 
ccattttcta 
taatgccatg 
cagtatacgt 
aacaaccttt 
aaagtttgtg 
tattgataat 
tgacgatatt 
ctcaactctt 
tggtaataca 
acattcagac 
gtctggacgt 
gaatgttgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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gatgactatg aagatgagga ggaagattat gaagctttga aacaatcaga gagggataat 2820 
gtcatcacac taaataacg 2839 

<210> 133 
<211> 945 
<212> PRT 
<213> SHRIMP 

<220> 

<221> VARIANT 

<222> (1)...{945) 

<223> Xaa = Any Amino Acid 



<400> 133 



Met 


Asp 


Ser 


Thr 


Ser 


Thr 


Thr 


Thr 


He 


Glu 


Ala 


Glu 


Lys 


Ala 


Leu 


Leu 


1 








5 










10 










15 




Lys 


Glu 


Tyr 


Val 


Asn 


Glu 


Asn 


Leu 


Thr 


Trp 


Glu 


Phe 


Val 


Asp 


Arg 


Val 








20 










25 










30 






He 


Arg 


His 


Glu 


Lys 


Leu 


Met 


Gin 


Arg 


Thr 


Asp 


Met 


Arg 


Leu 


Lys 


Thr 






35 










40 










45 








Ser 


Ser 


Arg 


Arg 


Leu 


Phe 


Ser 


Phe 


He 


Ser 


He 


Tyr 


Ser 


Phe 


Leu 


Gin 




50 










55 










60 










Asp 


Phe 


Phe 


Thr 


Ala 


Arg 


Asp 


Gly 


Val 


Asn 


Ser 


Asp 


Glu 


Trp 


Cys 


Thr 


65 










70 










75 










80 


Gin 


Ser 


Ala 


Leu 


Tyr 


His 


Met 


Leu 


Asp Gly 


Val 


Ala 


Ser 


He 


He 


Ser 










85 










90 










95 




Cys 


Phe 


Arg 


Lys 


Arg 


He 


Asp 


Tyr 


Tyr 


Asn 


Lys 


Lys 


Met 


Glu 


Arg 


Leu 








100 










105 










110 






Ala 


Cys 


Thr 


Ser 


He 


Arg 


Glu 


Gly 


Tyr 


Phe 


Leu 


Val 


Asp 


Val 


Lys 


Thr 






115 










120 










125 








He 


Glu 


Ser 


Arg 


His 


Val 


Glu 


Leu 


Leu 


Asp 


Pro 


Asp 


Lys 


Lys 


He 


Trp 




130 










135 










140 










Gin 


Arg 


Leu 


Tyr 


Ala 


Glu 


Lys 


He 


Ala 


Pro 


Glu 


Lys 


Val 


Val 


Asp 


Ala 


145 










150 










155 










160 


Tyr 


Asn 


Glu 


Val 


Ser 


Lys 


Leu 


Leu 


Pro 


Asp 


Glu 


Ala 


Met 


Ala 


Asn 


Tyr 










165 










170 










175 




Asn 


Tyr 


Arg 


Thr 


Gly 


Leu 


Val 


His 


Leu 


Ser 


Asp 


Thr 


Leu 


Lys 


Asn 


Ala 








180 










185 










190 






Lys 


Lys 


Pro 


Pro 


Thr 


Asp 


Leu 


Thr 


Met 


Thr 


Asp 


Phe 


Asp 


Phe 


Tyr 


Glu 






195 










200 










205 








Lys 


Tyr 


He 


Arg 


Ser 


Asp 


He 


Val 


Leu 


Gly 


Lys 


Ser 


Asn 


Lys 


Leu 


Ser 




210 










215 










220 










Gly 


Met 


Phe 


Ser 


Glu 


Asn 


Phe 


Glu 


He 


Leu 


Pro 


Asp 


He 


Asn 


He 


Lys 


225 










230 










235 










240 


Val 


Pro 


Arg 


Arg 


Leu 


Glu 


Arg 


Tyr 


Phe 


Asn 


Val 


Glu 


Thr 


Asn 




Ser 










245 










250 










255 




Leu 


Glu 


His 


Asn 


Phe 


Arg 


Phe 


Pro 


Ser 


Asn 


His 


He 


Arg 


Gly 


Leu 


He 








260 










265 










270 






Phe 


Ala 


Tyr 


Phe 


He 


Gly 


Asn 


He 


Phe 


Gly 


Gly Ala 


Phe 


Ser 


Cys 


Val 






275 










280 










285 








Gin 


Leu 


Tyr 


Leu 


Leu 


Gly 


Phe 


Thr 


Leu 


Ser 


Ala 


Ala 


Ser 


Ala 


Cys 


Arg 




290 










295 










300 










Glu 


Asn 


Val 


Leu 


Asp 


Thr 


Pro 


Phe 


Ser 


Lys 


Leu 


Lys 


Gin 


Tyr 


He 


Lys 


305 










310 










315 










320 


Asn 


Asp 


Asn 


Lys 


Thr 


Lys 


Asn 


Ser 


Ser 


Ser 


Asn 


Glu 


Asp 


Asn 


Asp 


Gly 










325 










330 










335 




Glu 


Glu 


Tyr 


Tyr 


Pro 


Cys 


Glu 


Leu 


Gin 


Tyr 


Ala 


Arg 


He 


Asn 


Ser 


Asn 








340 










34 5 










350 






Asp 


Lys 


Asn 


Ala 


Cys 


Arg 


Lys 


Ser 


lie 


Val 


Lys 


Ala 


Val 


Lys 


Phe 


Val 






355 










360 










365 








Ala 


Asp 


Arg 


Val 


Glu 


Lys 


Ala 


Ser 


Val 


Thr 


Met 


Met 


Arg 


Thr 


Pro 


He 
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370 










375 




Ala 


Glu 


His 


Glu 


Ser 


Asp 


Gly 


Tyr 


385 










390 






He 


Ser 


Lys 


Leu 


Leu 


Gly 


Arg 


Lys 










405 








Phe 


He 


Val 


Asn 


Trp 


Val 


Ala 


His 








420 










Asp 


Val 


Asn 


Gly 


Ser 


Glu 


Lys 


Tyr 






435 










440 


Val 


Ala 


Cys 


Gly 


Leu 


Thr 


Tyr 


Asn 




450 










455 




Val 


He 


Gly 


Phe 


Gly 


Ser 


Gly 


Met 


465 










470 






Ala 


Val 


His 


Cys 


lie 


Glu 


Asn 


Val 










485 








Lys 


Arg 


Lys 


Lys 


Asp 


He 


His 


Glu 








500 










Ser 


Leu 


Met 


Gin 


Leu 


Ser 


Ala 


Arg 






515 










520 


Lys 


Arg 


Gly 


Gin 


Asp 


Gly 


Lys 


Val 




530 










535 




Gin 


Asp 


Phe 


Leu 


Glu 


Glu 


Leu 


Lys 


545 










550 






Arg 


Arg 


Arg 


Lys 


He 


His 


Glu 


Glu 










565 








Asn 


Leu 


Lys 


Met 


Thr 


Phe 


Arg 


Phe 








580 










Leu 


Pro 


Ala 


Ser 


Ser 


Asp 


Lys 


Pro 






595 










600 


Lys 


Gin 


Arg 


Gin 


Thr 


Phe 


Val 


Gin 




610 










615 




Trp 


Thr 


Arg 


Leu 


Leu 


Gin 


Phe 


Leu 


625 










630 






Lys 


Arg 


His 


Gin 


Asn 


Ser 


Leu 


Pro 










64 5 








Asn 


Arg 


His 


Phe 


He 


Ser 


Leu 


Ala 








660 










His 


Cys 


Glu 


Arg 


Val 


Val 


Asn 


Asp 






675 










680 


He 


Leu 


Pro 


Leu 


Gly 


Lys 


Asp 


Pro 




690 










695 




Gly 


Leu 


Val 


He 


Glu 


Asp 


His 


Ala 


705 










710 






Tyr 


Ser 


Leu 


Cys 


Gly 


Gly 


Phe 


Asn 










725 








Asn 


Ser 


He 


Arg 


Leu 


Arg 


He 


Ser 








740 










Asn 


Cys 


Val 


Leu 


Ala 


Thr 


Thr 


Phe 






755 










760 


Pro 


Phe 


Leu 


Pro 


Arg 


Thr 


Asp 


Glu 




770 










775 




Phe 


Gly 


Val 


Glu 


Pro 


Leu 


His 


Pro 


785 










790 






Thr 


Ala 


Asn 


Lys 


Cys 


Asn 


Asp 


Ser 










805 








Ser 


Asp 


Asp 


He 


Phe 


Lys 


Glu 


Ala 








820 










Asp 


Ser 


Ser 


Val 


Tyr 


Ser 


Thr 


Leu 






835 










840 


Asp 


Asn 


Asn 


Lys 


Ser 


Gly 


Lys 


Arg 




850 










855 





380 



Met 


Ala 


Asp 


Trp 


Leu 


Ser 


Leu 


Gin 






395 










400 


Val 


Ser 


Ala 


Ser 


Tyr 


Ala 


Leu 


Leu 




410 










415 




Lys 


Tyr 


Lys 


Gin 


Ser 


Phe 


Thr 


Asn 


425 










430 






Glu 


He 


Leu 


Leu 


Lys 


Lys 


Leu 


Thr 










445 








His 


Lys 


Cys 


Gly 


Met 


Val 


Val 


Pro 








460 










Thr 


Asn 


Arg 


Lys 


Leu 


Arg 


Gin 


Tyr 






475 










480 


He 


Gly 


Ser 


Phe 


He 


Ser 


Ser 


Gly 




490 










495 




Asp 


Pro 


Lys 


Lys 


Leu 


Glu 


Glu 


Met 


505 










510 






Leu 


Phe 


Lys 


Asn 


Asn 


Asp 


Val 


Met 










525 








Thr 


Phe 


Ala 


Asn 


Glu 


Asp 


Asn 


Val 








540 










Thr 


Lys 


Glu 


Phe 


Val 


Pro 


Asn 


Glu 






555 










560 


Glu 


Tyr 


Thr 


Lys 


Ser 


Leu 


His 


Thr 




570 










575 




Gly 


Val 


Cys 


Gly 


Phe 


Gin 


His 


Pro 


585 










590 






Thr 


Gin 


Val 


Ser 


Leu 


Gin 


Leu 


Leu 










605 








Arg 


Glu 


Thr 


Ala 


Ala 


Ala 


Val 


Asn 








620 










Phe 


Pro 


Ser 


Asp 


Glu 


Arg 


Asp 


Asn 






635 










640 


Trp 


Asn 


Arg 


Leu 


Gly 


Ser 


Asn 


Leu 




650 










655 




Ser 


Lys 


Phe 


He 


Lys 


Arg 


Ser 


Val 


665 










670 






He 


He 


Ser 


Lys 


Phe 


Asn 


Ala 


Asp 










685 








Asp 


His 


Phe 


Leu 


Met 


Thr 


Lys 


Ala 








700 










Arg 


Glu 


Asn 


He 


Asp 


Asn 


Ala 


Met 






715 










720 


Asn 


Gin 


Thr 


Thr 


Glu 


Gin 


Lys 


Leu 




730 










735 




Ala 


Glu 


Ala 


Leu 


Lys 


Asn 


Ala 


Arg 


745 










750 






Ser 


Lys 


Ser 


Tyr 


Asn 


Glu 


Asp 


Arg 










765 








Ala 


Lys 


Phe 


Val 


Pro 


He 


Pro 


Leu 








780 










Leu 


Leu 


Asn 


Ser 


Phe 


He 


Asp 


Asn 






795 










800 


Val 


Ser 


Asp 


Phe 


Trp 


Leu 


Glu 


Glu 




810 










815 




Leu 


Val 


Ser 


His 


Thr 


He 


Leu 


Thr 


825 










830 






Val 


Gly 


Glu 


Asp 


Glu 


Asp 


Tyr 


Cys 










845 








He 


Gly 


Asn 


Thr 


Leu 


Val 


Cys 


Thr 



860 
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Leu Tyr Asp Met Met Gly Arg Ala 
865 870 
Lys Pro Arg Lys His Asp Pro Thr 
885 

Gin Ser Gly Arg Ser Thr Thr Asp 
900 

Leu Leu Asp Thr Glu Asn Val Ala 
915 920 
Asp Tyr Glu Ala Leu Lys Gin Ser 
930 935 

Xaa 
945 



Asn Tyr Asn Gly Leu His Ser Asp 
875 880 
Pro Trp Ser Ser Lys Asn Thr Gly 

890 895 
Phe Ser Pro Asn Ser Val lie Val 
905 910 
Asp Asp Tyr Glu Asp Glu Glu Glu 
925 

Asp Asn Val lie Thr Leu Asn Asn 
940 



<210> 134 
<211> 4821 
<212> DNA 
<213> SHRIMP 



<400> 134 

atgtctcgaa 

cctttcgacc 

gacatcatag 

aacacatttg 

tgcattagct 

actgaatacg 

actatttttt 

gcctacagaa 

ggaggagtca 

tgtgtagatt 

gagcccattc 

atacacccta 

agtatgactt 

gccaccaacg 

ctaggctgcc 

ctttgtgatg 

gcggcaaaaa 

acagaaaggt 

aacgatacga 

cctgaaatta 

gccatggacg 

ctcacagtca 

gctcacatct 

aattttactg 

cccggtttag 

atcttccccg 

agggatgcgc 

gggtacgatt 

atatgcgcaa 

gctaaaaatg 

aagcgaatgg 

gaaagagacc 

aatagttttg 

gaaggaaaag 

aatgccaaat 

aagaaaagtt 

ataattgatc 

gcatatgaaa 

tttttctcgg 

agggtcgcaa 

gatggagttt 



actctttacg 
ccttatatgt 
ggaaaggttt 
actgttttaa 
gtcgttctct 
gtatagaaga 
cagcctcttc 
acgtcaaccc 
taactggagg 
cgtctgtggt 
ttgggtgcac 
caacattctc 
ctactatgga 
ccccaaaatg 
ctgtcatgtt 
aaagtacaca 
cagattatgc 
acgaaacttt 
ggatgtcggc 
cagcgtttaa 
ccgccggcaa 
gtgaatctgt 
ggaacggaga 
aacatccaaa 
gaatcgatag 
agacggttca 
acaacgatac 
cgtacaaaga 
cagcatacgt 
ataaaacagt 
cccacactcc 
aatcttttgc 
acgacgttac 
agaggtctgg 
tcaaagaaat 
ccacctttgc 
at tacaaggc 
tgtttgctca 
gaagaggact 
aaacgttatc 
taatgaatgg 



cgtcaaagga 
cgatacggac 
tgtagaatct 
aactcctaaa 
cgggggagga 
tgatgatgaa 
tgcgttcgac 
ttttactact 
aggaaaaaag 
gcgaaaggat 
agatattgta 
aattattgat 
ttgtttgtgc 
tgagaagaaa 
caggtatggg 
aatcagactg 
tcaacccttt 
aggtggtagt 
tctcggctat 
cggaggccat 
ttggcactca 
aggaggtgta 
tgcactaaac 
cgcatcgctc 
catagatcat 
catgagagca 
gaccatagaa 
taacccctta 
cgttccttca 
aaataagata 
catcgagaca 
gaacagtaca 
aaatttactt 
cctgcccatc 
tggaggtatt 
aagattcagt 
tggttcatct 
ccctccaact 
aaataatgga 
aaaaaagccc 
tgcatcttca 



ttaaaagaaa 
gcccccttcg 
ctattaccag 
aaatgtcgcg 
acttgcgtca 
tatgacggag 
aagcacgacg 
gtcgaggagg 
ggaacgactt 
ccttcccttc 
ctgtgtggag 
gatgtggacg 
gaacctggct 
gaaggtggta 
gttgtaggag 
gaagaagtgg 
gtagagggtg 
actaaggacg 
tcttatgccg 
ctgattacgg 
agaattgaag 
gtgccctaca 
gataatgggc 
agggtcgtgc 
gctgtgggca 
ggcgatcctt 
acatccttcc 
cagaaactga 
ctccatagag 
ttgcccttgg 
attttcaagc 
ttaaattcta 
ctggactatt 
aatacaagat 
attctgcaac 
gagaaaatat 
gcagtgttta 
gcgtggcgca 
atagaaggta 
gatatttttg 
cctcttgtaa 



atgggggaat 
gtatggctgg 
gggaaattag 
taggaggcaa 
aatccagtag 
tttgtgtccc 
atgatgttgc 
cttatttgca 
atatcaccaa 
tcaataaaga 
gaaaaggagt 
atatcgattt 
actcccagca 
tacaggaaaa 
atactggaac 
ccggtataga 
caaaattatt 
cgtgcctccc 
cttctctttt 
gaggattatt 
attctgacga 
gtggaacagg 
tggttggagc 
ctttacccca 
taatagcgtc 
caggagtaaa 
ttaaagattc 
gaaaatctca 
taataaagga 
tacactatag 
actctctttt 
tgatggttac 
ttttccccaa 
caatatacaa 
ctgttacagc 
tgtctacaaa 
agaaagtggg 
tagcttctaa 
cgggtatgcg 
caggagcaat 
gacctatgga 



tat tcccaac 
agtcaagtca 
ttcccattat 
cgattttgaa 
agaattgaaa 
tctcgccgat 
tacagatgca 
ctacgaatct 
aaaaagaggg 
tccaagatta 
ggggcgacct 
cgacataagc 
gagagaccct 
ggaacaaggt 
aaagggctgt 
tcttcccgat 
gctgcaaata 
tagaccagga 
tggaagagca 
gcgcgaatct 
acaaggaaaa 
aagtgtggcc 
tggaggggga 
tagcaatatt 
ccagggtaaa 
gactgacaga 
tgacaaggca 
cgactctggc 
aaaaccttct 
acctaccgct 
gacggcacaa 
aaattctagt 
cttgaacggc 
tgaaccaaac 
gcagggtgct 
ttctcccaaa 
ggaaaaagaa 
tgaagggaca 
cgaggccgaa 
cctcactgga 
aattcctgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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tcttctctcc 
cctggttcgg 
gacattagag 
ccagcacgac 
gctacatcat 
attcaaaaga 
gtccctctaa 
gaaggagccc 
aggactcttt 
gattggggtc 
tgtgtcaggg 
tacactacaa 
ttttctataa 
aagagtattt 
ttgggagaaa 
gggggtagga 
cgttttcccc 
agagctcaat 
agaaagataa 
ttggccccta 
agacaagaaa 
gcagaattgt 
gattatgaac 
gatgcagcaa 
tcttctcatg 
ggtgtcaagg 
acaatatcgg 
agtgaagttg 
ggtagtacaa 
gaagttttca 
tcatcatcaa 
aaaagagtcc 
ggaggattcg 
ataacctatg 
ggggtaagag 
gtagggggac 
atgagcagat 
gatagattcc 
actatacaac 
aggacaattt 



ccgaacacac 
ccgacaactt 
ctattctaaa 
ccttttctaa 
ttttgggagt 
gtgtccaaga 
aattccatga 
tcaatttcat 
cctctgaaac 
tgagccccca 
aagaaactgg 
tgatggtccc 
agagttatgt 
ttgagatgca 
aggagaactg 
acaaaactga 
tcatgactac 
ccggagacgc 
acctaaaagc 
ataaattttg 
aggtacctgc 
taggggatgt 
gcttgccggg 
aaacgggtca 
gagttactgc 
atatagtgag 
ccgattctgc 
tggtggaagt 
agaagaggag 
caatcaataa 
agtatgtaga 
ctagagtcat 
aatcttgcgc 
cagatttcat 
ttaaaggccc 
ctaatctaag 
attattatac 
cctttgtttg 
ctttagcact 
caatggagta 



ctggttcgaa 
gacagccatc 
ttcgacaaca 
gcctctcgct 
tttaggggga 
atcagtctcg 
aggggatgaa 
ccctcctcct 
gtttattagg 
cacggccggc 
ccaatctttc 
caaacccctg 
agaggaacaa 
agatcctaat 
taattgcacc 
ccctatagcg 
cctccctaaa 
tagaatacta 
tgcaacagaa 
tgccatgcgt 
tgctgttttg 
tggtatgaca 
cgtgcctcct 
agagggtgca 
ctctacattt 
cctgtatgca 
gacaataaat 
agttaacagc 
attatctatt 
gcaaggaaag 
gggaggacaa 
gagaagagtc 
acctacagct 
gagaccttca 
agaaccattt 
gaaatttgca 
aagagaaaaa 
tcaatcagat 
tgtggacatg 
a 



cgcagatcgc 
aacaacactt 
gacattaaaa 
cctcctgcgg 
tttcctttac 
aacggcacgg 
ttatggcaac 
atggctctct 
ccagaattga 
cactacttga 
ggatacccgt 
ggcccgcaat 
acacggctac 
agtaaaaaca 
aacggtttat 
gcaacaccta 
aacgatgttc 
aatactatcg 
aatatatggg 
aggagtacgg 
gacgaaagga 
gatattgtat 
gcggaggctg 
aaggctagaa 
aacgtgggaa 
acgccttgct 
gaaggcgcaa 
aatatggaaa 
tcagattatg 
gctacagaaa 
aaagacatgg 
catgtacttc 
gctcagagtg 
gatcttagtg 
gatgatcttt 
caccatcacc 
actgtttctg 
agaggaccat 
ggagtgttgc 



cggtcaacgc 
atgatcgagt 
cctccttcaa 
gagtgtctgc 
ccataccttc 
tcgggtccat 
aatgtgaagt 
ttgagtcgct 
tccctaatcg 
acggcgtgta 
gctcgggtgc 
cacattcttc 
ttcctgccaa 
tattcgataa 
tctgtcccaa 
gcagaggaaa 
atctctctgc 
gagaaactaa 
atatatcttc 
cttacactcc 
agggtacgtt 
ctaatgacat 
aaattttcca 
ggatagtgga 
cattttcccc 
ttacagacat 
gcattgaacc 
tgttaggagg 
tggatttgga 
atctccgcgt 
tcggtttcta 
ctgtattgac 
cttgtacaag 
gaacaaaaac 
caaccctcta 
attttggata 
tttcagaggg 
tcccaccaaa 
cagaaggcgc 



gagaggagac 
aacaaaggga 
ttcgtacgct 
ggctgctcaa 
cagttttctc 
gcatggaata 
caaggaaact 
actcagggtt 
gttcagagca 
ctccccgcct 
cctctctcag 
gttatctaaa 
tattggagaa 
aatcggggaa 
agtgaatgga 
taggcattca 
agcactcctg 
aactaatgga 
caatgttatg 
ttattccaca 
cgataggaac 
tttaatggag 
cattataagg 
ctttttcgag 
ctacgttgag 
agattcgcca 
aactgatggg 
aagcactgcc 
agaagatgcg 
tagaacatca 
tgaagcgagt 
accctaccat 
aggtgtggaa 
aacattggag 
ctttagaagc 
tgaaggatta 
tgatttgaag 
gagggatgga 
actcacaagg 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4821 



<210> 135 
<211> 1600 
<212> PRT 
<213> SHRIMP 



<400> 135 



50 



65 



Arg 


Asn. 


Ser 
5 


Leu 


Arg 


Val 


Lys 


Gly 
10 


Leu 


Lys 


Glu 


Asn 


Gly 
15 


Gly 


Pro 


Asn 
20 


Pro 


Phe 


Asp 


•Pro 


Leu 

25 


Tyr 


Val 


Asp 


Thr 


Asp 
30 


Ala 


Pro 


Met 


Ala 


Gly 


Val 


Lys 


Ser 


Asp 


He 


He 


Gly 


Lys 


Gly 


Phe 


Val 


35 










40 










45 








Leu 


Leu 


Pro 


Gly 


Glu 
55 


He 


Ser 


Ser 


His 


Tyr 
60 


Asn 


Thr 


Phe 


Asp 


Lys 


Thr 


Pro 


Lys 
70 


Lys 


Cys 


Arg 


Val 


Gly 
75 


Gly 


Asn 


Asp 


Phe 


Glu 
80 


Ser 


Cys 


Arg 
85 


Ser 


Leu 


Gly 


Gly 


Gly 
90 


Thr 


Cys 


Val 


Lys 


Ser 
95 


Ser 


Leu 


Lys 
100 


Thr 


Glu 


Tyr 


Gly 


He 
105 


Glu 


Asp 


Asp 


Asp 


Glu 
110 


Tyr 


Asp 
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Gly 


Val 


Cys 


Val 


Pro 


Leu 


Ala 


Asp 






115 










120 


Phe 


Asp 


Lys 


His 


Asp 


Asp 


Asp 


Val 




130 










135 




Val 


Asn 


Pro 


Phe 


Thr 


Thr 


Val 


Glu 


145 










150 






Gly 


Gly 


Val 


He 


Thr 


Gly 


Gly 


Gly 










165 








Lys 


Lys 


Arg 


Gly 


Cys 


Val 


Asp 


Ser 








180 










Leu 


Leu 


Asn 


Lys 


Asp 


Pro 


Arg 


Leu 






195 










200 


He 


Val 


Leu 


Cys 


Gly 


Gly 


Lys 


Gly 




210 










215 




Thr 


Phe 


Ser 


He 


He 


Asp 


Asp 


Val 


225 










230 






Ser 


Met 


Thr 


Ser 


Thr 


Met 


Asp Cys 










245 








Gin 


Arg 


Asp 


Pro 


Ala 


Thr 


Asn 


Ala 








260 










Gly 


He 


Gin 


Glu 


Lys 


Glu 


Gin 


Gly 






275 










280 


Tyr 


Gly 


Val 


Val 


Gly 


Asp 


Thr 


Gly 




290 










295 




Ser 


Thr 


Gin 


He 


Arg 


Leu 


Glu 


Glu 


305 










310 






Ala 


Ala 


Lys 


Thr Asp 


Tyr 


Ala 


Gin 










325 








Leu 


Leu 


Gin 


He 


Thr 


Glu 


Arg 


Tyr 








340 










Asp 


Ala 


Cys 


Leu 


Pro 


Arg 


Pro 


Gly 






355 










360 


Gly 


Tyr 


Ser 


Tyr 


Ala 


Ala 


Ser 


Leu 




370 










375 




Ala 


Phe 


Asn 


Gly Gly 


His 


Leu 


He 


385 










390 






Ala 


Met 


Asp 


Ala 


Ala 


Gly 


Asn 


Trp 










405 








Glu 


Gin 


Gly 


Lys 


Leu 


Thr 


Val 


Ser 








420 










Tyr 


Ser 


Gly 


Thr 


Gly 


Ser 


Val 


Ala 






435 










440 


Leu 


Asn 


Asp 


Asn 


Gly 


Leu 


Val 


Gly 




450 










455 




His 


Pro 


Asn 


Ala 


Ser 


Leu 


Arg 


Val 


465 










470 






Pro 


Gly 


Leu 


Gly 


He 


Asp 


Ser 


He 










485 








Ser 


Gin 


Gly 


Lys 


He 


Phe 


Pro 


Glu 








500 










Pro 


Ser 


Gly 


Val 


Lys 


Thr 


Asp 


Arg 






515 










520 


lie 


Glu 


Thr 


Ser 


Phe 


Leu 


Lys 


Asp 




530 










535 




Tyr 


Lys 


Asp 


Asn 


Pro 


Leu 


Gin 


Lys 


545 










550 






He 


Cys 


Ala 


Thr 


Ala 


Tyr 


Val 


Val 










565 








Glu 


Lys 


Pro 


Ser 


Ala 


Lys 


Asn 


Asp 








580 










Leu 


Val 


His 


-Tyr 


Arg 


Pro 


Thr 


Ala 



Thr 


lie 


rne 




c\±.a 
















IPS 

X C. J 








Ala 


Thr 


Asp 


A -La. 


n±a 


Tyr 


Arg 


Asn 


















Glu 


Ala 


Tyr 


Leu 


HIS 


Tyr 


IjXU 


Ser 


















Lys 


Lys 


Gly 


rp -> 

Tnr 


Tnr 


Tyr 


xj.e 


inr 




1 /u 














Ser 


Val 


vai ■ 


Arg 


Lys 


Asp 


fro 




185 










i on 
i yu 






Glu 


Pro 


He 


Leu 


Gly 


Cys 


Thr 


Asp 


















Val 


Gly 


Arg 


Pro 


lie 


His 


Pro 


rnr 








220 










Asp 


Asp 


He 


Asp 


Phe 


Asp 


lie 


Ser 






235 












Leu 


Cys 


Glu 


Pro 


Gly 


Tyr 


ser 


uxn 




250 










*> ii c 




Pro 


Lys 


Cys 


Glu 


Lys 


Lys 


bill 


Giy 


265 










^ "7 A 
/U 






Leu 


Gly 


Cys 


Pro 


Val 


Met 


Pne 


Arg 










"5 O C 
ZOJ 








Thr 


Lys 


Gly 


Cys 


Leu 


Cys 


Asp 


LiJ.U 








■3 n rt 
300 










Val 


Ala 


Gly 


He 


Asp 


Leu 


Pro 


Asp 






315 












Pro 


Phe 


Val 


Glu 


Gly 


Ala 


Lys 


Leu 




330 










o o c 




Glu 


Thr 


Leu 


Gly 


Gly 


Ser 


Thr 


Lys 


345 










*3 C f\ 






Asn 


Asp 


Thr 


Arg 


Met 


Ser 


Ala 


Leu 










o c c 








Phe 


Gly 


Arg 


Ala 


Pro 


Glu 


He 


Thr 








380 










Thr 


Gly 


Gly 


Leu 


Leu 


Arg 


Glu 


Ser 






395 










400 


His 


Ser 


Arg 


He 


Glu 


Asp 


Ser 


Asp 




410 










415 




Glu 


Ser 


Val 


Gly 


Gly 


Val 


Val 


Pro 


425 
















Ala 


His 


He 


Trp 


Asn 


Gly 


Asp 


Ala 










A A C 

IT J 








Ala 


Gly 


Gly 


Gly 


Asn 


Phe 


Thr 


Glu 








4 60 










Val 


Pro 


Leu 


Pro 


His 


Ser 


Asn 


He 






475 










480 


Asp 


His 


Ala 


Val 


Gly 


He 


He 


Ala 




4 90 










495 




Thr 


Val 


His 


Met 


Arg 


Ala 


Gly Asp 


jUj 










Z) J. \J 






Arg 


Asp 


Ala 


His 


Asn 


Asp 


Thr 


Thr 










525 








Ser 


Asp 


Lys 


Ala 


Gly 


Tyr 


Asp 


Ser 








540 










Leu 


Arg 


Lys 


Ser 


His 


Asp 


Ser 


Gly 






555 










560 


Pro 


Ser 


Leu 


His 


Arg 


Val 


He 


Lys 




570 










57 5 




Lys 


Thr 


val 


Asn 


Lys 


He 


Leu 


Pro 


585 










590 






Lys 


Arg 


Met 


Ala 


His 


Thr 


Pro 


He 
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595 600 605 



L» J.U 


i nr 


lie 


rne 


Lys 


nlS 


OC1 


Leu 


Leu 


inr 




fZl n 


UlU 


Arg 


Asp 


oj.n 




bl U 




















oz u 










Car- 


rue 


Hid 


Asn 




i nr 


Leu 


Asn 


oer 




we t 


vdl 


i nr 


Asn 


OCX 


OCX 


be j 










DjU 










uj O 










£4 n 
D1U 


Asn 


Cor- 


irne 


Asp 


Asp 


Vdl 


inr 


Asn 


Leu 


Leu 


Leu 


Asp 


Tyr 


rne 


irne 


fro 




















£ C A 

too 










ODD 




Asn 


Leu 


Asn 


(jiy 


Tin 




i»ys 


ulu 


Arg 


Ser 


(jxy 


Leu 


xrro 


x le 


Asn 


Thy 

i nr 








ofoU 










DOj 










b /U 






Arg 




lie 


Tyr 


Asn 


blu 


Pro 


Asn 


Asn 


ax a 


Lys 


rne 


Lys 


bill 


116 


r* i \i 
oiy 






/•7C 

D /3 










boU 










cot 
DOS 










T 1 A 

lie 


lie 


Leu 


bin 


Pro 


Val 


Thr 


Aia 


uin 


tjxy 


Ala 


Lys 


Lys 


oer 


oer 




690 










695 










"7 A A 










Tnr 


Pne 


Ala 


Arg 


Pne 


Ser 


Glu 


Lys 


lie 


Leu 


b€£ 


inr 


Asn 


oer 


Pro 


Lys 


f\JD 










A 1U 










/Id 










^ O A 


lie 


T 1 A 

lie 


Asp 


His 


Tyr 


Lys 


Ala 


Gly 


Ser 


Ser 


Ala 


vai 


pne 


Lys 


Lys 


vai 




















"7 *3 A 

/ oO 










/ 






pi 

(jlU 


Lys 


Glu 


a j. a 


Tyr 


Glu 


Met 


OK a. 

rne 


Aia 


nlS 


Pro 


Pro 


x nr 


nld 


Trp 








740 










~1 A C 

/4o 










/ DU 






Arg 


lie 


Asn 


Glu 


(aiy 


Thr 


Phe 


Phe 


£>er 




Arg 


\j±n 


Asn 


tsxy 


T1 A 

xxe 


IsXU 






755 










760 










/ ob 








Gly 


Thr 


Gly 


Met 


Arg 


Glu 


Ala 


Glu 


Arg 


vai 


Ala 


Lys 


i nr 


Leu 


Ser 


Lys 




770 










775 










"J O A 










Lys 


Pro 


Asp 


lie 


Pne 


Ala 


Gly Ala 


lie 


Leu 


Thr 


Gly 


Asp 


biy 


val 


Leu 


785 










790 










7 95 










800 


Met 


Asn 


Gly Ala 


Ser 


Ser 


Pro 


Leu 


val 


Arg 


Pro 


Met 


Glu 


lie 


Pro 


Ala 










805 










810 










815 




Ser 


Ser 


Leu 


Pro 


Glu 


His 


Thr 


Trp 


Phe 


Glu 


Arg 


Arg 


Ser 


Pro 


val 


Asn 








820 










825 










OTA 






Arg 


Asp 


Pro 


Gly 


Ser 


Ala 


Asp 


Asn 


Leu 


Thr 


Ala 


lie 


Asn 


Asn 


Thr 


Tyr 






835 










840 










84 5 








Asp 


Arg 


Val 


Thr 


Lys 


Gly Asp 


He 


Arg 


Ala 


I le 


Leu 


Asn 


Ser 


Thr 


Thr 




850 










855 










860 










Asp 


lie 


Lys 


Thr 


Ser 


Phe 


Asn 


Ser 


Tyr 


Ala 


Pro 


Ala 


Arg 


Pro 


Phe 


Ser 


865 










870 










875 










880 


Lys 


Pro 


Leu 


Ala 


Pro 


Pro 


Ala 


Gly 


Val 


Ser 


Ala 


Ala 


Ala 


Gin 


Ala 


Thr 










885 










890 










one. 
895 




Ser 


Phe 


Leu 


Gly 


Val 


Leu 


Gly 


Gly 


Phe 


Pro 


lieu 


Pro 


T 1 

lie 


Pro 


Ser 


Ser 








900 










905 










910 






Phe 


Leu 


lie 


Gin 


Lys 


Ser 


Val 


Gin 


Glu 


Ser 


Val 


Ser 


Asn 


Gly 


Thr 


vai 






915 










920 










925 








Gly 


Ser 


Met 


His 


Gly 


lie 


Val 


Pro 


Leu 


Lys 


Pne 


His 


Glu 


Gly 


Asp 


GlU 




930 










935 










940 










Leu 


Trp 


Gin 


Gin 


Cys 


Glu 


Val 


Lys 


Glu 


Thr 


Glu 


Gly 


Ala 


Leu 


Asn 


Pne 


945 










950 










955 










960 


lie 


Pro 


Pro 


Pro 


Met: 


Ala 


Leu 


Phe 


Glu 


Ser 


Leu 


Leu 


Arg 


val 


Arg 


Thr 










965 










970 










975 




Leu 


Ser 


Ser 


Glu 


Thr 


Phe 


He 


Arg 


Pro 


Glu 


Leu 


lie 


Pro 


Asn 


Arg 


Phe 








980 










985 










990 






Arg 


Ala 


Asp 


Trp 


Gly 


Leu 


Ser 


Pro 


His 


Thr 


Ala 


Gly 


His 


Tyr 


Leu 


Asn 






995 










1000 








1005 






Gly 


Val 


Tyr 


Ser 


Pro 


Pro 


Cys 


Val 


Arg 


Glu 


Glu 


Thr 


Gly 


Gin 


Ser 


Phe 




1010 








1015 








1020 








Gly 


Tyr 


Pro 


Cys 


Ser 


Gly 


Ala 


Leu 


Ser 


Gin 


Tyr 


Thr 


Thr 


Met 


Met 


Val 


1025 








1030 








1035 








1040 


Pro 


Lys 


Pro 


Leu 


Gly 


Pro 


Gin 


Ser 


His 


Ser 


Ser 


Leu 


Ser 


Lys 


Phe 


Ser 










1045 








1050 








1055 


lie 


Lys 


Ser 


Tyr 


Val 


Glu 


Glu 


Gin 


Thr 


Arg 


Leu 


Leu 


Pro 


Ala 


Asn 


He 








1060 








1065 








1070 




Gly 


Glu 


Lys 


Ser 


lie 


Phe 


Glu 


Met 


Gin 


Asp 


Pro 


Asn 


Ser 


Lys 


Asn 


He 



1075 1080 1085 



WO 01/38351 



218 



PCT/US00/28888 



Phe Asp Lys lie Gly Glu Leu Gly Glu Lys Glu Asn Cys Asn Cys Thr 

1090 1095 1100 

Asn Gly Leu Phe Cys Pro Lys Val Asn Gly Gly Gly Arg Asn Lys Thr 
1105 1110 1115 1120 

Asp Pro He Ala Ala Thr Pro Ser Arg Gly Asn Arg His Ser Arg Phe 

1125 1130 1135 

Pro Leu Met Thr Thr Leu Pro Lys Asn Asp Val His Leu Ser Ala Ala 

1140 1145 1150 

Leu Leu Arg Ala Gin Ser Gly Asp Ala Arg He Leu Asn Thr He Gly 

1155 1160 1165 

Glu Thr Lys Thr Asn Gly Arg Lys He Asn Leu Lys Ala Ala Thr Glu 

1170 1175 1180 

Asn He Trp Asp He Ser Ser Asn Val Met Leu Ala Pro Asn Lys Phe 
1185 1190 1195 1200 

Cys Ala Met Arg Arg Ser Thr Ala Tyr Thr Pro Tyr Ser Thr Arg Gin 

1205 1210 1215 

Glu Lys Val Pro Ala Ala Val Leu Asp Glu Arg Lys Gly Thr Phe Asp 

1220 1225 1230 

Arg Asn Ala Glu Leu Leu Gly Asp Val Gly Met Thr Asp He Val Ser 

1235 1240 1245 

Asn Asp He Leu Met Glu Asp Tyr Glu Arg Leu Pro Gly Val Pro Pro 

1250 1255 1260 

Ala Glu Ala Glu He Phe His He He Arg Asp Ala Ala Lys Thr Gly 
1265 1270 1275 1280 

Gin Glu Gly Ala Lys Ala Arg Arg He Val Asp Phe Phe Glu Ser Ser 

1285 1290 1295 

His Gly Val Thr Ala Ser Thr Phe Asn Val Gly Thr Phe Ser Pro Tyr 

1300 1305 1310 

Val Glu Gly Val Lys Asp He Val Ser Leu Tyr Ala Thr Pro Cys Phe 

1315 1320 1325 

Thr Asp He Asp Ser Pro Thr He Ser Ala Asp Ser Ala Thr He Asn 

1330 1335 1340 

Glu Gly Ala Ser He Glu Pro Thr Asp Gly Ser Glu Val Val Val Glu 
1345 1350 1355 1360 

Val Val Asn Ser Asn Met Glu Met Leu Gly Gly Ser Thr Ala Gly Ser 

1365 1370 1375 

Thr Lys Lys Arg Arg Leu Ser He Ser Asp Tyr Val Asp Leu Glu Glu 

1380 1385 1390 

Asp Ala Glu Val Phe Thr He Asn Lys Gin Gly Lys Ala Thr Glu Asn 

1395 1400 1405 

Leu Arg Val Arg Thr Ser Ser Ser Ser Lys Tyr Val Glu Gly Gly Gin 

1410 1415 1420 

Lys Asp Met Val Gly Phe Tyr Glu Ala Ser Lys Arg Val Pro Arg Val 
1425 1430 1435 1440 

Met Arg Arg Val His Val Leu Pro Val Leu Thr Pro Tyr His Gly Gly 

1445 1450 1455 

Phe Glu Ser Cys Ala Pro Thr Ala Ala Gin Ser Ala Cys Thr Arg Gly 

1460 1465 1470 

Val Glu He Thr Tyr Ala Asp Phe Met Arg Pro Ser Asp Leu Ser Gly 

1475 1480 1485 

Thr Lys Thr Thr Leu Glu Gly Val Arg Val Lys Gly Pro Glu Pro Phe 

1490 1495 1500 

Asp Asp Leu Ser Thr Leu Tyr Phe Arg Ser Val Gly Gly Pro Asn Leu 
1505 1510 1515 1S20 

Arg Lys Phe Ala His His His His Phe Gly Tyr Glu Gly Leu Met Ser 

1525 1530 1535 

Arg Tyr Tyr Tyr Thr Arg Glu Lys Thr Val Ser Val Ser Glu Gly Asp 

1540 1545 1550 

Leu Lys Asp Arg Phe Pro Phe Val Cys Gin Ser Asp Arg Gly Pro Phe 

1555 1560 1565 

Pro Pro Lys Arg Asp Gly Thr He Gin Pro Leu Ala Leu Val Asp Met 
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1570 1575 1580 

Gly Val Leu Pro Glu Gly Ala Leu Thr Arg Arg Thr lie Ser Met Glu 
1585 1590 1595 1600 



<210> 136 
<211> 3483 
<212> DNA 
<213> SHRIMP 



<400> 136 

atgaaaattg 

aaacctgcga 

gctgctgtta 

tcaaaaaaac 

gatgtgcagg 

tatgatattg 

tcatcattgc 

ggaaaagata 

cttcccttta 

gggggtggca 

gcaggttggc 

caacaagccg 

acaaagggaa 

aaaaacaaga 

acggaagttt 

aactatgata 

cctgctaatt 

ctacctcgcg 

aaaacaaaca 

aatatggagg 

cttagaaatc 

atagaagcgc 

ttaaccacat 

aaattgttct 

aatacacctt 

actgcaccag 

ccatgtatca 

tcttcccccg 

aacgtttcag 

actgcattac 

tacgctgttt 

ttccaaccca 

gttatcgtca 

gtctccgatc 

accagccgcc 

gaagtatttg 

gaaggacagt 

tcaaccttta 

gtctactgtc 

aaacagacta 

gggggagaaa 

acaaacattg 

caaatcgata 

acacaggggg 

ttagaccaaa 

aaggaaaatc 

ggtattgttg 

tggagaaaca 

gcccctctac 

gatggacaat 



tccagaacaa 
aaatagaaaa 
gaaaagaccc 
tcgctaaaaa 
atccaaaatt 
gtgatgatat 
tgccaattcg 
ccatcgtgag 
aggcagagat 
gcagcgatag 
ctcctatcac 
tattcaaaca 
gtattttcaa 
ccgtgttatt 
ttgcttgtag 
atgtggaata 
ttccccgtct 
aaaatcaact 
ctgttaacac 
gaaatgaatc 
catctaaccc 
aatggaaaat 
cttctgggta 
cggttataag 
cgagagtcgg 
attggcctgg 
tcttgaaaac 
ccacagaata 
ttggtccatt 
tgggaggtga 
tccctgtttt 
ttcacagtat 
atgatgatga 
ccgtgtctgg 
ctatatttat 
gaggtgacga 
tttcagacat 
cggcacctgt 
gcactacact 
gtcttgtccc 
taaagtaccc 
tcaatacaga 
aggaacaaga 
agaaagtgta 
ttgtgcacac 
ttctggtaaa 
tttttggagc 
ctggccgcct 
agactcggct 
ttctggtacg 



ttttacgccc 
agctgtattc 
gaaaaagaaa 
tattgaaagg 
gttacacagt 
ttctcctcaa 
aatgattaat 
gatacatatt 
aaaagttgat 
tggtccgtct 
caatgacccg 
cgtggaagta 
cactgggaat 
cgattctcag 
aaatgtagcc 
cactcctgat 
ttcccgacaa 
ttcagatatt 
accagtatta 
cgagggttac 
agctcggacc 
gttgggctgg 
caatattgta 
tactgatgtg 
gtgcttcggt 
acccacgaac 
tgataatgat 
tgccggtgaa 
gactgaagta 
acagcctaca 
agat tcgcat 
cacgtcaagg 
tgatgatgtt 
ttctatactt 
tgacggtgca 
taaaggcata 
gtctgtacca 
aagggacaca 
tcctcccatt 
cctacctaca 
tcggcacatt 
tactcaaact 
acgtcatcga 
cgctaaaatg 
ttcaaatact 
taaagaagac 
aaataaatac 
acgcgcagtg 
agccacatgg 
tctagaccct 



gatgaaagat 
gggaatatag 
aggaacttga 
atatcctctg 
ataatgaaac 
tcggctcctg 
atccgtacag 
cttgacggaa 
ttagtggatg 
ctattcgaaa 
aatgcgttct 
aattctctag 
agactgaaaa 
gtaacaatat 
ctcatgaggc 
acattgaatg 
gctgaaatag 
aacaaacctt 
aacaaagtac 
aagattctaa 
tttatatgtg 
attgtgggct 
tttccagcat 
aatgctaaca 
tctagcgtaa 
gggccagatt 
aatagagaac 
cgcacaacat 
cgaggtggag 
tttaaggcac 
aacggcctag 
aacaaaacta 
gtccttgaag 
gcaaaagaaa 
cgagacgata 
cagatgccct 
tctaacgaat 
cccgccaccg 
tctaatagag 
agcatacctg 
ttcatttctt 
gaattttcaa 
cagttattgc 
tacatgggag 
acatacgtgg 
aagaaacttg 
cctgacgaac 
ggttcttatt 
cctagcggcg 
agatgtggag 



ctggtgtgat 
cagcggctat 
agaatgggtt 
aggaattaaa 
gcactgcacg 
atagagatgg 
aagaattatt 
tattaccaga 
aaaaatatga 
ctttccctga 
ctcgcaacaa 
ctgacggtat 
tttccattgt 
cttcacccat 
ttgatatgcc 
aaaaatatgt 
cgtccaattt 
cagtgtcgtt 
tatacaatga 
acgccacaga 
tgtctgtacc 
tcaaaacgtc 
caaaagttac 
cgaacaaagt 
actttagagt 
tcttctccta 
ctcgaataac 
ctctactgcc 
atattattac 
ccgcagaacc 
tgaaagcgag 
ctgtcttgac 
ataaaagcta 
acgttctttc 
gggttttcag 
ttacgatgga 
tggctatatg 
acataaccaa 
ggataaggga 
aatgggcatt 
ccataagaac 
tagaaaactg 
cagcacctag 
acggcgtttc 
tagacgagtc 
cggctatttt 
ctgcagacag 
ctcaattgag 
accctgttac 
gaattggtag 



acacataaga 
cgatgattcc 
agaaccggcg 
acgtgtcact 
gcagatcggt 
ttcatcatca 
agaaaaaggt 
taatgtccct 
aggagaagac 
atttgtccca 
tggcaataaa 
aacactatca 
aacagaggat 
accaaaaata 
taaagcaata 
ttcagattac 
agcagctaaa 
cgtgtactcg 
aacgttaaag 
gataacccac 
tgaaagtgaa 
gtctgacgtt 
tcaatctgat 
tgttgtacat 
agatgctgca 
ccaactaaga 
ggctgtgctt 
tcgtgcttta 
ccccgttcaa 
tactaaactg 
tgataaccca 
cgtatctgat 
ccacatcaca 
ttcgagaata 
tgtaaagatg 
cgggcatttt 
gaatgatccc 
caagggcata 
ccctttcatg 
cgctgattac 
gaatgatact 
gttgagggaa 
tgaagcgtat 
tgaagaaaca 
tggaaccaaa 
aggaaaatgg 
atacacgaac 
acaaccagtt 
tcggttagcc 
tgcaaacggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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ttctacaata 
gggaatcaag 
gctagaaatg 
aattcagcac 
ttatgtgtta 
gtactagtag 
tacaagaaat 
tcatcttata 
taa 



ataatggtgc 
acaaggtggt 
aaggaaagaa 
tacatcgccg 
aaatccctca 
accctctggt 
tcatagacac 
atatatctgt 



aaacaatgaa 
gtcgtatgca 
gcatttgaaa 
tctctggaac 
aaatgtcatg 
atttgccaat 
ttcttcaggc 
acccttaaac 



tttacttcct 
gaaagagtaa 
aatgacgatg 
aggacgacat 
tctaaaatag 
gtagcgtcaa 
cctgttgtta 
ttttatacca 



cgttactatt 
gattctacat 
gtctagtttt 
tcgaccatga 
aacccggcac 
gtacagacag 
ttgaccgtgc 
catgtggatt 



tgctatagtt 
gaaaattgta 
agtggataga 
tgatattgta 
tagctcaggt 
ggaagaattc 
atctgttaca 
tatagttggt 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3483 



<210> 137 
<211> 1156 
<212> PRT 
<213> SHRIMP 



<400> 137 



Met 


Lys 


He 


val 


Gin 


Asn 


Asn 


Phe 


Thr 


Pro 


Asp 


Glu 


Arg 


Ser 


Gly 


Val 


1 








b 










io 










15 




He 


His 


He 


Arg 


Lys 


Pro 


Ala 


Lys 


He 


Glu 


Lys 


Ala 


Val 


Phe 


Gly 


Asn 








20 










2b 










30 






He 


Ala 


Ala 


Ala 


lie 


Asp 


Asp 


Ser 


Ala 


Ala 


Val 


Arg 


Lys 


Asp 


Pro 


Lys 






35 










40 










4b 








Lys 


Lys 


Arg 


Asn 


Leu 


Lys 


Asn 


Gly 


Leu 


Glu 


Pro 


Ala 


Ser 


Lys 


Lys 


Leu 




50 










55 










60 










Ala 


Lys 


Asn 


lie 


Glu 


Arg 


He 


Ser 


Ser 


Glu 


Glu 


Leu 


Lys 


Arg 


Val 


Thr 


65 










I u 










/b 










O f\ 

o U 


Asp Val 


Gin 


Asp 


Pro 


Lys 


Leu 


Leu 


His 


Ser 


lie 


Met 


Lys 


Arg 


Thr 


Ala 




















QA 

y u 










yb 




Arg 


Gin 


He 


Gly 


Tyr 


Asp 


He 


Gly 


Asp 


Asp 


He 


Ser 


Pro 


Gin 


Ser 


Ala 








100 










10b 










110 






Pro 


Asp 


Arg 


Asp 


Gly 


Ser 


Ser 


Ser 


Ser 


Ser 


Leu 


Leu 


Pro 


He 


Arg 


Met 






115 










120 










125 








He 


Asn 


lie 


Arg 


Thr 


Glu 


Glu 


Leu 


Leu 


Glu 


Lys 


Gly 


Gly 


Lys 


Asp 


Thr 




130 




















_L H LF 










He 


Val 


Arg 


He 


His 


He 


Leu 


Asp 


Gly 


He 


Leu 


Pro 


Asp 


Asn 


Val 


Pro 


145 










150 










155 










160 


Leu 


Pro 


Phe 


Lys 


Ala 


Glu 


He 


Lys 


Val 


Asp 


Leu 


Val 


Asp 


Glu 


Lys 


Tyr 










165 










170 










175 




Glu 


Gly 


Glu 


Asp 


Gly 


Gly 


Gly 


Ser 


Ser 


Asp 


Ser 


Gly 


Pro 


Ser 


Leu 


Phe 








180 










185 










190 






Glu 


Thr 


Phe 


Pro 


Glu 


Phe 


Val 


Pro 


Ala 


Gly 


Trp 


Pro 


Pro 


He 


Thr 


Asn 






195 










200 










205 








Asp 


Pro 


Asn 


Ala 


Phe 


Ser 


Arg 


Asn 


Asn 


Gly 


Asn 


Lys 


Gin 


Gin 


Ala 


Val 




210 










215 










220 










Phe 


Lys 


His 


Val 


Glu 


Val 


Asn 


Ser 


Leu 


Ala 


Asp 


Gly 


He 


Thr 


Leu 


Ser 


225 










230 










235 










240 


Thr 


Lys 


Gly 


Ser 


He 


Phe 


Ash 


Thr 


Gly 


Asn 


Arg 


Leu 


Lys 


He 


Ser 


He 










245 










250 










255 




Val 


Thr 


Glu 


Asp 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Phe 


Asp 


Ser 


Gin 


Val 


Thr 








260 










265 










270 






He 


Ser 


Ser 


Pro 


He 


Pro 


Lys 


He 


Thr 


Glu 


Val 


Phe 


Ala 


Cys 


Arg 


Asn 






275 










280 










285 








Val 


Met 


Arg 


Leu 


Asp 


Met 


Pro 


Lys 


Ala 


He 


Asn 


Tyr 


Asp 


Asn 


val 


Glu 




290 










295 










300 










Tyr 


Thr 


Pro 


Asp 


Thr 


Leu 


Asn 


Glu 


Lys 


Tyr 


Val 


Ser 


Asp 


Tyr 


Pro 


Ala 


305 










310 










315 










320 


Asn 


Phe 


Pro 


Arg 


Leu 


Ser 


Arg 


Gin 


Ala 


Glu 


He 


Asn 


Leu 


Ala 


Ala 


Lys 










325 










330 










335 




Leu 


Pro 


Arg 


Glu 


Asn 


Gin 


Leu 


Ser 


Asp 


He 


Asn 


Lys 


Pro 


Ser 


Val 


Ser 








340 










345 










350 






Phe 


Val 


Tyr 


Ser 


Lys 


Thr 


Asn 


Thr 


Val 


Asn 


Thr 


Pro 


Val 


Leu 


Asn 


Lys 
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355 360 365 



Val 


Leu 


Tvr 


Asn 


Glu 


Thr 


Leu 


Lys 


Asn 


Met 


Glu 


Gly 


Asn 


Glu 


Ser 


Glu 




370 










375 










380 










Glv 


Tyr 

j 


Lys 


He 


Leu 


Asn 


Ala 


Thr 


Glu 


He 


Thr 


His 


Leu 


Arg 


Asn 


Pro 


385 










390 










395 










400 


Ser 


Asn 


Pro 


Ala 


Arg 


Thr 


Phe 


He 


Cys 


Val 


Ser 


Val 


Pro 


Glu 


Ser 


Glu 










405 










410 










4 1 s 

i j. j 




lie 


Glu 


Ala 


Gin 


Tro 


Lys 


Met 


Leu 


Glv 


Tro 


He 


Val 


Gly 


Phe 


Lys 


Thr 








420 










425 










430 






Car 


Cp 
OCX 




v ax 


Leu 


X 111 


X I1X 


Cp r 

OCX 


ocx 


vji y 


Tyr 


Asn 


He 


Vax 


XT fit? 


Pro 






*a -> -J 




















445 








Ala 


Ser 


Lys 


Val 


Thr 


Gin 


Ser 


Asp 


Lys 


Leu 


Phe 


Ser 


Val 


Tip 

xxcz 


Car 
OCX 


Thr 

1 XIX 




450 










4 ss 










460 










Asp 


Val 


Asn 


Ala 


Asn 


Thr 


Asn 


ys 


Val 


val 


Va 1 
vox 


His 


Asn 


Thr 
XiJx 




Ser 


465 










470 










47^ 










4 fin 

" O w 


Arg 


Val 


Gly 


Cys 






Cp r* 


C£> r 

ocx 


Val 


Asn 


pv,p 

tri It? 


Arg 


Val 


Asp 


nla 


A "I -a 










485 










490 
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Thr 


Ala 


Pro 


Asp 


Trn 


Pro 


Gly 


Pro 

t X V 


Thr 


Asn 


Glv 
yiy 


Pro 


Asp 


rue 


Pha 


Cck v 








500 










505 










si n 






Tyr 


Gin 


Leu 


Arg 


Pro 


wjy o 


Tip 


Tip 

X Xc 


Leu 


Lys 


1 1 1 X 


Asp Asn 


Asp 


Asn 


Arg 
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c o c 








Glu 


Pro 


Arg 


He 


Thr 


Ala 


VOX 


Leu 


Ser 


Ser 


D y r-v 
XT X U 


Ala 


Thr 


ul Li 


Tyr 


a l ^ 

nla 
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OH U 










Gly Glu Arg 


Thr 


Thr 


Ser 


Leu 


Leu 


Pro 


Arg 


Ala 


Leu 


Asn 


v a x 




Ua 1 
v al 












JJv 




















jDU 


Gly 


Pro 


Leu 


J. I IX 


uxu 




ax y 


uiy 


Gly Asp 


lie 


He 


Thr 


Pro 


vax 


uin 




















570 










ETC 

D / D 




Thr 


Ala 


Leu 




wx jf 


VJ J. 


V91U 


VJXil 


Pro 


Thr 


Php 


Lys 


Ala 


Pro 


nla 


ulli 


















585 
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Pro 


Thr 


Lys 


Leu 


•Pur 


Ala 


Val 


Phe 


Pro 


Val 


Leu 


Asp 


Ser 


Hie: 

nx s 


Asn 


uiy 






595 
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605 








Leu 


Val 


Lys 


Axel 


Cor' 

OCX 


Asp 


Asn 




Phe 


Gin 


rlO 


He 


His 


oer 


Ti- 
ne 


inr 




610 




















620 










Ser 


Arg 


Asn 


Lys 


xnr 


inr 


vai 


Leu 


Thr 


Val 


Car 


Asp 


Val 


xxe 


vai 


Asn 


625 










DJU 




















bqU 


Asp 


Asp Asp 


Asp 


Asp 


Val 


vax 


Leu 


Glu 


Asp 


Lys 


Ser 


Tyr 


nlS 


i xe 


inr 










Ofl o 










650 










ODD 




Val 


Ser 


Asp 


Pro 


val 


C<=k-»- 


oiy 


Ser 


He 


Leu 


Ha 
nld 


Lys 


Glu 


Asn 


vax 


Leu 








DDU 










665 










CIA 






Ser 


Ser 


Arg 


He 


Thr 


Ser 




Pro 


He 


Phe 


Tip 

lie? 


Asp 


Gly 


nxa 


Arg 


Asp 






675 










DO u 










685 








Asp 


Arg 


Val 


Phe 


Ser 


Val 


Lys 


Ma *- 

l its l_ 


Glu 


Val 


i lie 


Gly 


Gly 


Asp 


Asp 


Lys 




690 










O ;7 J 










700 










Gly 


lie 


Gin 


Met, 


Pro 


Phe 


Thr 




Asp 


Gly 


His 


Phe 


Glu 


oiy 


Ulll 


Pho 

"He 


705 










* X \J 
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/ 1 Zj 












Ser 


Asp 


Met 


OCX 


v a x 




ocx 


Asn 


Glu 


Leu 




He 


Trp 


Asn 


Asp 


fro 










7? ^ 










730 
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Ser 


Thr 


Phe 


Thr 


Ala 


Pro 


Val 


Arg 


Asp 


Thr 


Pro 


Ala 


Thr 


Asp 


He 


Thr 








740 










745 










750 






Asn 


Lys 


Gly 


He 


Val 


Tyr 


Cys 


Arg 


Thr 


Thr 


Leu 


Pro 


Pro 


He 


Ser 


Asn 






755 










760 










765 








Arg 


Gly 


He 


Arg 


Asp 


Pro 


Phe 


Met 


Lys 


Gin 


Thr 


Ser 


Leu 


Val 


Pro 


Leu 




770 










775 










780 










Pro 


Thr 


Ser 


He 


Pro 


Glu 


Trp 


Ala 


Phe 


Ala 


Asp 


Tyr 


Gly 


Gly 


Glu 


He 


785 










790 










795 










800 


Lys 


Tyr 


Pro 


Arg 


His 


He 


Phe 


He 


Ser 


Ser 


He 


Arg 


Thr 


Asn 


Asp 


Thr 










805 










810 










815 




Thr 


Asn 


He 


Val 


Asn 


Thr 


Asp 


Thr 


Gin 


Thr 


Glu 


Phe 


Ser 


He 


Glu 


Asn 








820 










825 










830 






Trp 


Leu 


Arg 


Glu 


Gin 


He 


Asp 


Lys 


Glu 


Gin 


Glu 


Arg 


His 


Arg 


Gin 


Leu 
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Leu 


pro 


a j. a 


Pro 


Ser 


blU 


rt-La 


Tyr 


X Hi 






Glu 


Lys 


Val 




Ala 




O DU 










D JJ 
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Lys 


Met 


Tyr 


Met 
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t»iy 


vax 


Ser 


uJLU 


ii 


X J I £ 


Leu 


T\ en 
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Gin 


He 


O b3 










R"70 
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val 
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Thr 


oer 


Asn 


inr 


inr 


Tyr 


vai 


Val 


Asp 


bi U 


OCX 




X 11 X. 


Lys 










OO J 




















895 




Lys 


GlU 


Asn 


Leu 


Leu 


va J. 


Asn 


Lys 


pi,, 
blu 


Asp 


Lys 


Lys 


Leu 


7V 1 

nio 


nid 


Tie* 
lie 






y uu 




















qi n 






Leu 


Gly 


Lys 


Trp 


\j±y 


T "1 A 

lie 


vai 


Val 


Phe 




Aia 


Asn 


Lys 


Tyr 


fro 


Asp 






yi d 










920 










925 








Glu 


Pro 


Ala 


Asp 


Arg 


Tyr 


Thr 


Asn 


Trp 


Arg 


Asn 


Thr 


Gly 


Arg 


Leu 


Arg 




930 










935 










940 










Ala 


Val 


Gly 


Ser 


Tyr 


Ser 


Gin 


Leu 


Arg 


bin 


Pro 


Val 


Ala 


Pro 


Leu 


bin 


945 








950 










955 
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Thr 


Arg 


Leu 


Ala 


Thr 


Trp 


Pro 


Ser 


Gly 


Asp 


Pro 


Val 


Thr 


Arg 


Leu 


Ala 










965 










970 










975 




Asp 


Gly 


Gin 


Phe 


Leu 


Val 


Arg 


Leu 


Asp 


Pro 


Arg 


Cys 


Gly 


Gly 


He 


Gly 








980 










985 










990 






Ser 


Ala 


Asn 


Gly 


Phe 


Tyr 


Asn 


Asn 


Asn 


Gly 


Ala 


Asn 


Asn 


Glu 


Phe 


Thr 






995 










1000 








1005 






Ser 


Ser 


Leu 


Leu 


Phe 


Ala 


He 


Val 


Gly 


Asn 


Gin 


Asp 


Lys 


Val 


Val 


Ser 




1010 








1015 








1020 








Tyr 


Ala 


Glu 


Arg 


Val 


Arg 


Phe 


Tyr 


Met 


Lys 


He 


Val 


Ala 


Arg 


Asn 


Glu 


1025 








1030 








1035 








1040 


Gly 


Lys 


Lys 


His 


Leu 


Lys 


Asn 


Asp 


Asp 


Gly 


Leu 


Val 


Leu 


Val 


Asp 


Arg 










1045 








1050 








1055 


Asn 


Ser 


Ala 


Leu 


His 


Arg 


Arg 


Leu 


Trp 


Asn 


Arg 


Thr 


Thr 


Phe 


Asp 


His 








1060 








1065 








1070 




Asp 


Asp 


He 


Val 


Leu 


Cys 


Val 


Lys 


He 


Pro 


Gin 


Asn 


Val 


Met 


Ser 


Lys 






1075 








1080 








1085 






lie 


Glu 


Pro 


Gly 


Thr 


Ser 


Ser 


Gly 


Val 


Leu 


Val 


Asp 


Pro 


Leu 


Val 


Phe 




1090 








1095 








1100 








Ala 


Asn 


Val 


Ala 


Ser 


Ser 


Thr 


Asp 


Arg 


Glu 


Glu 


Phe 


Tyr 


Lys 


Lys 


Phe 


1105 








1110 








1115 








1120 


He 


Asp 


Thr 


Ser 


Ser 


Gly 


Pro 


Val 


Val 


He 


Asp 


Arg 


Ala 


Ser 


Val 


Thr 










1125 








1130 
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Ser 


Ser 


Tyr 


Asn 


He 


Ser 


Val 


Pro 


Leu 


Asn 


Phe 


Tyr 


Thr 


Thr 


Cys 


Gly 



1140 1145 1150 



Phe He Val Gly 
1155 



<210> 138 
<211> 615 
<212> DNA 
<213> SHRIMP 

<400> 138 

atggaatttg gcaacctaac aaacctggac gttgeaatta ttgeaatett gtccattgea 60 

ateattgetc taatcgttat catggttata atgattgtat tcaacacacg tgttggaaga 120 

agegtegteg ctaattatga tcagatgatg cgagtcccaa ttcaaagaag ggcaaaggta 180 

atgtcaattc gtggagagag gtcctacaat actcctcttg gaaaggtggc catgaagaat 240 

ggtctctccg ataaggacat gaaggatgtt tetgetgate ttgtcatctc taccgtcaca 300 

gccccaagga ctgatcccgc tggcactggg gecgagaact ctaacatgac tttgaagatc 360 

ctcaacaaca ctggcgtcga tctcttgatc aacgacatta ctgttcggcc aactgttatt 420 

gcaggaaaca ttaagggaaa tactatgtcg aacacttact tctcgagcaa ggacattaaa 480 

tcttcatctt caaaaattac cctcattgac gtgtgcagca aatttgaaga cggcgcagcc 540 

ttcgaagcta caatgaacat tggattcacc tccaagaatg tgatcgatat caaggacgaa 600 
atcaagaaga agtaa 615 
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<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 139 



Met 
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Phe 


Gly Asn 


Leu 


Thr 


Asn 


Leu 


Asp Val 
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He 


He Ala 


He 
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Leu 
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lie 


Ala 
20 


He 


He 


Ala 


Leu 


He 
25 


Val 


He 


Met 


Val 


He Met 
30 


He 


Val 


Phe 


Asn 


Thr 


Arg 


Val 


Gly Arg 


Ser 


Val 


Val 


Ala 


Asn 


Tyr Asp 


Gin 






35 










40 










45 






Met 


Met 
50 


Arg 


Val 


Pro 


He 


Gin 
55 


Arg 


Arg 


Ala 


Lys 


Val 
60 


Met 


Ser lie 


Arg 


Gly 


Glu 


Arg 


Ser 


Tyr 


Asn 


Thr 


Pro 


Leu 


Gly Lys 


Val 


Ala 


Met Lys 


Asn 


65 










70 










75 








80 


Gly 


Leu 


Ser 


Asp 


Lys 
85 


Asp 


Met 


Lys 


Asp 


Val 
90 


Ser 


Ala 


Asp 


Leu Val 
95 


He 


Ser 


Thr 


Val 


Thr 
100 


Ala 


Pro 


Arg 


Thr 


Asp 
105 


Pro 


Ala 


Gly 


Thr 


Gly Ala 
110 


Glu 


Asn 


Ser 


Asn 
115 


Met 


Thr 


Leu 


Lys 


He 
120 


Leu 


Asn 


Asn 


Thr 


Gly 
125 


Val Asp 


Leu 


Leu 


lie 
130 


Asn 


Asp 


He 


Thr 


Val 
135 


Arg 


Pro 


Thr 


Val 


He 
140 


Ala 


Gly Asn 


He 


Lys 


Gly 


Asn 


Thr 


Met 


Ser 


Asn 


Thr 


Tyr 


Phe 


Ser 


Ser 


Lys 


Asp He 


Lys 


145 










150 










155 








160 


Ser 


Ser 


Ser 


Ser 


Lys 
165 


He 


Thr 


Leu 


He 


Asp 
170 


Val 


Cys 


Ser 


Lys Phe 
175 


Glu 


Asp 


Gly 


Ala 


Ala 
180 


Phe 


Glu 


Ala 


Thr 


Met 

185 


Asn 


He 


Gly 


Phe 


Thr Ser 
190 


Lys 


Asn 


Val 


He 
195 


Asp 


He 


Lys 


Asp 


Glu 
200 


He 


Lys 


Lys 


Lys 









<210> 140 
<211> 2571 
<212> DNA 
<213> SHRIMP 



<400> 140 

atgacagagt 

cgactaaatt 

gaaaaacaaa 

acgacagaat 

gtcatgagga 

tggagattga 

ccattattga 

gctaatctag 

tcagaaatgt 

aaaagagcca 

tctatcgcct 

ctagagtacg 

tcctttagtg 

atatcgactg 

gtaggttctt 

atcgacgcag 

acaactaaac 

acagaaaggg 

actgctgaga 

gggcctaaac 

aagtttaacg 

gttatgttct 



caaaggacta 
atgtatcgga 
ggaaggttgt 
tttttaattc 
gtgtctacag 
ccgaaactgc 
accttttgaa 
ttctttattc 
tggccaacag 
ttgctctatt 
ctagaatgaa 
gacgcgctgc 
atggagttac 
ttttaaccaa 
tccatgaagc 
agtacagggc 
tggataaata 
aaaaagaatg 
atctctttag 
gcacacctac 
atattgtttc 
ccaactactc 



tgttctggca 
agggctagtt 
tatttcttct 
gggactaaga 
agatatagaa 
agcggaagaa 
tgcctttgag 
cactagaggt 
aaacaattca 
tctaggatat 
agagaggggt 
tgacatgatt 
agtcagtgaa 
catggtttct 
tcttttaggg 
agtgctgtcc 
tgtgaacaat 
ggtcgagagg 
ggaccaccca 
cgcccttcaa 
accagaaact 
tgttacccgc 



ttagtagcag 
gctgccattt 
gacgtattcg 
ttagccaaag 
ggtgttagag 
ctatgtgact 
aatattatgg 
gatactaatg 
acggtggccg 
actctttgtg 
ttagatccct 
gaaaaaagga 
gaagatggac 
gtaatacagg 
agggaaatca 
aggaccagaa 
ccccatctgc 
gaaagagaac 
tatcttccaa 
gctcttcaac 
ttggagtatt 
gttcttgatc 



aaactaaaac 
caaatttaca 
gacccacgtg 
gacacctatc 
agcatataat 
ttactttttt 
acggtgtgtt 
aaccttcatg 
atctggcaat 
atattctaag 
ttgctgccat 
taaaagactt 
aatcttatgt 
aaggattcta 
tggtactatt 
acgctaaacc 
agatgccatc 
gaattaaaac 
aggcgataga 
gtgaatacaa 
ttcttgtcaa 
ccgactcagc 



cgatgaaaaa 
aaatacacct 
gttcaataaa 
taaagatgcc 
tgacccctct 
gaaacaagca 
tagatcagca 
ggtaattgat 
gggacgagca 
atggaagcaa 
gcctcctcac 
tattgaaggg 
tgttccaact 
cccccccaag 
atcagcagcc 
aaatcctttg 
cgaaagtgtc 
tacggatatg 
cggaatttta 
gagatgtaac 
taatagacaa 
tgcccgattt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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tcaatgtatg 
aatagctctg 
ttattcgtct 
atcagtttaa 
aaagcagtac 
aatgggttag 
ggagggttag 
gaagtgcgta 
gtccatgtat 
acctccaatc 
tggggagatt 
acatcatcat 
agaaatgtga 
tctgcatttt 
gaaaatgtta 
gagatatgga 
caagacttga 
cctatttcat 
atgggaagca 
cctctaagtt 
gattcttata 



tgctatggaa 
cagtgaaaag 
gccagcgttg 
aggaacaaag 
tcgatgcttt 
gggagttgat 
gtggtgatat 
acatgcacct 
ctaaagggtt 
cttcaataac 
tgaagtttcc 
tattatcctt 
ttgatgcgtc 
catccaagat 
acagagaaaa 
aaactagtgg 
tacggaacgc 
ccatagcctt 
ataataagta 
atactggaaa 
ctacaatgaa 



tgcattattt 
caggttaatt 
tgaatctggt 
taaaccgtct 
aggtggtggt 
ggttgagata 
aggttttgaa 
agtggataag 
tgctgccctt 
gaacaatttc 
tacacataac 
tgcaagcttt 
aaacactagt 
gttagtggat 
attgatctgg 
gagtaaagcc 
atctatagga 
attcaggtct 
ccgtctcgcc 
gattgtattc 
agtatttcct 



ttatgctctg 
cttcaagtgt 
ttcataacaa 
atgggagaac 
ggtggtaata 
ctctctgcag 
ggaaaaatga 
aaagggtatg 
tctttctacc 
gacagagcag 
ttatggggaa 
tgggctttaa 
tttgtaccag 
aacatgttga 
aaagcattta 
gcatccgaca 
agacttattg 
atgaagagga 
agggacacga 
tctttggatg 
ctaacacctt 



gagggctaac 
ttttgaaaga 
aatctctaga 
aggaactaga 
ataaaggagc 
attcgggtct 
aacaaaaaag 
tgtttgaagc 
tgttgtacgc 
tatatttact 
atgttccaac 
ggaatgctgt 
ggaggccctt 
agaataatta 
gagagatgca 
ggaatgtgaa 
tcgaacctgt 
gtagaagcga 
aaacagctac 
atttgaaaaa 
tagatggata 



acagaaaact 
tatgcattct 
cacttttacc 
gacgtactgg 
agaaaatgtc 
attaagaggg 
agaggacgaa 
tgcaaaatat 
ggctgcagcc 
attggcaaga 
agatgaaaat 
tagagcacga 
acctctctta 
tgtcaaagtg 
aacggaatca 
gaaggccaag 
aggaaagacc 
agatttaaag 
accaaggaac 
tttcagtaag 

g 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2571 



<210> 141 
<211> 852 
<212> PRT 
<213> SHRIMP 



<400> 141 
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Ser 
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Pro 
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Ser 


Ser 


Asp 


Val 


Phe 
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Pro 


Thr 
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Ser 


Gly 


Leu 


Arg 


Leu 


Ala 
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Val 


Met 


Arg 


Ser 


val 


Tyr 


Arg 


Asp 
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He 


Asp 


Pro 


Ser 


Trp 


Arg 


Leu 


Thr 








100 










Asp 


Phe 


Thr 


Phe 


Leu 


Lys 


Gin 


Ala 






115 










120 


Phe 


Glu 


Asn 


He 


Met 


Asp 


Gly 


Val 
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135 




Lys 


Thr 


Arg 


Gly 


Asp 


Thr 


Asn 


Glu 


145 










150 






Met 


Leu 


Ala 


Asn 


Arg 


Asn 


Asn 


Ser 










165 








Arg 


Ala 


Lys 


Arg 


Ala 


He 


Ala 


Leu 
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He 


Leu 


Arg 


Trp 


Lys 


Gin 


Ser 


He 
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Leu 


Asp 


Pro 


Phe 


Ala 


Ala 


Met 


Pro 




210 










215 




Ala 


Asp 


Met 


He 


Glu 


Lys 


Arg 


He 


225 










230 






Ser 


Asp 


Gly 


Val 


Thr 


Val 


Ser 


Glu 










245 








Pro 


Thr 


He 


Ser 


Thr 


Val 


Leu 


Thr 



Leu 


Ala 


Leu 


Val 


Ala 


Glu 


Thr 


Lys 




10 










15 




Val 


Ser 


Glu 


Gly 


Leu 


Val 


Ala 


Ala 


25 










30 






Glu 


Lys 


Gin 


Arg 


Lys 


Val 


Val 


He 










45 








Trp 


Phe 


Asn 


Lys 


Thr 


Thr 


Glu 


Phe 








60 










Lys 


Gly 


His 


Leu 


Ser 


Lys 


Asp 


Ala 






75 










80 


He 


Glu 


Gly 


Val 


Arg 


Glu 


His 


He 




90 










95 




Glu 


Thr 


Ala 


Ala 


Glu 


Glu 


Leu 


Cys 


105 










110 






Pro 


Leu 


Leu 


Asn 


Leu 


Leu 


Asn 


Ala 










125 








Phe 


Arg 


Ser 


Ala 


Ala 


Asn 


Leu 


Val 
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Pro 


Ser 


Trp 


Val 


He 


Asp 


Ser 


Glu 






155 










160 


Thr 


Val 


Ala 


Asp 


Leu 


Ala 


Met 


Gly 




170 










175 




Phe 


Leu 


Gly 


Tyr 


Thr 


Leu 


Cys 


Asp 


185 










190 






Ala 


Ser 


Arg 


Met 


Lys 


Glu 


Arg 


Gly 










205 








Pro 


His 


Leu 


Glu 


Tyr 


Gly 


Arg 


Ala 








220 










Lys 


Asp 


Phe 


He 


Glu 


Gly 


Ser 


Phe 






235 










240 


Glu 


Asp 


Gly 


Gin 


Ser 


Tyr 


Val 


Val 




250 










255 




Asn 


Met 


Val 


Ser 


Val 


He 


Gin 


Glu 
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260 



Caiy 




Tyr 


riO 






2/D 




Arg 


Glu 


He 


Met 










Ala 


Val 


Leu 


Ser 


305 








Lys 


Leu 


ASp 


Lys 


Ser 


Val 


Thr 


GJ.U 








34 0 


lie 


Lys 


inr 


i nr 






ice 

355 




Tyr 


Leu 


Pro 


Lys 




3 70 






Thr 


Ala 


Leu 


Gin 


385 








Asn 


Asp 


He 


Val 


Arg 


Gin 


Val 


Met 








4^U 


Asp 


Ser 


Ala 


ax a 






435 




Leu 


Cys 


Ser 


Gly 




4 50 






Ser 


Arg 


Leu 


He 


465 








Val 


Cys 


Gin 


Arg 


Phe 


Thr 


He 


Ser 








500 


Glu 


Leu 


Glu 


Thr 






515 




Gly 


Gly 


Asn 


Asn 




530 






Met 


Val 


Glu 


He 


545 








Leu 


Gly 


Gly 


Asp 


Asp 


Glu 


Glu 


Val 








580 


Phe 


Glu 


Ala 


Ala 






595 




Ser 


Phe 


Tyr 


Leu 




610 






Thr 


Asn 


Asn 


Phe 


625 








Asp 


Leu 


Lys 


irne 


Glu 


Asn 


Thr 


Ser 








660 


Asn 


Ala 


Val 


Arg 






675 




Pro 


Gly 


Arg 


Pro 




690 






Val 


Asp 


Asn 


Met 


705 








Arg 


Glu 


Lys 


Leu 


Glu 


lie 


Trp 


Lys 



Pro 


Lys 


Val 


Glv 








280 


v d X 


Leu 


T oti 


Ser 






^ ^ j 




Axg 


i in 


Arg 


Asn 










Tyr 


Val 


Asn 


Asn 


325 








Arg 




Lys 




Asp 


Mat 

net. 


1 ill 


nla 










ax a 


i ie 


Asp 


r i *, 
bly 






D / d 




AT a 


j^eu 


Gin 


rg 










Ser 


Pro 


Glu 


Thr 


405 








IT i lfcS 


Ser 


Asn 


Tyr 


Arg 






net 








H H KJ 


V3iy 


Leu 


inr 


bin 






4 C, 




Leu 


bin 


vai 


f ne 




«i /u 






Cys 


Glu 


Ser 


Gly 


485 








Leu 


Lys 




uin 


Tyr 


Trp 


Lys 


Ala 








DZU 


Lys 


vjiy 


Ala 


U1U 






ddd 




Leu 


Ser 


TV 1 -» 

Ala 


Asp 




jjU 






lie 


Gly 


Phe 


Glu 


565 








Arg 


Asn 


riet 


Hie 


Lys 


Tyr 


vai 


HIS 








£r»n 

DUU 


Leu 


Tyr 


nla 


riia 






DID 




Asp 


Arg 


nla 


Val 




b3U 






*r J. U> 


L lit 


n x o 


Asn 


645 








Ser 


Leu 


Leu 


Ser 


Ala 


Arg 


Arg 


Asn 








680 


Leu 


Pro 


Leu 


Leu 






695 




Leu 


Lys 


Asn 


Asn 




710 






He 


Trp 


Lys 


Ala 


725 








Thr 


Ser 


Gly 


Ser 



265 



Ser 


Phe 


His 


Glu 


Ala 


Ala 


He 


Asp 










ma 




P m 


noil 






^1 ^ 

Ji J 






His 


Leu 


Gin 










Trp 


Val 


Glu 


Arg 


345 








UlU 


Asn 


Leu 


XT JlS 


Tift 
1 1 fci 


Leu 


uiy 


XT X V 










rin 
UlU 


Tyr 


Lys 


Arg 






J? j 




Leu 


oiu 


i yr 






/in 






Ser 


val 


Thr 


Arg 


425 








Tyr 


Vdl 


Leu 


Trp 


Lys 


TK v 

i nr 


Asn 


oer 








^ OU 


Leu 


Lys 


Asp 








,4 "7 ^ 
H I D 






i ie 


1 £1 X 


Lys 




h yu 






Ser 


Lys 


Pro 


Ser 


505 








Val 


Leu 


Asp 


Ala 


Asn 


vax 


Asn 


Gly 








D4 U 


oer 


bly 


Leu 


Leu 






ODD 




Gly 


Lys 


Met 


Lys 




tin 






Leu 


Val 


Asp 


Lys 


585 








vai 


Ser 


Lys 


Q»iy 


M.-L a 


Mia 


1 11 X 












Tyr 


Leu 


Leu 


Leu 






ODD 




Leu 


T rn 
1 ip 


Gly 


Asn 




650 






Phe 


Ala 


Ser 


Phe 


665 








Val 


He 


Asp 


Asn 


Ser 


Ala 


Phe 


Ser 








700 


Tyr 


Val 


Lys 


Val 






715 




Phe 


Arg 


Glu 


Met 




730 






Lys 


Ala 


Ala 


Ser 



745 





270 






Ala 


Leu 


Leu 


Glv 


285 








Ala 


Glu 


Tyr 


Arg 


Pro 


Leu 


Thr 


Thr 








j ^ \j 




Prn 




V91U 










villi 


Arg 


OlU 


Arg 










Arg 


Asp 


His 


XT 1 tl 


365 








Lys 


Arg 


Thr 


Pro 




Asn 


Lys 


rile 








4 no 


_ 

i*eu 


vai 


Asn 


Asn 






*± i j 




vol 


Leu 


Asp 


Pro 




430 






Asn 


Til a 

Ala 


Leu 


Jriie 


445 








Ser 


Ala 


Val 


Lys 


Hi cs 
nib 


Cor 


Leu 


Pho 

XT Ix€ 








4 ft n 


oer 


Leu 


Asp 


Th v- 

inr 












uiy 




bin 




olU 






Leu 


bly 


vaiy 


biy 


525 








Leu 


Gly 


Glu 


Leu 


Arg 


uiy 


uiy 


(jiy 








D OU 


oin 


Lys 


Arg 


blu 










Lys 


vaiy 


Tyr 


vai 




con 

jyu 






rne 


Aid 


nla 


Leu 


605 








Asn 


Pro 


Ser 


He 


nla 


Arg 


Trp 


vjxy 








a a f\ 
o4 U 


val 


Pro 


inr 


Asp 






ODD 




Trp 


Ala 


Leu 


Arg 




670 






Thr 


Ser 


Phe 


Val 


685 








Ser 


Lys 


Met 


Leu 


Glu 


Asn 


Val 


Asn 








720 


Gin 


Thr 


Glu 


Ser 






735 




Asp 


Arg 


Asn 


Val 




750 
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Lys 


Lys 


Ala 


Lys 


Gin 


Asp 


Leu 


He 


Arg 


Asn 


Ala 


Ser 


He 


Gly Arg 


Leu 






755 










760 










765 








Tip 

J. lc 


val 

770 




Pro 


\7-a "1 
Val 




Lys 
775 


Thr 


IT -L \J 


Tip 

lie 


oer 


Ser 
780 


Tie 


Ala 


Leu 


rrne 


Arg 


Ser 


Met 


Lys 


Arg 


Ser 


Arg 


Ser 


Glu 


Asp 


Leu 


Lys 


Met 


Gly 


Ser 


Asn 


785 










790 










795 










800 


Asn 


Lys 


Tyr 


Arg 


Leu 


Ala 


Arg Asp 


Thr 


Lys 


Thr 


Ala 


Thr 


Pro 


Arg 


Asn 










805 










810 










815 




Pro 


Leu 


Ser 


Tyr 
820 


Thr 


Gly 


Lys 


He 


Val 
825 


Phe 


Ser 


Leu 


Asp 


Asp 
830 


Leu 


Lys 


Asn 


Phe 


Ser 
835 


Lys 


Asp 


Ser 


Tyr 


Thr 
840 


Thr 


Met 


Lys 


Val 


Phe 
845 


Pro 


Leu 


Thr 


Pro 


Leu 
850 


Asp 


Gly 



























<210> 142 
<211> 591 
<212> DNA 
<213> SHRIMP 



<400> 142 

atggacgttt 

ctccagcgcg 

aaggagggaa 

ccagccactt 

cgcaagtcaa 

cccgtccgtt 

ccctacgacg 

ccaccatacg 

cgtcaagtcc 

agggagaaga 



cttcctataa 
ccacctacaa 
ggtacgtccg 
gctacacttg 
tattcagaga 
tagaagagct 
aeggatctte 
acccatcaga 
tcagttctat 
aggccctggg 



gagcactatt 
ggatcgtatg 
atcgttggcc 
cggctacacc 
gccttgcgct 
tgtcgacgtg 
tacategtet 
aaatccacag 
gaeggatcac 
tcttggccgt 



gactaccaca 
gagaeggaat 
accatggacg 
tttattagac 
tacccagaac 
ccagaaggag 
tcacaagccg 
aggtcccaag 
gccagggcca 
cgcaacaact 



acattgaaga 
tggtcctcga 
aattggaggt 
gcagggcacc 
ttctccccga 
cgagtttttt 
aatgtgaaga 
tgtgtgatta 
acctcataaa 
ttagctacta 



tatggacgat 
gatggctaag 
acctgaagaa 
cccaccaaaa 
tgcaccatcc 
cacctaccct 
tgattatcct 
ttgtaccaca 
aaatctgaag 

g 



60 

120 

180 

240 

300 

360 

420 

480 

540 

591 



<210> 143 
<211> 194 
<212> PRT 
<213> SHRIMP 



<400> 143 












Met 


Asp 


Val 


Ser 


Ser 


Tyr 


Lys 


Ser 


1 








5 








Asp 


Met 


Asp 


Asp 


Leu 


Gin 


Arg 


Ala 








20 










Val 


Leu 


Glu 


Met 


Ala 


Lys 


Lys 


Glu 






35 










40 


Thr 


Met 


Asp 


Glu 


Leu 


Glu 


Val 


Pro 




50 










55 




Cys 


Gly 


Tyr 


Thr 


Phe 


He 


Arg 


Arg 


65 










70 






Ser 


Tie 


Phe 


Arg 


Glu 


Pro 


Cys 


Ala 










85 








Pro 


Ser 


Pro 


Val 


Arg 


Leu 


Glu 


Glu 








100 










Ser 


Phe 


Phe 


Thr 


Tyr 


Pro 


Pro 


Tyr 






115 










120 


Ser 


Gin 


Ala 


Glu 


Cys 


Glu 


Asp 


Asp 




130 










135 




Glu 


Asn 


Pro 


Gin 


Arg 


Ser 


Gin 


Val 


145 










150 






val 


Leu 


Ser 


Ser 


Met 


Thr 


Asp 


His 



165 



Thr 


He 
10 


Asp 


Tyr 


His 


Asn 


He 
15 


Glu 


Thr 


Tyr 


Lys 


Asp 


Arg 


Met 


Glu 


Leu 


25 










30 






Gly 


Arg 


Tyr 


Val 


Arg 
45 


Ser 


Leu 


Ala 


Glu 


Glu 


Pro 


Ala 
60 


Thr 


Cys 


Tyr 


Thr 


Arg 


Ala 


Pro 
75 


Pro 


Pro 


Lys 


Arg 


Lys 
80 


Tyr 


Pro 
90 


Glu 


Leu 


Leu 


Pro 


Asp 
95 


Ala 


Leu 


Val 


Asp 


Val 


Pro 


Glu 


Gly 


Ala 


105 










110 






Asp 


Asp 


Gly 


Ser 


Ser 
125 


Thr 


Ser 


Ser 


Tyr 


Pro 


Pro 


Pro 
140 


Tyr 


Asp 


Pro 


Ser 


Cys 


Asp 


Tyr 
155 


Cys 


Thr 


Thr 


Arg 


Gin 
160 


Ala 


Arg 
170 


Ala 


Asn 


Leu 


lie 


Lys 
175 


Asn 



WO 01/38351 



227 



PCT/US00/28888 



Leu Lys Arg Glu Lys Lys Ala Leu Gly Leu Gly Arg Arg Asn Asn Phe 
180 185 190 

Ser Tyr 



<210> 144 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 144 

atggccccaa 

gaaagggatg 

atctgcctca 

gtatacaaag 

cgaggattgg 

atggaagaga 

tacaatgaag 

aaaacaattg 

caagctctag 

ttcaaggagg 

caccctaaaa 



actccttcca 
aacatgacga 
ctgcaaactg 
tactacgcta 
ccctctcaat 
ctgaagaaaa 
gcgatggcat 
aaggaggaac 
agacattgat 
atatggagga 
aggtgtctgt 



gaaatttgct 
ccctttacgg 
tgtttcgagg 
cgggaaccca 
ggatcaaggt 
ccccatcgac 
gacccacctt 
aagaacgtgg 
gatgtcccat 
aggatttgtg 
tttgtga 



ccagttatca 
cagattgatt 
aagagaaaag 
tacaagtacc 
gaagtaggaa 
aagtgcggag 
tacaacgacg 
gtaaagaaga 
tctataaaac 
cacaaactcc 



agactgagaa 
ttagagatag 
ctggatctgc 
gtcgccccaa 
catgcctccc 
tggcgttcct 
aagagtatat 
accgccaaga 
aatattccaa 
acacatttat 



gaaggaagaa 
aaagacatta 
acatgatcga 
tagaacacat 
tctgcgaccc 
gtactccaac 
aaagaagtgc 
atacttcaga 
ttttattttt 
taatatggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 145 
<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 145 



Met 


Ala 


Pro 


Asn 


Ser 


Phe 


Gin 


Lys 


Phe 


Ala 


Pro 


Val 


He 


Lys 


Thr 


Glu 


1 








5 










10 










15 




Lys 


Lys 


Glu 


Glu 
20 


Glu 


Arg 


Asp 


Glu 


His 
25 


Asp 


Asp 


Pro 


Leu 


Arg 
30 


Gin 


He 


Asp 


Phe 


Arg 
35 


Asp 


Arg 


Lys 


Thr 


Leu 
40 


He 


Cys 


Leu 


Thr 


Ala 
45 


Asn 


Cys 


Val 


Ser 


Arg 
50 


Lys 


Arg 


Lys 


Ala 


Gly 
55 


Ser 


Ala 


His 


Asp 


Arg 
60 


Val 


Tyr 


Lys 


Val 


Leu 


Arg 


Tyr 


Gly 


Asn 


Pro 


Tyr 


Lys 


Tyr 


Arg 


Arg 


Pro 


Asn 


Arg 


Thr 


Gly 


65 










70 










75 










80 


Leu 


Ala 


Leu 


Ser 


Met 
85 


Asp 


Gin 


Gly 


Glu 


Val 
90 


Gly 


Thr 


Cys 


Leu 


Pro 
95 


Leu 


Arg 


Pro 


Met 


Glu 


Glu 


Thr 


Glu 


Glu 


Asn 


Pro 


He 


Asp 


Lys 


Cys 


Gly 


Val 






100 










105 










110 






Ala 


Phe 


Lys 
115 


Asn 


Tyr 


Asn 


Glu 


Gly 
120 


Asp 


Gly 


Met 


Thr 


His 
125 


Leu 


Tyr 


Asn 


Asp 


Glu 
130 


Glu 


Tyr 


lie 


Lys 


Lys 
135 


Cys 


Lys 


Thr 


He 


Glu 
140 


Gly 


Gly 


Thr 


Arg 


Thr 


Trp 


Val 


Lys 


Lys 


Asn 


Arg 


Gin 


Glu 


Tyr 


Phe 


Arg 


Gin 


Ala 


Leu 


Glu 


145 










150 










155 










160 


Thr 


Leu 


Met 


Met 


Ser 
165 


His 


Ser 


He 


Lys 


Gin 
170 


Tyr 


Ser 


Asn 


Phe 


He 
175 


Phe 


Phe 


Lys 


Glu 


Asp 
180 


Met 


Glu 


Glu 


Gly 


Phe 
185 


Val 


His 


Lys 


Leu 


His 
190 


Thr 


Phe 


lie 


Asn 


Met 


Val 


His 


Pro 


Lys 


Lys 


Val 


Ser 


Val 


Leu 











195 200 



<210> 
<211> 
<212> 



146 
870 
DNA 
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<213> SHRIMP 



<400> 146 

atggagggag 

aaggacgata 

cttcgagacg 

gaagaactct 

atacacatct 

agagcagagg 

tatgatacct 

aagaccctga 

gatcttcacc 

ggagaattgt 

aatattgcat 

ggagagttta 

aagaagcagt 

gaaaaaattg 

ggaaacttgc 



aacatcaata 
gaactggaac 
acactattcc 
tgtggttcat 
ggaacgctaa 
gagatttggg 
gttcttccga 
gagaaaatcc 
ttatggctct 
cgtgccagtt 
catactctct 
tcctcactct 
tgtgccgcgt 
aggactttac 
agatgaaaat 



tttgaaccta 
aggaactcta 
agttctcact 
caggggcaat 
tgggtcgcgg 
acccgtatac 
ttatactgga 
agatgataga 
tcctccatgc 
gtatcagcga 
tctgactcat 
tggtgacgca 
ccctagacca 
tatcgacatg 
ggctgtttga 



gtcagggaga 
tccatttttg 
accaagaaaa 
acagacgcca 
gaatttttgg 
ggattccaat 
aagggtattg 
aggatgatta 
cacatgactg 
agcggagatg 
ctgatggcca 
cacatttata 
ttccctaagt 
ttttatcttg 



tcctagaaag 
gaccccaaat 
ttttctggag 
aagaattggc 
acagtagagg 
ggcgtcattt 
atcaattggc 
tgacggcatg 
ctcaatttta 
tcgggttggg 
gtatggtggg 
atacccacat 
tgaggatttt 
aggggtatca 



aggagtgaag 60 
gaggttctct 120 
aggagttgtg 180 
caagaagaag 24 0 
gttatacgat 300 
tggtgctgaa 360 
caatatacta 420 
gaatcctatg 4 80 
tgtggctaat 540 
cgtgcccttc 600 
tctaaaaccg 660 
tgaggtgtta 720 
aatggctcca 780 
accacacagt 840 
870 



<210> 147 
<211> 285 
<212> PRT 
<213> SHRIMP 



<400> 147 



Met 


Glu 


Gly 


Glu 


His 


Gin 


Tyr 


Leu 


Asn 


Leu 


Val 


Arg 


Glu 


He 


Arg 


Gly 


1 








5 










10 










15 




Val 


Lys 


Lys 


Asp 
20 


Asp 


Arg 


Thr 


Gly 


Thr 

25 


Gly 


Thr 


Leu 


Ser 


He 
30 


Phe 


Gly 


Pro 


Gin 


Met 
35 


Arg 


Phe 


Ser 


Leu 


Arg 
40 


Asp 


Asp 


Thr 


He 


Pro 
45 


Val 


Leu 


Thr 


Thr 


Lys 
50 


Lys 


He 


Phe 


Trp 


Arg 

55 


Gly 


Val 


Val 


Glu 


Glu 
60 


Leu 


Leu 


Trp 


Phe 


lie 


Arg 


Gly 


Asn 


Thr 


Asp 


Ala 


Lys 


Glu 


Leu 


Ala 


Lys 


Lys 


Lys 


He 


His 


65 










70 










75 










80 


He 


Trp 


Asn 


Ala 


Asn 
85 


Gly 


Ser 


Arg 


Glu 


Phe 
90 


Leu 


Asp 


Ser 


Arg 


Gly 
95 


Asp 


Arg 


Ala 


Glu 


Gly 
100 


Asp 


Leu 


Gly 


Pro 


Val 
105 


Tyr 


Gly 


Phe 


Gin 


Trp 
110 


Arg 


His 


Phe 


Gly 


Ala 
115 


Glu 


Tyr 


Asp 


Thr 


Cys 
120 


Ser 


Ser 


Asp 


Tyr 


Thr 
125 


Gly 


Lys 


Gly 


He 


Asp 
130 


Gin 


Leu 


Ala 


Asn 


He 
135 


Leu 


Lys 


Thr 


Leu 


Arg 
140 


Glu 


Asn 


Pro 


Asp 


Asp 


Arg 


Arg 


Met 


He 


Met 


Thr 


Ala 


Trp 


Asn 


Pro 


Met 


Asp 


Leu 


His 


Leu 


145 










150 










155 










160 


Met 


Ala 


Leu 


Pro 


Pro 
165 


Cys 


His 


Met 


Thr 


Ala 
170 


Gin 


Phe 


Tyr 


Val 


Ala 
175 


Asn 


Gly 


Glu 


Leu 


Ser 


Cys 


Gin 


Leu 


Tyr 


Gin 


Arg 


Ser 


Gly 


Asp 


Val 


Gly 


Leu 






180 










185 










190 






Gly 


Val 


Pro 


Phe 


Asn 


He 


Ala 


Ser 


Tyr 


Ser 


Leu 


Leu 


Thr 


His 


Leu 


Met 




195 










200 










205 








Ala 


Ser 
210 


Met 


Val 


Gly 


Leu 


Lys 
215 


Pro 


Gly 


Glu 


Phe 


He 
220 


Leu 


Thr 


Leu 


Gly 


Asp 


Ala 


His 


He 


Tyr 


Asn 


Thr 


His 


He 


Glu 


Val 


Leu 


Lys 


Lys 


Gin 


Leu 


225 










230 










235 










240 


Cys 


Arg 


Val 


Pro 


Arg 


Pro 


Phe 


Pro 


Lys 


Leu 


Arg 


He 


Leu 


Met 


Ala 


Pro 








245 










250 










255 




Glu 


Lys 


He 


Glu 
260 


Asp 


Phe 


Thr 


He 


Asp 
265 


Met 


Phe 


Tyr 


Leu 


Glu 
270 


Gly 


Tyr 


Gin 


Pro 


His 
275 


Ser 


Gly 


Asn 


Leu 


Gin 
280 


Met 


Lys 


Met 


Ala 


Val 
285 
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<210> 148 
<211> 678 
<212> DNA 
<213> SHRIMP 



<400> 148 

atggccttta 

acaacaacag 

ggtattgagg 

agtggaggtg 

aacagtggtt 

aactggattg 

ggtggggatg 

gatgtgactt 

tggatgtacc 

gcatcaaacc 

tcatacgtta 

ggattctttg 



attttgaaga 
accctacccg 
tgatgaagag 
ctatgtcgta 
cccatgtcaa 
aagagactga 
gttcacgtgg 
tctttgtgtt 
aaaaactact 
caatgttttc 
ttgaaggcca 
tataataa 



ctctacaaat 
ccccaatatc 
gcgtctcgta 
tttctggctc 
gacaaactgc 
tcgacatgga 
gggatatttt 
tgttgatagg 
agcaattggt 
tgtatgtaat 
tggagtgggt 



ctctttgcca 
atattctttg 
cggcaaggaa 
gaagataatg 
ttggcattat 
cagtactgta 
acttcgctag 
aataatactg 
gcaaaggtag 
gcgtgtaggt 
cattctgatt 



atatggactt 
aaagtctact 
agtgtgggaa 
cagaagatat 
tccttcaaga 
cttttcccca 
ccatgaaatg 
tagaaaatgc 
taaaggtgat 
gcaagtaccc 
tgacatgtga 



gacggctggc 
ccccaactct 
ttttgaagca 
ggagaatctc 
gtttatcagc 
atacatggac 
gatggctagg 
ggcatccata 
tgttgacaat 
aggcccagtg 
tgagatttct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

678 



<210> 149 
<211> 220 
<212> PRT 
<213> SHRIMP 



<400> 149 



Met 


Ala 


Phe 


Asn 


Phe 


Glu 


Asp 


Ser 


Thr 


Asn 


Leu 


Phe 


Ala 


Asn 


Met 


Asp 


1 








5 










10 










15 




Leu 


Thr 


Ala 


Gly Thr 


Thr 


Thr 


Asp 


Pro 


Trp 


Asn 


He 


He 


Phe 


Phe 


Glu 








20 










25 










30 






Ser 


Leu 


Leu 
35 


Pro 


Asn 


Ser 


Gly 


He 
40 


Glu 


Val 


Met 


Lys 


Arg 
45 


Arg 


Leu 


Val 


Arg 


Gin 
50 


Gly 


Lys 


Cys 


Gly 


Asn 
55 


Phe 


Glu 


Ala 


Ser 


Gly 
60 


Gly 


Ala 


Met 


Ser 


Tyr 


Phe 


Trp 


Leu 


Glu 


Asp 


Asn 


Ala 


Glu 


Asp 


Met 


Glu 


Asn 


Leu 


Asn 


Ser 


65 










70 










75 










80 


Gly 


Ser 


His 


Val 


Lys 
85 


Thr 


Asn 


Cys 


Leu 


Ala 
90 


Leu 


Phe 


Leu 


Gin 


Glu 
95 


Phe 


He 


Ser 


Asn 


Trp 
100 


He 


Glu 


Glu 


Thr 


Asp 
105 


Arg 


His 


Gly 


Gin 


Tyr 
110 


Cys 


Thr 


Phe 


Pro 


Gin 


Tyr 


Met Asp 


Gly 


Gly 


Asp 


Gly 


Ser 


Arg 


Gly 


Gly 


Tyr 


Phe 






115 










120 










125 








Thr 


Ser 
130 


Leu 


Ala 


Met 


Lys 


Trp 
135 


Met 


Ala 


Arg 


Asp 


Val 
140 


Thr 


Phe 


Phe 


Val 


Phe 


Val 


Asp 


Arg 


Asn 


Asn 


Thr 


Val 


Glu 


Asn 


Ala 


Ala 


Ser 


He 


Trp 


Met 


145 










150 










155 










160 


Tyr 


Gin 


Lys 


Leu 


Leu 
165 


Ala 


He 


Gly 


Ala 


Lys 
170 


Val 


Val 


Lys 


Val 


He 
175 


Val 


Asp 


Asn 


Asn 


Pro 
180 


Met 


Phe 


Ser 


Val 


Cys 
185 


Asn 


Ala 


Cys 


Arg 


Cys 
190 


Lys 


Tyr 


Pro 


Gly 


Pro 
195 


Val 


Ser 


Tyr 


Val 


He 
200 


Glu 


Gly 


His 


Gly 


Val 
205 


Gly 


His 


Ser 


Asp 


Leu 
210 


Thr 


Cys 


Asp 


Glu 


He 
215 


Ser 


Gly 


Phe 


Phe 


Val 
220 











<210> 150 
<211> 1269 
<212> DNA 
<213> SHRIMP 
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<400> 150 

atggattcta 

gttttaggga 

agggtaggta 

gataatgggg 

ggtggagatg 

gcaaacgtca 

aatgattcaa 

ccattcagag 

atcctttcaa 

aaaaatcaat 

ccagagactg 

gaaaaggtat 

aaaggtaaaa 

gaaacaataa 

gacgaacaga 

gtaaataagg 

gccaatatta 

actagcacag 

tttgtttgtg 

agtgatactg 

gtagaaattt 

actcagtga 



atacttctat 
ttataataac 
aacgcaaata 
tagaaggagg 
gatacgtaga 
tctttcaaga 
ctgataagat 
aagcaaccgt 
ctctagattt 
tagaaaattt 
attatttatg 
tagggttctc 
aatatgaggt 
aagagaaggc 
cctacaaatc 
agaaaaggga 
ctcctctaac 
caaaggaaaa 
gtctgagtaa 
ttttcccttc 
taggtaaaag 



tttaccgcca 
ggtagcatta 
ttacccttct 
aggaggaaca 
tctttctcct 
agtgtcaaag 
aatggaagaa 
agaacgtgaa 
aacagaggaa 
tgactatgaa 
gacaactcat 
agatataaat 
aactactgga 
aggaaatagt 
tttcttccct 
agttactgta 
tacgccagtt 
gacctatcta 
caagtcaact 
atttgacgaa 
actaggtgta 



agcaaacggc 
atagcttccg 
tcatcctctt 
acaacgacac 
caaaagaagg 
gatcaaggag 
actgaggcga 
gtgtttaagg 
caatttaagg 
gacatgaccc 
ttcgatccga 
agtatagaaa 
aatgtagctg 
ctcatccgta 
gcattcaatc 
tccatcaata 
ggagcagctt 
tttgtaatag 
cctctcatgg 
gcagaaagac 
ggaagataca 



cagggttaaa 
tttcatcctt 
cctcagaatt 
caactcaacc 
ctgaactaag 
tggcctttag 
gaatcaataa 
atgacacaga 
acattgttat 
gtctcatctt 
aaaaatatga 
gaatatcctc 
tcctcgttga 
atgtcgagtt 
aagttttctt 
atggatgtgt 
ccggacacta 
acaagtacga 
ctctaaatat 
ctctgacgga 
caaacgccaa 



tctgttacag 
tatattttat 
atctgatgta 
ttcacctgac 
aactagagtt 
acgggcaatg 
cttttcagag 
caaaaacttt 
ggctgaagtg 
tgataacatc 
cacgtactct 
tacattttat 
ttttgaatct 
tattgttgtg 
ctcctttaaa 
aggtatagtg 
catctatggc 
taccactgaa 
tctctttatg 
tgccaaggca 
catcagaaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1269 



<210> 151 
<211> 418 
<212> PRT 
<213> SHRIMP 



<400> 151 



Met 


Asp 


Ser 


Asn 


Thr 


Ser 


He 


Leu 


Pro 


Pro 


Ser 


Lys 


Arg 


Pro 


Gin 


Leu 


1 








5 










10 










15 




Leu 


Gin 


Val 


Leu 


Gly 


He 


He 


He 


Thr 


Val 


He 


Ala 


Ser 


Val 


Ser 


Ser 








20 










25 










30 






Phe 


He 


Phe 


Tyr 


Arg 


Val 


Gly 


Lys 


Arg 


Lys 


Tyr 


Tyr 


Pro 


Ser 


Ser 


Ser 






35 










40 










45 








Ser 


Ser 


Ser 


Glu 


Leu 


Ser 


Asp 


Val 


Asp 


Asn 


Gly 


Val 


Glu 


Gly 


Gly 


Gly 




50 










55 










60 










Gly 


Thr 


Thr 


Thr 


Thr 


Pro 


Thr 


Gin 


Pro 


Ser 


Pro 


Asp 


Gly 


Gly 


Asp 


Gly 


65 










70 










75 










80 


Tyr 


Val 


Asp 


Leu 


Ser 


Pro 


Gin 


Lys 


Lys 


Ala 


Glu 


Leu 


Arg 


Thr 


Arg 


Val 










85 










90 










95 




Ala 


Asn 


Val 


He 


Phe 


Gin 


Glu 


Val 


Ser 


Lys 


Asp 


Gin 


Gly 


Val 


Ala 


Phe 








100 










105 










110 






Arg 


Arg 


Ala 


Met 


Asn 


Asp 


Ser 


Thr 


Asp 


Lys 


He 


Met 


Glu 


Glu 


Thr 


Glu 






115 










120 










125 








Ala 


Arg 


He 


Asn 


Asn 


Phe 


Ser 


Glu 


Pro 


Phe 


Arg 


Glu 


Ala 


Thr 


Val 


Glu 




130 










135 










140 










Arg 


Glu 


Val 


Phe 


Lys 


Asp 


Asp 


Thr 


Asp 


Lys 


Asn 


Phe 


He 


Leu 


Ser 


Thr 


145 










150 










155 










160 


- Leu 


Asp 


Leu 


Thr 


Glu 


Glu 


Gin 


Phe 


Lys 


Asp 


He 


Val 


Met 


Ala 


Glu 


Val 










165 










170 










175 




Lys 


Asn 


Gin 


Leu 


Glu 


Asn 


Phe 


Asp 


Tyr 


Glu 


Asp 


Met 


Thr 


Arg 


Leu 


He 








180 










185 










190 






Phe 


Asp 


Asn 


He 


Pro 


Glu 


Thr 


Asp 


Tyr 


Leu 


Trp 


Thr 


Thr 


His 


Phe 


Asp 






195 










200 










205 








Pro 


Lys 


Lys 


Tyr 


Asp 


Thr 


Tyr 


Ser 


Glu 


Lys 


Val 


Leu 


Gly 


Phe 


Ser 


Asp 




210 










215 










220 










He 


Asn 


Ser 


He 


Glu 


Arg 


He 


Ser 


Ser 


Thr 


Phe 


Tyr 


Lys 


Gly 


Lys 


Lys 


225 










230 










235 










240 
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Tyr 


ri l 

Ol u 


V Ct X 


Thr 

1 UX 


Thr 


Gly Asn 


Val 


Ala 


Val 


Leu 


Va 1 


Asp 


Phe 


Glu 


Ser 










245 










250 










255 




Ol u 


J. 11 J. 


T1 A 

j. xe 


Lys 


Glu 


Lys 


Ala 


Gly 


Asn 


Ser 


Leu 


He 


Arg 


Asn 


Val 


Glu 








260 










265 










270 






OK £3 

rne 


Tie 

x xe 


V a x 


V d X 


ASp 




Gin 


Thr 


Tur 

iyr 


Lys 


OCX 


Phe 




Pro 


Ala 


Phe 






£. f ~J 










280 










£ O J 








Asn 


bin 


v a. x 




tr tits 


ocx 


Phe 


Lys 


v a x 






ox u 


xiy © 


Arg 


Glu 


Val 














295 










300 










i nr 


vax 


Ser 


Tip 
X xc 


noil 


Asn 


Gly 


Cys 


V CLX 




T 1 fa 


V Cl X 


Ala 


now 


Tip 


-l UX 


"*n<^ 




















J X 










320 


Pro 


Leu 


i nr 


i nx 




vai 


Gly Ala 






vjiy 




iyr 


Tip 
X ic 


iyr 




































Thr 


Ser 


Thr 


Ala 


Lys 


Glu 


Lys 


Thr 


Tyr 


Leu 


Phe 


Val 


He 


Asp 


Lys 


Tyr 








340 










345 










350 






Asp 


Thr 


Thr 


Glu 


Phe 


val 


Cys 


Gly 


Leu 


Ser 


Asn 


Lys 


Ser 


Thr 


Pro 


Leu 






355 










360 










365 








Met 


Ala 


Leu 


Asn 


He 


Leu 


Phe 


Met 


Ser 


Asp 


Thr 


Val 


Phe 


Pro 


Ser 


Phe 




370 










375 










380 










Asp 


Glu 


Ala 


Glu 


Arg 


Pro 


Leu 


Thr 


Asp 


Ala 


Lys 


Ala 


Val 


Glu 


He 


Leu 


385 










390 










395 










400 


Gly 


Lys 


Arg 


Leu 


Gly 


Val 


Gly Arg 


Tyr 


Thr 


Asn 


Ala 


Asn 


He 


Arg 


Asn 










405 










410 










415 




Thr 


Gin 































<210> 152 
<211> 831 
<212> DNA 
<213> SHRIMP 



<400> 152 

atggaagggg 

gggtctattg 

aaaataaatg 

actataagaa 

gaaaaaatta 

tttggaaaaa 

tttgaaagtg 

gtatctatga 

ccttcaccta 

aatgcaggtt 

gctgaaaatc 

tttatagact 

gttcagaaaa 

cctgatgaac 



ttattttgga 
atgtgggtac 
aagaaaccac 
gtatagtggc 
gagcaatgac 
tttcaactgg 
ctttaaaggc 
acagtttacg 
tcatgggtat 
cgaaagaatt 
tcattattag 
acgaaacaaa 
caaaggctct 
cagaagggag 



taagatagaa 
ggctattttg 
aatggagaag 
taatgttatc 
agataaggaa 
agatggagga 
agaatataaa 
tatcgatgcg 
tagagaatac 
ggtttttatg 
agttgccaac 
gaagggggtg 
taaagggcgt 
tatcataata 



acaattgcaa 
cgccgtcaat 
attatgggca 
aaagagaata 
ttaaatgaca 
accttccatc 
aacgttccag 
attaatggaa 
ggcaccattc 
accaagattg 
cagcagtata 
tccaaggagg 
agtacttact 
cacgcactag 



<210> 153 
<211> 274 
<212> PRT 
<213> SHRIMP 



aaagggcctc 
tcatggaaaa 
caaaggaaga 
ctgttaaaga 
atagggaatt 
tctttgaaga 
gagcaactac 
aaatcgaaga 
gcaggggcag 
aaaaaagacc 
atgttatgag 
aaatgtttat 
tttcattcgt 
ggttttattg 



tcctagttat 
.aattaggggt 
aagagaggac 
aaatgtaacc 
tatgcatgat 
tacaccgggt 
tccaaaatac 
ggtttataat 
gtacgaagaa 
caataatgta 
gatggtgttt 
accatataat 
aaggaaaatt 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

831 



<400> 153 

Met Glu Gly Val He Leu Asp Lys He Glu Thr 

15 10 
Ser Tyr Gly Ser He Asp Val Gly Thr Ala He 

20 25 
Met Glu Lys He Arg Gly Lys He Asn Glu Glu 

35 40 
He Met Gly Thr Lys Glu Glu Arg Glu Asp Thr 
50 55 



He Ala Lys Arg Asp 
15 

Leu Arg Arg Gin Phe 
30 

Thr Thr Met Glu Lys 
45 

He Arg Ser He Val 
60 
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Ala 


Asn 


Val 


lie 


Lys 


Glu 


Asn 


Thr 


65 










70 






lie 


Arg 


Ala 


Met 


Thr 


Asp 


Lys 


Glu 










85 








His 


Asp 


Phe 


Gly 


Lys 


lie 


Ser 


Thr 








100 










Phe 


Glu 


Asp 


Thr 


Pro 


Gly 


Phe 


Glu 






115 










120 


Asn 


Val 


Pro 


Gly Ala 


Thr 


Thr 


Pro 




130 










135 




Arg 


lie 


Asp 


Ala 


lie 


Asn 


Gly 


Lys 


145 










150 






Pro 


lie 


Met 


Gly 


lie 


Arg 


Glu 


Tyr 










165 








Glu 


Glu 


Asn 


Ala 


Gly 


Ser 


Lys 


Glu 








180 










Lys 


Arg 


Pro 


Asn 


Asn 


Val 


Ala 


Glu 






195 










200 


Gin 


Gin 


Tyr 


Asn 


Val 


Met 


Arg 


Met 




210 










215 




Lys 


Lys 


Gly 


Val 


Ser 


Lys 


Glu 


Glu 


225 










230 






Lys 


Thr 


Lys 


Ala 


Leu 


Lys 


Gly 


Arg 










245 








Lys 


lie 


Pro 


Asp 


Glu 


Pro 


Glu 


Gly 








260 










Phe 


Tyr 















val 


Lys 


GlU 


Asn 


Val 


Thr 


OlU 


Lys 






/ D 










ft n 


Leu 


Asn 


Asp 


Asn 


Arg 


Glu 


Phe 


Met 














95 




Gly 


Asp 


Gly 


Gly 


Thr 


Phe 


His 


Leu 


luo 










110 






Ser 


Ala 


Leu 


Lys 


Ala 


Glu 


Tyr 


Lys 










125 








Lys 


Tyr 


Val 


Ser 


Met 


Asn 


Ser 


Leu 








140 










lie 


Glu 


Glu 


Val 


Tyr Asn 


Pro 


Ser 






155 










160 


Gly 


Thr 


lie 


Arg 


Arg 


Gly Arg 


Tyr 




170 










175 




Leu 


Val 


Phe 


Met 


Thr 


Lys 


He 


Glu 


185 










190 






Asn 


Leu 


lie 


lie 


Arg 


Val 


Ala 


Asn 










205 








Val 


Phe 


Phe 


He 


Asp 


Tyr 


Glu 


Thr 








220 










Met 


Phe 


lie 


Pro 


Tyr 


Asn 


Val 


Gin 






235 










240 


Ser 


Thr 


Tyr 


Phe 


Ser 


Phe 


Val 


Arg 




250 










255 




Ser 


lie 


He 


He 


His 


Ala 


Leu 


Gly 


265 










270 







<210> 154 
<211> 486 
<212> DNA 
<213> SHRIMP 



<400> 154 

atgacatctc 

aaccgatgtg 

aaattcaagt 

aatgttgtta 

aactggtcta 

attacacgcc 

atgaagtaca 

aaattgagca 

gtctag 



cagctccatc 
gtttcctcct 
gtgaacccaa 
cccagatatc 
aatctggtca 
cttcagaagt 
cacgttcctt 
aaattagtag 



<210> 155 
<211> 159 
<212> PRT 
<213> SHRIMP 



accctcttcc 
tgacaacaac 
aaatctggaa 
ccccgaccag 
tgaacgcttc 
gattgaaagt 
gatggaccac 
ttggtgtaca 



acccccaaat 
aaggaagtgg 
ctaattggtg 
atatttgtgg 
agtgacatga 
gtgataaaga 
accgagaaat 
acccctatac 



ccagttgtac 
tcatctacga 
tactttctgg 
gaacatatat 
gtaacaactg 
aaacgtccag 
actatttttc 
gacagtgggt 



cactattgta 
caccaattcc 
agtctctgat 
ggtcaaatat 
tctggacaat 
cgactttaaa 
tggtgaccaa 
atgcaactcc 



60 

120 

180 

240 

300 

360 

420 

480 

486 



<400> 155 

Met Thr Ser Pro Ala Pro Ser Pro Ser Ser Thr Pro Lys Ser Ser Cys 

15 10 15 

Thr Thr He Val Asn Arg Cys Gly Phe Leu Leu Asp Asn Asn Lys Glu 

20 25 30 

Val Val lie Tyr Asp Thr Asn Ser Lys Phe Lys Cys Glu Pro Lys Asn 

35 40 45 

Leu Glu Leu He Gly Val Leu Ser Gly Val Ser Asp Asn Val Val Thr 

50 55 60 

Gin He Ser Pro Asp Gin He Phe Val Gly Thr Tyr Met Val Lys Tyr 
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65 










70 








75 










80 


Asn 


Trp 


Ser 


Lys 


Ser 


Gly 


His 


Glu Arg 


Phe 


Ser 


Asp 


Met 


Ser 


Asn 


Asn 








85 








90 










95 




Cys 


Leu 


Asp 


Asn 


lie 


Trp 


Ser 


Glu Val 


He 


Glu 


Ser 


Val 


He 


Lys 


Lys 




100 








105 










110 






Thr 


Ser 


Ser 


Asp 


Phe 


Lys 


Met 


Lys Tyr 


Thr 


Arg 


Ser 


Leu 


Met 


Asp 


His 






115 










120 








125 








Thr 


Glu 


Lys 


Tyr 


Tyr 


Phe 


Ser 


Gly Asp 


Gin 


Lys 


Leu 


Ser 


Lys 


He 


Ser 




130 










135 








140 










Ser 


Trp 


Cys 


Thr 


Thr 


Pro 


He 


Arg Gin 


Trp 


Val 


Cys 


Asn 


Ser 


Val 




145 








150 








155 













<210> 156 
<211> 894 
<212> DNA 
<213> SHRIMP 

<400> 156 

atggcattac aggaaaagga tataactata gggaatgttt ctgctgccct acgagagttg 60 

atgtactcac ccacacatat gcagcatcac gataagctaa acacattcct ggacagaaat 120 

gttgaatcat cttcagaaga gaaaataaga caaattgtgg ataaaatacg atcccaaaca 180 

acatctgaca tatctgaaac agtcaataat gtcacaacta atgggactgc attttccctt 240 

ttcgaagata ccttagaagg tatggtgaaa aaaaatatag gtgataacct tcagagtggg 300 

gactttattg atggccgtaa aaagctcaat gacatgaaga gtctagctac tggagccatc 360 

ttatctagac agcgagattt tgttgcagaa agtataacag gaacaaagga ctggctcaag 420 

gctataatgg gttgtggtat tataaggtat actgtatttg tcaataacct tgcaagatca 4 80 

acactcgata atgatgatga caaggcagca acctattata acacccctat atatggcggg 54 0 

tattgtaaaa tggctataaa ggactatgaa ataccagatt cgtacagcaa ggtcgaagcg 600 

gaacatacag ttgaaggaag aaagatgacc tttaatataa aatggagagg cgataccata 660 

aacaacctaa taacaatcat cccttcagtg acaggttatc ttgcttccat ctctgaagac 720 

gcagatgtgc aggcgccatt acttttaaac tgcaacaact gttttataga ggcagatatg 780 

agtagcctct acatggatga gaaaaaaaca gaggcatcat ttaccctcaa cttaccggaa 84 0 

atcgaaggag ctgatgcgaa tgcagtctat gaaatatgta tagtagtagt ttga 894 

<210> 157 
<211> 297 
<212> PRT 
<213> SHRIMP 

<400> 157 

Met Ala Leu Gin Glu 

1 5 
Leu Arg Glu Leu Met 
20 

Leu Asn Thr Phe Leu 
35 

He Arg Gin He Val 
50 

Ser Glu Thr Val Asn 
65 

Phe Glu Asp Thr Leu 
85 

Leu Gin Ser Gly Asp 
100 

Lys Ser Leu Ala Thr 
115 

Ala Glu Ser He Thr 
130 

Cys Gly He He Arg 
145 



Lys 


Asp 


He 


Thr 


He 


Gly 


Asn 


Val 


Ser 


Ala 


Ala 










10 










15 




Tyr 


Ser 


Pro 


Thr 


His 


Met 


Gin 


His 


His 


Asp 


Lys 








25 










30 






Asp 


Arg 


Asn 


Val 


Glu 


Ser 


Ser 


Ser 


Glu 


Glu 


Lys 






40 










45 








Asp 


Lys 


He 


Arg 


Ser 


Gin 


Thr 


Thr 


Ser 


Asp 


He 




55 










60 










Asn 


Val 


Thr 


Thr 


Asn 


Gly 


Thr 


Ala 


Phe 


Ser 


Leu 


70 










75 










80 


Glu 


Gly 


Met 


Val 


Lys 


Lys 


Asn 


He 


Gly 


Asp 


Asn 










90 










95 




Phe 


He 


Asp 


Gly 


Arg 


Lys 


Lys 


Leu 


Asn 


Asp 


Met 








105 










110 






Gly 


Ala 


He 


Leu 


Ser 


Arg 


Gin 


Arg 


Asp 


Phe 


Val 






120 










125 








Gly 


Thr 


Lys 
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Trp 


Leu 


Lys 


Ala 


He 


Met 


Gly 
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Tyr 


Thr 


Val 


Phe 


Val 


Asn 


Asn 


Leu 


Ala 


Arg 


Ser 
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155 










160 
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Thr 


Leu 
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Tyr 
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Thr 
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ne 
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He 
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Glu 
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Thr 


val 


blU 


Gly Arg 


Lys 






195 




















£. \J J 








Met 


Thr 
210 


Phe 


Asn 


He 


Lys 


Trp 
215 


Arg 


Gly 


Asp 


Thr 


He 
220 


Asn 


Asn 


Leu 


He 


Thr 


He 


He 


Pro 


Ser 


Val 


Thr 


Gly 


Tyr 


Leu 


Ala 


Ser 


He 


Ser 


Glu 


Asp 


225 
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240 


Ala 


Asp 


Val 


Gin 


Ala 
245 


Pro 


Leu 


Leu 


Leu 


Asn 
250 


Cys 


Asn 


Asn 


Cys 


Phe 
255 


He 


Glu 


Ala 


Asp Met 


Ser 


Ser 


Leu 
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Met 
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Glu 


Lys 


Lys 


Thr 


Glu 


Ala 








260 










265 










270 






Ser 


Phe 


Thr 
275 


Leu 


Asn 


Leu 


Pro 


Glu 
280 


He 


Glu 


Gly 


Ala 


Asp 
285 


Ala 


Asn 
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Val 
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Glu 


He 
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He 


Val 


Val 


Val 

















290 295 



<210> 158 
<211> 6588 
<212> DNA 
<213> SHRIMP 



<400> 158 

atgcaactcc 

ccttctggaa 

aatgagatgg 

gaaattaacg 

agatggtgcc 

tacaagtgca 

aagaggaaga 

ggaaaggaag 

ggtgttgatt 

ggaaactatg 

gatgaatctt 

gatgaacgac 

ttccatatgc 

ccacttggac 

catgaatttg 

ccccgattca 

gtgtttattc 

cgaggttctg 

tttcaagtaa 

gaaactctcc 

aagggattct 

cagagtgtca 

actggtttca 

ggctgtagat 

atgaggatcc 

atcatctttg 

atagggcagc 

aaaaaatcag 

ggaggagaga 

gagactataa 

caagtgattg 

gataaagtga 

gtctttactc 

gcagatgcat 

gaaactccat 



gtctagaaaa 
tgactggatt 
acgccctaat 
ccaatatgga 
accacgagaa 
ggagaaaaat 
atgttggtgg 
ccctagaagc 
caacaccatg 
gatgtattgc 
ccttcatatc 
tacgaaacaa 
tggctgatat 
ccaatggcag 
ttacccattc 
attttgatat 
aaaattgtca 
cttctgggaa 
caaaatgtaa 
ctcgtctacc 
gtagagggga 
agaatttttg 
tctcttcaga 
ctaatcgtgc 
taaagtcacg 
aaaacgttaa 
acaagatact 
tcatcacatc 
gaggagtatt 
taaattcgta 
aaattttcac 
agcctcacgt 
gtatcgtcaa 
ctacagcaga 
atagattgag 



ctttgtcaag 
caacatattt 
ttttatggcg 
cttgtacacg 
gagtttttct 
catgcaagcc 
agcacctgcg 
tagtttcgat 
cccccattct 
ctgcttcgca 
aacggatgcc 
tgaaaatgga 
tacccaaaag 
gcagtggaat 
tctagtaaac 
cttgtacaac 
cattttaact 
gtggtggact 
agttgaaaag 
taatccagga 
aaatctaggg 
tctcatggtt 
atcgatagtg 
ccaagatact 
tgaaggtgct 
ccacgatacg 
gtactatgat 
tattgggttc 
tggactggtt 
cgatcctgaa 
caacgagttt 
gattagtggg 
acatttgagt 
atctgtcctt 
cacccctcaa 



gaagaacatg 
aatagttccc 
gctttcttga 
tttgattatg 
gtcgtccgag 
ctggacaata 
ttcacattta 
gtgattgggg 
tcagccatgc 
tcaatgttct 
tctaaaattg 
ggagaagaaa 
gctcatgaaa 
tggcctactc 
acattgaaaa 
ttgcttaatc 
ggacataaaa 
attaattttg 
gatagaaaaa 
agcactaccg 
agtgtaggtg 
gaaaatagga 
tctgatccct 
attaataatg 
cgtgtatggt 
gccatctcta 
attgaaacaa 
tgtttgtgta 
gcacctggat 
gaaaaggaag 
gaaatgttgc 
tggaacaatg 
gatcacacca 
cctagagcaa 
gaaagaatac 



aaactgtcgt 
ccgtgtattt 
agcacaatag 
cgggtgcttt 
cacaacttat 
actacaacaa 
tgagcgggga 
gaacaagagg 
aactaaaact 
ttgtattgga 
gacaagcgca 
ataatgtctt 
ctgcctattc 
acactgtgga 
atctagggga 
catttggaaa 
acaatgaaaa 
tgggtgtgaa 
tatccgattt 
tcgatgacag 
aagtcgtatc 
aatttagtgt 
tcttttcact 
gccgagttag 
tggccaagga 
cggacgctat 
cagataaaga 
cgggaggcga 
ccgacgtgga 
acattatgaa 
ttggttttgg 
tagcttttga 
aagacatgtc 
cagaaggagg 
aactagcaag 



ggtgcacaac 
tgaagtgcac 
tttatgggga 
tctggacgaa 
caactcgtat 
caagaataag 
cggagaggga 
aggaagattt 
ggacaatgaa 
gaacccaggt 
agcatggata 
taaaaagacc 
caataccatc 
acctattgcc 
tagaaaactt 
aatgttgcta 
tgtggtgcct 
catgtggact 
ggcctgtatg 
aatagttttt 
cgacattaca 
ggataaagaa 
agaagtgact 
tgctcgtgta 
tgaaaatgcc 
ggagcgagct 
tttcaccgac 
tatgacacat 
aaaggtgaaa 
acagtgccct 
aaagtatata 
cgaccccttt 
ttattgtgta 
aggaggagga 
cactggtatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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ttcaataaat 
gcagatttat 
ttatcctcca 
agtgctattc 
gaatttaatt 
gtaaaagatg 
caggatcagg 
tctacctcca 
gaggcggtcg 
agaaatatgg 
gccacagaca 
acgattgaca 
acctttgtga 
aaaaacaaaa 
gtgttgtccg 
aatagacaaa 
ttggatgccg 
gtcactgccg 
gcggccaagg 
gtattgccca 
tgtcgtacat 
gaggaaatta 
agtcctgtga 
tatgaaggca 
gttggcgcat 
aatgttggaa 
cctcaaatgc 
tttgccaaga 
aagctcggta 
gagtttgcgt 
ggaattggag 
tgtgtatttg 
tacgggga ta 
ccagaatatg 
aaaatggctg 
gagcgccgag 
ggtcttttgt 
ttcaatgaca 
aagagaccta 
gcggccgaag 
gacgagaaga 
tctgttatca 
gacgccatga 
gcgaatgcag 
accaccaaac 
aagaaaagct 
acaaatcctg 
cagagtagac 
atttcattgg 
gaggaagggg 
aacatgaagg 
atcacatctg 
aaaaatgtag 
gcattgtcat 
tctctaaata 
gacgacgacg 
aacaaaggtg 
ggtgaagaaa 
aagggaaaga 
gatgtca ttc 
agaaccagag 



tgggaaaatt 
tggccggggc 
gtaataaagg 
ggttggatct 
tgaacgccgt 
aagtagacct 
caaaagtaca 
tcaacaagga 
tgacagccaa 
gagaagaaag 
caaagggagg 
tgacaagttt 
cccatcgaca 
ccaccgactc 
agtacagacc 
ctccaattac 
agaagacaaa 
caggtatgga 
tgaatgatga 
taatgttaat 
tggaggatat 
taagaaccca 
cccaaatggc 
atgaaaaatt 
ttgattcggc 
tgttggttag 
aagccactta 
tgggagacat 
tgaattccat 
ctggatctgc 
gaggcacaag 
gcaatattgg 
cagattctgt 
atgaacaaac 
caattattcc 
acgctggtgt 
ttgccaagaa 
tgataaaatt 
gccacgcaga 
gacccgctgg 
gtatggagga 
acaacttgta 
cttcccgccc 
ctttcacccc 
taaagggtct 
gttatttgga 
ctatgatgat 
ctaatcctct 
ggcagaaatt 
aagtccctgc 
gaagtgttcc 
tatacaagat 
aagtgttgtg 
ttaa tacagg 
ataaacaaat 
acgacgaaga 
actgtgtcac 
ctaaaacaag 
cggaggaaat 
ttgaccgttt 
tcttacaaca 



tgtagacaag 
agaaagtcag 
atcagcagga 
catgaaagtg 
gctcgccaaa 
acactttcat 
cgtctattgt 
aggggagatt 
cctggccact 
ggcagataga 
tatggtgagt 
gtacccgatg 
aattatgcaa 
tttattgttg 
cattgatatt 
tcgcatagag 
taataaaacg 
ttacttcccc 
tatgcatata 
cgacagaccg 
tctctcagaa 
ttggacattt 
tcgtcacatt 
cgagaggttg 
caatgatcca 
gacatggaac 
cagagccgat 
gaaaagagct 
gtatggtcat 
caatactgcc 
gcactcggtg 
ttgtggatta 
attctgcgtg 
tggcaaatat 
catcctagtc 
gggcatgatg 
aacataccat 
gaaatcaacc 
tgggtatgtt 
taagaaattg 
atggtttacc 
tgcctcacaa 
catagaagcg 
ttacaaaaag 
ccaatcattg 
tgtgatgaag 
tactagttcc 
agctctagcg 
taagacggtg 
tggttatttt 
tgcattttca 
ggttgagagc 
ttctacatct 
cgtcattgtt 
gttattgttt 
agcagaagaa 
ggaaaaaaag 
cgagaaaacg 
tgctagttcg 
actaaaagca 
atatagcaat 



aaaactggca 
gccaatacca 
tggttccaga 
tgcgaaaagg 
gtgagtagtg 
ctattgggat 
tgcaaggatg 
tttaggctgt 
cctctatgta 
agaggtgtgg 
caacctatcg 
accatgtgtc 
ctgagggata 
ttggacgtta 
gcagtcgcat 
gaaagtttgg 
tggtgcacca 
gagattgtgt 
gccccagcaa 
tacattggcg 
ctcgaaaagg 
aagggtcaaa 
attgaatcta 
gttagcttat 
gctgttagac 
gtaaaaactg 
cgagttgtga 
ggtctaaaca 
ttggccctaa 
tcaagtattt 
acggccaatc 
cagatggctc 
cataatattg 
tattatgtga 
aactcgttaa 
aatatcgccc 
atgcttcact 
gataacaata 
gttccccata 
aaaagctttt 
tcttcaccta 
attgtagggg 
ggtacagaaa 
ggagcctttg 
attgcaagat 
aatcatgtgg 
cgagtcaaca 
ataaataacc 
acatcagttt 
aacgctggta 
gtcaagaatt 
gataagacgg 
gccaatactg 
acaaaggacg 
gttggagggg 
gaggacgaag 
atccctggac 
gagggagaaa 
ttgagtaaat 
acacattctt 
tgtacattat 



tgttgaaacc 
agtttaagga 
aaattattgg 
cctataaaga 
tcggcgacaa 
tcttgaagct 
cctacttgac 
gtatggactc 
taggagaagg 
gagtaagaag 
tcaatcatgt 
agaataatct 
gattggtact 
ttgacgagtg 
catggaagaa 
gtctaagatt 
atacatctcc 
gtgacattaa 
gtttagagta 
cacacataac 
acttttctgt 
aacaatacga 
cgggaagaaa 
cagacagaat 
tgtggtcttc 
acattcttaa 
tgcagaacaa 
aagttggaca 
gagcacgttc 
ccaacatgtc 
agattacaga 
ttcctggtac 
taggtgatgg 
tggatattgc 
caaagggcat 
atgaacgtct 
ttaatgaaaa 
ataagtttgc 
atccttcatt 
tggaagagga 
cgtggatggc 
tggagaaggg 
tgatggaggc 
tgaagaaggg 
ttttaccaaa 
agaattttgc 
agtttgatac 
acctgaaccc 
cttcttggag 
gcgtgcgttg 
tatctgttgt 
caataaaatc 
gattttctct 
tggctatggc 
gaaaggatta 
aaaatggtga 
gaagcactaa 
gaaagggctc 
gtgggaagaa 
cttgcaccaa 
cttcctatat 



tgaaatgact 
acgcaacaag 
cggtatgtgc 
atccctctct 
ggttaaaaat 
gaagaaggcc 
tggtatagtt 
tgctttaacc 
agcaatctgt 
acactctatt 
tccctatcaa 
gtgcaccact 
tgaaaaaatg 
caatcagatt 
tagcaactct 
catagaaaat 
caatatgaat 
tatgcagttt 
tatgcttcaa 
agctggaaaa 
tgaaaaagat 
tttctgtcat 
tatccgtgat 
ttatcgccgc 
tcgtctaatc 
gggaatcatc 
ggccaaggag 
aaatattatg 
gagccgtaaa 
agccaccgga 
aaatgctcga 
taagcagacg 
aggaatgata 
tctaaaaaat 
ccagtttgta 
agctgtcgct 
tagtgcagcg 
atcctttatc 
gattcttaga 
aggaattcat 
catggatgct 
taactggatt 
ggtgacgcaa 
aattacaccc 
aatagaggaa 
atcccatata 
gtcaaaagaa 
ttcttcagaa 
tctttcggca 
ggatgccacc 
gcccaacgcc 
catgattgct 
gagaagagga 
ttgtatacga 
cggtgaagac 
aaacgaagag 
caaggatgtt 
taagacggca 
agatgcgaga 
caatgaagag 
aacttcagtc 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4 020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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atgaaattgg 
atacgtaatc 
gacgccatgg 
ttgactcaag 
ccattaatgg 
acagatacat 
gctttaaaat 
ttgatggctg 
gtaaaagatg 
gacacaaata 
ggtatgagtg 
ggtactgtgt 
ctttgtaacg 
acggcagcag 



accaaagagt 
tctccaacaa 
ttgctgcagt 
accattggaa 
gtgtgccttt 
gcgacatggc 
ttgagaacga 
agatgttgtt 
ctcaagatag 
caattatatt 
ttagtgaaat 
ggaattatac 
aacgtgtcaa 
tagaattggc 



agcagaccaa 
gaagaggcaa 
taaggttaag 
aaagtgcccc 
tgaagttgca 
ttgttgtcaa 
aagattggcc 
cggaggggat 
aaagttggtg 
tttgtttgag 
aaaagacgct 
tgatgaaaga 
tggaaattgt 
tgctgcatgt 



atggaaaatt 
gaaaagggag 
tttttcccag 
gtgtccatcc 
ctcaattctc 
tcattgtatt 
cggcaaattg 
aaactattgg 
aaatctttat 
tctttaaggt 
gttagaggtc 
ttttttggac 
aaattgtcct 
ctatcttgtg 



taatatctca 
ggccttttaa 
ttttagatgc 
cagaaacgcg 
taataggaaa 
ttgtcctctt 
gcctagatga 
cccaggaagt 
tgcctttaaa 
ttgctcagaa 
tggccttttc 
cattgtataa 
ttataactgg 
ttttgtaa 



attggatcaa 
gtctgaattg 
gtctagaaaa 
tgaagaaaaa 
acacaagtgc 
gtacacgcta 
ctctgtagat 
gttaaaaagg 
ttataaccat 
acctgtagct 
taccactaca 
catggatgaa 
tatttatcat 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6588 



<210> 159 
<211> 2183 
<212> PRT 
<213> SHRIMP 
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TV an 
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Arg 
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Met 


Arg 
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a on 
ft 0 u 




Leu 
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Ser 


Arg 
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TV I — 
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Arg 


Val 
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Ala 


Tin. 

lie 
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vai 
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His 


Asp 
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Aia 


1 ie 
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Asp 


Ala 
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Arg 
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Tyr 


Asp 
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Arg 


Gly 
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Gly 
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Val 
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Ser 
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GlU 


Lys 
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Val 


Lys 


Glu 


Thr 


He 


He 


Asn 


Ser 


Tyr 


Asp 


Pro 


Glu 


GlU 
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Lys 


GlU 


Asp 
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He 


Met 
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Gin 


Cys 


Pro 


Gin 


val 


lie 


Glu 


He 
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Fne 
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705 
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Gly Ala 
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Glu 


Arg 
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Ser 


Ser 


Ser 
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Gly 
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Ala 


Gly 
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745 
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He 
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Met 
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Val 
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Glu 
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Glu 


Ser 
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Ser 


Glu 
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Leu 


Asn 


Ala 
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780 










Val 


Leu 


Ala 
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Val 
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Val 
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Val 
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Val 
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1140 1145 1150 



Asn He Arg Asp Tyr Glu Gly Asn Glu Lys Phe Glu Arg Leu Val Ser 

1155 1160 1165 

Leu Ser Asp Arg He Tyr Arg Arg Val Gly Ala Phe Asp Ser Ala Asn 

1170 1175 1180 

Asp Pro Ala Val Arg Leu Trp Ser Ser Arg Leu He Asn Val Gly Met 
1185 1190 1195 1200 

Leu Val Arg Thr Trp Asn Val Lys Thr Asp He Leu Lys Gly He He 

1205 1210 1215 

Pro Gin Met Gin Ala Tyr Ala Asp Arg Val Val Met Gin Asn Lys Ala 

1220 1225 1230 

Lys Glu Phe* Ala Lys Met Gly Asp Met Lys Arg Ala Gin Lys Val Gly 

1235 1240 1245 

Gin Asn He Met Lys Leu Gly Met Asn Ser Met Tyr Gly His Leu Ala 

1250 1255 1260 

Leu Arg Ala Arg Ser Ser Arg Lys Glu Phe Ala Ser Gly Ser Ala Asn 
1265 1270 1275 1280 

Thr Ala Ser Ser He Ser Asn Met Ser Ala Thr Gly Gly He Gly Gly 
1285 1290 1295 
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Gly Thr Arg His Ser Val Thr Ala Asn Gin He Thr Glu Asn Ala Arg 

1300 1305 1310 

Cys Val Phe Gly Asn He Gly Cys Gly Leu Gin Met Ala Leu Pro Gly 

1315 1320 1325 

Thr Lys Gin Thr Tyr Gly Asp Thr Asp Ser Val Phe Cys Val His Asn 

1330 1335 1340 

He Val Gly Asp Gly Gly Met He Pro Glu Tyr Asp Glu Gin Thr Gly 
1345 1350 1355 1360 

Lys Tyr Tyr Tyr Val Met Asp He Ala Leu Lys Asn Lys Met Ala Ala 

1365 1370 1375 

He He Pro He Leu Val Asn Ser Leu Thr Lys Gly He Gin Phe Val 

1380 1385 1390 

Glu Arg Arg Asp Ala Gly Val Gly Met Met Asn He Ala His Glu Arg 

1395 1400 1405 

Leu Ala Val Ala Gly Leu Leu Phe Ala Lys Lys Thr Tyr His Met Leu 

1410 1415 1420 

His Phe Asn Glu Asn Ser Ala Ala Phe Asn Asp Met He Lys Leu Lys 
1425 1430 1435 1440 

Ser Thr Asp Asn Asn Asn Lys Phe Ala Ser Phe He Lys Arg Pro Ser 

1445 1450 1455 

His Ala Asp Gly Tyr Val Val Pro His Asn Pro Ser Leu He Leu Arg 

1460 1465 1470 

Ala Ala Glu Gly Pro Ala Gly Lys Lys Leu Lys Ser Phe Leu Glu Glu 

1475 1480 1485 

Glu Gly He His Asp Glu Lys Ser Met Glu Glu Trp Phe Thr Ser Ser 

1490 1495 1500 

Pro Thr Trp Met Ala Met Asp Ala Ser Val He Asn Asn Leu Tyr Ala 
1505 1510 1515 1520 

Ser Gin He Val Gly Val Glu Lys Gly Asn Trp He Asp Ala Met Thr 

1525 1530 1535 

Ser Arg Pro He Glu Ala Gly Thr Glu Met Met Glu Ala Val Thr Gin 

1540 1545 1550 

Ala Asn Ala Ala Phe Thr Pro Tyr Lys Lys Gly Ala Phe Val Lys Lys 

1555 1560 1565 

Gly He Thr Pro Thr Thr Lys Leu Lys Gly Leu Gin Ser Leu He Ala 

1570 1575 1580 

Arg Phe Leu Pro Lys He Glu Glu Lys Lys Ser Cys Tyr Leu Asp Val 
1585 1590 1595 1600 

Met Lys Asn His Val Glu Asn Phe Ala Ser His He Thr Asn Pro Ala 

1605 1610 1615 

Met Met He Thr Ser Ser Arg Val Asn Lys Phe Asp Thr Ser Lys Glu 

1620 1625 1630 

Gin Ser Arg Pro Asn Pro Leu Ala He Asn Asn His Leu Asn Pro Ser 

1635 1640 1645 

Ser Glu He Ser Leu Gly Gin Lys Phe Lys Thr Val Thr Ser Val Ser 

1650 1655 1660 

Ser Trp Ser Leu Ser Ala Glu Glu Gly Glu Val Pro Ala Gly Tyr Phe 
1665 1670 1675 1680 

Asn Ala Gly Ser Val Arg Trp Asp Ala Thr Asn Met Lys Gly Ser Val 

1685 1690 1695 

Pro Ala Phe Ser Val Lys Asn Leu Ser Val Val Pro Asn Ala He Thr 

1700 1705 1710 

Ser Val Tyr Lys Met Val Glu Ser Asp Lys Thr Ala He Lys Ser Met 

1715 1720 1725 

He Ala Lys Asn Val Glu Val Leu Cys Ser Thr Ser Ala Asn Thr Gly 

1730 1735 1740 

Phe Ser Leu Arg Arg Gly Ala Leu Ser Phe Asn Thr Gly Val lie Val 
1745 1750 1755 1760 

Thr Lys Asp Val Ala Met Ala Cys He Arg Ser Leu Asn Asn Lys Gin 

1765 1770 1775 

Met Leu Leu Phe Val Gly Gly Gly Lys Asp Tyr Gly Glu Asp Asp Asp 
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1780 1785 1790 

Asp Asp Asp Glu Glu Ala Glu Glu Glu Asp Glu Glu Asn Gly Glu Asn 

1795 1800 1805 

Glu Glu Asn Lys Gly Asp Cys Val Thr Glu Lys Lys lie Pro Gly Arg 

1810 1815 1820 

Ser Thr Asn Lys Asp Val Gly Glu Glu Thr Lys Thr Ser Glu Lys Thr 
1825 1830 1835 1840 

Glu Gly Glu Arg Lys Gly Ser Lys Thr Ala Lys Gly Lys Thr Glu Glu 

1845 1850 1855 

lie Ala Ser Ser Leu Ser Lys Cys Gly Lys Lys Asp Ala Arg Asp Val 

I860 1865 1870 

lie Leu Asp Arg Leu Leu Lys Ala Thr His Ser Ser Cys Thr Asn Asn 

1875 1880 1885 

Glu Glu Arg Thr Arg Val Leu Gin Gin Tyr Ser Asn Cys Thr Leu Ser 

1890 1895 1900 

Ser Tyr lie Thr Ser Val Met Lys Leu Asp Gin Arg Val Ala Asp Gin 
1905 1910 1915 1920 

Met Glu Asn Leu lie Ser Gin Leu Asp Gin He Arg Asn Leu Ser Asn 

1925 1930 1935 

Lys Lys Arg Gin Glu Lys Gly Gly Pro Phe Lys Ser Glu Leu Asp Ala 

1940 1945 1950 

Met Val Ala Ala Val Lys Val Lys Phe Phe Pro Val Leu Asp Ala Ser 

1955 I960 1965 

Arg Lys Leu Thr Gin Asp His Trp Lys Lys Cys Pro Val Ser He Pro 

1970 1975 1980 

Glu Thr Arg Glu Glu Lys Pro Leu Met Gly Val Pro Phe Glu Val Asn 
198 5 1990 1995 2000 

Ser Leu He Gly Lys His Lys Cys Thr Asp Thr Cys Asp Met Ala Cys 

2005 2010 2015 

Cys Gin Ser Leu Tyr Phe Val Leu Leu Tyr Thr Leu Ala Leu Lys Phe 

2020 2025 2030 

Glu Asn Glu Arg Leu Ala Arg Gin lie Gly Leu Asp Asp Ser Val Asp 

2035 2040 2045 

Leu Met Ala Glu Met Leu Phe Gly Gly Asp Lys Leu Leu Ala Gin Glu 

2050 2055 2060 

Val Leu Lys Arg Val Lys Asp Ala Gin Asp Arg Lys Leu Val Lys Ser 
2065 2070 2075 2080 

Leu Leu Pro Leu Asn Tyr Asn His Asp Thr Asn Thr lie lie Phe Leu 

2085 2090 2095 

Phe Glu Ser Leu Arg Phe Ala Gin Lys Pro Val Ala Gly Met Ser Val 

2100 2105 2110 

Ser Glu He Lys Asp Ala Val Arg Gly Leu Ala Phe Ser Thr Thr Thr 

2115 2120 2125 

Gly Thr Val Trp Asn Tyr Thr Asp Glu Arg Phe Phe Gly Pro Leu Tyr 

2130 2135 2140 

Asn Met Asp Glu Leu Cys Asn Glu Arg Val Asn Gly Asn Cys Lys Leu 
2145 2150 2155 2160 

Ser Phe He Thr Gly He Tyr His Thr Ala Ala Val Glu Leu Ala Ala 

2165 2170 2175 

Ala Cys Leu Ser Cys Val Leu 
2180 



<210> 160 
<211> 4172 
<212> DNA 
<213> SHRIMP 

<400> 160 

atgactgaac aaggggatca aggaataaaa gtaaggaaat tacatggccc gagaggagaa 60 
agaggagaaa ctggtccagc aggagcagtt ggccctgcag gccctcaagg agaaagagga 120 
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gcaattggac 
ggagcaattg 
agaggagaaa 
gcaattggac 
ggaccggcag 
aatggacgcc 
gcaattggac 
ggagcaacag 
agaggagaaa 
agaagaggag 
ggagaaaaag 
agaggagaga 
ttcccaggac 
ggccctcaag 
gctggcctcc 
gcaactggtc 
ggagcaacag 
agaggagcaa 
gaaagaggag 
ggagaaagag 
gctggtcctc 
cctataggcc 
ggtcctcaag 
gctggccctc 
cctgctggcc 
ggccctatag 
gatggagcaa 
cgcccaggaa 
ggattaactg 
actggtgcaa 
gcaacaggcc 
ggagaaaatg 
caaggagaaa 
gaaaagggag 
ggagaaaggg 
ccaggaaggg 
cgcacaggaa 
ggattagctg 
attggaccta 
gaggatggtt 
ggaccgcaag 
ccagggcgac 
gcacctggta 
ggccctattg 
gacggagaaa 
acaactggcg 
ggtgcacaag 
gacggtccac 
cctagaggtc 
ggacctcgag 
gaaggtcctc 
catcgaggag 
ggggaaaaag 
cctggcacgg 
gcacaagggg 
gggccgatgg 
agaggagaga 
aaagatgccg 
tgggaaggaa 
cctgtgatga 
atacaagtaa 



cggcaggaaa 
gaccggcagg 
atggacgccc 
cggcaggaaa 
gaaaggatgg 
caggaagaga 
cggcaggaag 
gtataccagg 
ttggacgccc 
caacaggacg 
gagaagctgg 
ctggaccacc 
caccaggcga 
ggaaaagagg 
caggaagaga 
taccaggaag 
gccgcgcagg 
caggtatacc 
aaactggacc 
gagaaaatgg 
aaggagaaaa 
ctagaggaga 
gaagaagagg 
caggagaaag 
ctcaaggaga 
gtcctgctgg 
ctggccctat 
gagatggagc 
gacgcccagg 
tgggaaagaa 
gcgcaggaaa 
gacgcccagg 
caggattagc 
aaaatggacg 
gagagactgg 
atggtgtgga 
gggatggggc 
gcctgccagg 
taggcccacg 
ccacaggccc 
gagaaagagg 
aaggaaggga 
atgatggtct 
gacctcaagg 
tgggaccagc 
ctaaaggacc 
gatctagagg 
ccggtgaaga 
ctagaggtat 
gtccccgagg 
gaggtcctag 
aaaagggtga 
gtccaagagg 
aaggggttag 
aattaggtcc 
ggcgtacagg 
aaggaggtag 
taaatatcat 
atgaaatcgg 
tagaaaa tag 
tgacaatgca 



ggatggagca 

aaaggatgga 

aggaagagat 

ggatggagca 

agcagttggc 

tggagcagtt 

agatggagca 

aagggatggc 

aggaagagat 

cgcaggaaag 

taaggacggt 

gggaagggac 

aactggacca 

agaaacagga 

tggagcaatt 

gaatggtgtg 

aaaggatggg 

aggaagggat 

agcaggaagg 

acgcccagga 

aggagaaaat 

aactggtgca 

agcaacaggc 

aggagaaact 

aacaggatta 

ccctcaagga 

aggtcctgct 

aactggccct 

aagagatgga 

tggtgtggac 

ggatggagca 

aagagatgga 

tgggctgcca 

cccaggaaag 

tcctataggt 

tggttctgtt 

aattggccct 

gatagatgga 

aggagaaaga 

tatgggcccc 

attaaaggga 

tggaataatg 

agagggacct 

aataagagga 

cggcaaggac 

tagaggatta 

agaaaaaggc 

aggtcctcaa 

tcgtggccgt 

aacaaaagga 

aggtatacaa 

taaaggagac 

t t tacgagga 

agg tcctaga 

ccaaggacca 

agatactggt 

aggaagaaag 

acaaaaatat 

agaagcatac 

aatagggatg 

ctcaataaca 



gttggccctg 
gcagttggcc 
ggagcagttg 
gttggccctc 
cctgcaggcc 
ggccctgcag 
gttggccctg 
gtggacggtt 
ggagcagttg 
gatggtgcag 
tctatagggc 
ggcactgcag 
ccaggaaagg 
cccgttggac 
ggccctgcag 
gatggttcta 
gcagttggcc 
ggtgtggacg 
gacggttcag 
agagatgggg 
ggacgcccag 
atgggaaaga 
cgcgcaggaa 
ggaccagcag 
actggcagcc 
gaaaaaggag 
ggccctcaag 
ataggtcctg 
gcaactggcc 
ggttctacgg 
gttggccctg 
gcaactggcc 
ggaagagatg 
gatggggcaa 
cctgctggcc 
ggccctcaag 
gtaggtcctg 
aaggacggtt 
ggagaaactg 
caaggactaa 
cggccaggaa 
ggtcctaggg 
gaaggaagag 
ttaaaaggta 
ggaatagaag 
agaggttttc 
gatagagggc 
ggtcttagag 
tcaggacctc 
agaacaggaa 
ggaaaggaag 
cgtggagaac 
attcgaggcc 
ggggtgagag 
acaggtcctc 
cccatgggcc 
ggaaaaaatg 
tcaatcaccc 
attggaagat 
acaaatgagg 
actagaggaa 



caggccctca 
ctcaaggccc 
gccctcaagg 
aaggagaaag 
ctcaaggaga 
gccctccagg 
caggccctcc 
ctgtgggccc 
gccctgcagg 
ttggtcctgc 
ctcaaggaat 
cagaaagagg 
atggtgtgga 
ctaggggtga 
gccctccagg 
tcggccccca 
ctgcaggccc 
gttctgtggg 
ttggccctgc 
caactggccc 
gaagagatgg 
atggcgtgga 
aggatggggc 
gaagggacgg 
caggaagaga 
aaaatggacg 
gagaaaaagg 
ctggccctca 
ctataggtcc 
gtcctcaagg 
ctggccctcc 
ctataggtcc 
gagcaattgg 
ctggccctat 
ctcaaggagc 
gaaaaagagg 
caggccctaa 
ccgtgggtcc 
gacgaccagg 
gaggagctac 
aagatggtga 
gtcttcgagg 
atggtgcacc 
tccagggacg 
gccctagagg 
aaggaagaac 
taacaggccc 
gagaaagggg 
aaggaagtaa 
tacaaggcct 
gaagaatggg 
aaggcatcgc 
ctattggtgc 
gtgttcctgg 
aagggccagc 
ctcctggagc 
gccctaaagg 
atgctcgtgc 
cttatggaac 
acaaaaaaaa 
gaacatcggg 



aggagaaaga 
tccaggagaa 
agaaagagga 
aggagcaatt 
aagaggagaa 
agaaagagga 
aggagaaaga 
tcaaggagaa 
ccctcaagga 
aggccctcaa 
acaaggccca 
agaaagaggc 
tggttctgag 
accaggtcta 
agaaagagga 
aggaagaaga 
tccaggagaa 
ccctccagga 
tggccctcat 
tataggtcct 
agcaactggc 
cggttctatg 
agttggccct 
ttcagttggc 
tggagcaact 
cccaggaaga 
agaaaatgga 
aggagaaaca 
tagaggagaa 
aagaagagga 
aggagaaaga 
tgctggccct 
tcctcaagga 
gggtcctcca 
aactggtctt 
attaataggg 
aggagaaaca 
tcaaggagca 
aagggacggt 
gggagctcca 
aacaggtcct 
agaaaaagga 
tggtcccgct 
accaggaaga 
tcaagatgga 
aggagaaact 
tcaaggaaga 
agcacctggc 
cggcgtgcaa 
cactggcata 
gaaaattgga 
tggagcagac 
tcctggtaag 
ctatcctggc 
aggtcctcaa 
agtgggacca 
agcggacgga 
agagataatg 
tgatacaatc 
cgaatattgt 
tgtttttgtg 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
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gtaagcaata 
tgtagaacag 
gaaagcaaat 
tcaaaaattg 
gacgctgatg 
ggggaaaaag 
cccactgaag 



agacagatta 
atgtcagtac 
cgtacagatt 
ccgtggtaat 
tggcgcgaag 
atagaggttt 
aagagttatt 



tatcctttta 
aaatgcgagg 
tattaggccg 
gtatccagac 
agaaacaacg 
ctatgctgat 
ggtattgcaa 



gttactttac 
tcagagaggg 
tctgaccaat 
gcaagcatga 
tctgtgcttt 
agaggaactg 
gc 



tgatgccaga 
tgaatgctgt 
ctataggtac 
gttactcagt 
tattagcaga 
tagggaggtt 



aagtgtttcc 
tagagaaaga 
tcattcacgt 
tgatacatta 
aaccatacac 
gatggtacct 



3840 
3900 
3960 
4020 
4080 
4140 
4172 



<210> 161 
<211> 1389 
<212> PRT 
<213> SHRIMP 



<220> 

<221> VARIANT 

<222> {!)... (1389) 

<223> Xaa = Any Amino Acid 



<400> 161 



Met 


Thr 


Glu 


Gin 


Gly 


Asp 


Gin 


Gly 


1 








5 








Pro 


Arg 


Gly Glu 


Arg 


Gly 


Glu 


Thr 








20 










Ala 


Gly 


Pro 
35 


Gin 


Gly 


Glu 


Arg 


Gly 
40 


Gly Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 




50 










55 




Pro 


Ala 


Gly Lys 


Asp 


Gly 


Ala 


Val 


65 










70 






Arg 


Gly 


Glu 


Asn 


Gly 
85 


Arg 


Pro 


Gly 


Gly 


Glu 


Arg 


Gly 
100 


Ala 


He 


Gly 


Pro 


Pro 


Gin 


Gly Glu 


Arg 


Gly 


Ala 


He 






115 










120 


Val 


Gly 
130 


Pro 


Ala 


Gly 


Pro 


Gin 
135 


Gly 


Gly Arg 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


145 










150 






Ala 


He 


Gly 


Pro 


Ala 
165 


Gly 


Arg 


Asp 


Pro 


Gly 


Glu 


Arg 
180 


Gly 


Ala 


Thr 


Gly 


Gly 


Ser 


Val 
195 


Gly 


Pro 


Gin 


Gly 


Glu 
200 


Arg 


Asp 


Gly Ala 


Val 


Gly 


Pro 


Ala 




210 










215 




Thr 


Gly 


Arg 


Ala 


Gly 


Lys 


Asp 


Gly 


225 










230 






Gly 


Glu 


Lys 


Gly 


Glu 
245 


Ala 


Gly 


Lys 


He 


Gin 


Gly 


Pro 
260 


Arg 


Gly 


Glu 


Thr 


Ala 


Ala 


Glu 

275 


Arg 


Gly 


Glu 


Arg 


Gly 
280 


Gly 


Pro 
290 


Pro 


Gly 


Lys 


Asp 


Gly 
295 


Val 


Lys 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


Val 


305 










310 






Ala 


Gly 


Leu 


Pro 


Gly 
325 


Arg 


Asp 


Gly 



lie 


Lys 
10 


Val 


Arg 


Lys 


Leu 


His 
15 


Gly 


Gly 


Pro 


Ala 


Gly 


Ala 


Val 


Gly 


Pro 


25 










30 






Ala 


He 


Gly 


Pro 


Ala 
45 


Gly 


Lys 


Asp 


Gin 


Gly 


Glu 


Arg 
60 


Gly 


Ala 


He 


Gly 


Gly 


Pro 


Gin 
75 


Gly 


Pro 


Pro 


Gly 


Glu 
80 


Arg 


Asp 
90 


Gly 


Ala 


Val 


Gly 


Pro 
95 


Gin 


Ala 


Gly 


Lys 


Asp 


Gly 


Ala 


Val 


Gly 


105 










110 






Gly 


Pro 


Ala 


Gly 


Lys 
125 


Asp 


Gly 


Ala 


Glu 


Arg 


Gly 


Glu 
140 


Asn 


Gly 


Arg 


Pro 


Ala 


Gly 


Pro 
155 


Pro 


Gly 


Glu 


Arg 


Gly 
160 


Gly 


Ala 
170 


Val 


Gly 


Pro 


Ala 


Gly 
175 


Pro 


He 


Pro 


Gly 


Arg 


Asp 


Gly 


Val 


Asp 


185 










190 






Arg 


Gly 


Glu 


He 


Gly 
205 


Arg 


Pro 


Gly 


Gly 


Pro 


Gin 


Gly 
220 


Arg 


Arg 


Gly 


Ala 


Ala 


Val 


Gly 
235 


Pro 


Ala 


Gly 


Pro 


Gin 
240 


Asp 


Gly 
250 


Ser 


He 


Gly 


Pro 


Gin 
255 


Gly 


Gly 


Pro 


Pro 


Gly 


Arg 


Asp 


Gly 


Thr 


265 










270 






Phe 


Pro 


Gly 


Pro 


Pro 
285 


Gly 


Glu 


Thr 


Asp 


Gly 


Ser 


Glu 
300 


Gly 


Pro 


Gin 


Gly 


Gly 


Pro 


Arg 
315 


Gly 


Glu 


Pro 


Gly 


Leu 
320 


Ala 


He 
330 


Gly 


Pro 


Ala 


Gly 


Pro 
335 


Pro 
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Lj-L y 


C 1 n 


Arg 


Gly 


Ala 


Thr Gly 


Leu 


XT X. \J 


Gly Arg 


Asn 


Gly 


Val 


Asp 


Gly 








340 








345 










350 






ber 


Tl Q 

lie 


Gly 


Pro 


Gin 


Gly Arg 


Arg 


ij Ay 




Thr 


Gly 


Arg 


Ala 


Gly 


- 

i»y s 






355 


















365 








Asp 


f i ii 
oiy 


Ala 


Val 


Gly 


Pro Ala 


\3 Ay 


Pro 


Pro 


Gly 


Glu 


Arg 


Gly 


Ala 


Thr 




J / u 








375 










380 










biy 


lie 


Pro 


Gly 


Arg 


Asp Gly 


vai 


Asp 


Gly 


Ser 


Val 


uiy 


Pro 


Pro 


m \r 

ul y 


OQc: 
jOj 










390 








395 










400 


Glu 


Arg 


Gly Glu 


Thr 


Gly Pro 


Ala 


Giy 


Arg 


Asp 


Gly 


oer 


Val 


Gly 


Pro 










405 








410 










415 




Ala 


Gly 


Pro 


His 


Gly Glu Arg 


Gly 


blU 


Asn 


Gly Arg 


Pro 


Gly Arg 


Asp 








420 


















430 






Gly 


Ala 


Thr 


Gly 


Pro 


lie Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


biy 


Glu 


Lys 


biy 






435 








440 










d d S 

*i *a J 








Glu 


Asn 


Gly Arg 


Pro 


Gly Arg 


Asp 


Giy 


Ala 


Thr 


Gly 


fro 


He 


Gly 


fro 












455 










4 60 










Arg 


Gly 


Glu 


Thr 


Gly Ala Met 


Gly 


Lys 


Asn 


Gly 


Val 


Asp 


Gly 


Ser 


Mot- 
net 


^D3 










470 








475 










480 


Gly 


Pro 


Gin 


Gly 


Arg Arg Gly Ala 


Tnr 


Gly Arg 


Ala 


Giy 


Lys 


Asp 


bly 










485 








490 










495 




Ala 


val 


Gly 


Pro 


Ala 


Gly Pro 


Pro 


bly 


Glu 


Arg 


Gly 


bXU 


Thr 


Gly 


Pro 








500 


















510 






AX a 


Gly 


Arg 


Asp 


Gly 


Ser Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


xnr 






515 








520 










~>£. D 








Gly 


Leu 


Thr 


Gly 


Ser 


Pro Gly Arg 


Asp 


Gly Ala 


Thr 


Giy 


Pro 


He 


uiy 












535 










540 










Pro 


Ala 


Gly 


Pro 


Gin 


Gly Glu 


Lys 


Gly 


Glu 


Asn 


Gly 


Arg 


Pro 


Gly 


Arg 


C A C 










550 








555 










<^^n 


Asp 


Gly 


Ala 


Thr 


Gly 


Pro lie 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Lys 










565 








570 










575 




Gly 


Glu 


Asn Gly 


Arg 


Pro Gly 


Arg 


Asp 


Gly Ala 


Thr 


Gly 


Pro 


He 


GXy 








580 








585 










590 






Pro 


Ala 


Gly 


Pro 


Gin 


Gly Glu 


Thr 


Gly 


Leu 


Thr 


Gly 


Arg 


Pro 


Gly 


Arg 






595 








600 


















Asp 


Gly 


Ala 


Thr 


Gly 


Pro lie 


Gly 


Pro 


Arg 


Gly 


Glu 


rpu _ 

Tnr 


Gly Ala 


ner 




olU 








615 










620 










Gly 


Lys 


Asn 


Gly 


Val 


Asp Gly 


Ser 


Thr 


Gly 


Pro 


Gin 


Gly 


Arg 


Arg 


GXy 


625 










630 








635 










CIA A 


Ala 


Thr 


Gly Arg 


Ala 


Gly Lys 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 










645 








650 










655 




Pro 


Gly 


Glu 


Arg 


Gly 


Glu Asn 


Gly Arg 


Pro 


Gly 


Arg 


TV c«n 

ASp 


Gly 


Ala 


inr 








660 








665 










67 0 






Gly 


Pro 


lie 


Gly 


Pro 


Ala Gly 


Pro 


Gin 


Gly 


Glu 


Thr 


Gly 


Leu 


Ala 


Gly 






675 








680 


















Leu 


Pro 


Gly Arg 


Asp Gly Ala 


He 


Gly 


Pro 


Gin 


Gly 


bill 


Lys 


Gly 


pi „ 
olu 




con 








695 










700 










Asn 


Gly 


Arg 


Pro 


Gly 


Lys Asp 


Gly Ala 


Thr 


Gly 


Pro 


Met 


Gly 


Pro 


Pro 


/Ub 










710 








715 










720 


Gly 


Glu 


Arg 


Gly 


Glu 


Thr Gly 


Pro 


He 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 










725 








730 










735 




Ala 


Thr 


Gly 


Leu 


Pro Gly Arg 


Asp 


Gly 


Val 


Asp 


Gly 


Ser 


Val 


Gly 


Pro 








740 








745 










750 






Gin 


Gly 


Lys 


Arg 


Gly 


Leu He 


Gly 


Arg 


Thr 


Gly 


Arg 


Asp 


Gly Ala 


He 






755 








760 










765 








Gly 


Pro 


Val 


Gly 


Pro 


Ala Gly 


Pro 


Lys 


Gly 


Glu 


Thr 


Gly 


Leu 


Ala 


Gly 




770 








775 










780 










Leu 


Pro 


Gly 


He 


Asp 


Gly Lys 


Asp 


Gly 


Ser 


Val 


Gly 


Pro 


Gin 


Gly Ala 


785 










790 








795 










800 


lie 


Gly 


Pro 


He 


Gly 


Pro Arg 


Gly 


Glu 


Arg 


Gly 


Glu 


Thr 


Gly Arg 


Pro 










805 








810 










815 




Gly 


Arg 


Asp 


Gly 


Glu 


Asp Gly 


Ser 


Thr 


Gly 


Pro 


Met 


Gly 


Pro 


Gin 


Gly 
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820 










825 










830 






Leu 


Arg 


835 




J. I IX 




A 1 -a 
nX a, 


tr l. \j 
840 


Gly 


Prr> 

IT -L \*J 


Gin 


Gly 


Glu 
845 


Arg 


Glv 
oxy 


Leu 


T \/G 


Tit; 

vj J. y 


Arg 


Pro 


uiy 


Lys 


Asp 

O «J -J 


uiy 


vJ-U 


Thr 


Gly 


Pro 
860 


Pro 


Glv 


Arg 


Gin 


uJ.y 


Arg 


Asp 






Mo t- 


ul y 




Arg 






Arg 


Gly 


Glu 


Lys 


Gly 


O D-> 










ft7fi 

O / kJ 










875 










880 


iv 1 a 


Pro 


oi.y 


Asn 


Asp 


uiy 


Leu 


UlU 


biy 


Pro 


Glu 


Gly Arg 


ASp 


vj J. y 












885 










890 










895 




trZO 


oJLy 


Pro 


/v±a 
onn 




rTO 


i j.e 


(jiy 


Pro 
~ \j ~j 


Gin 


Gly 


He 


Arg 


biy 
910 


Leu 


Lys 




lie 


Cain 

Q1 


oxy 


Arg 


rlO 


ri,, 


Arg 
920 


Asp 


Gly 


Glu 


Met 


Gly 
925 


Pro 


Til ft 


iji y 


Lys 


Asp 
930 


Gly 


lie 


pi „ 

VjxU 


vaiy 


Pro 
935 


Arg 


Gly 


Gin 


Asp 


Gly 
940 


Thr 


Thr 


v>i.y 


AX a 


Lys 


Gly 


Pro 


Arg 


ijj.y 


Leu 


Arg 


Gly 


Phe 


Gin 


Gly 


Arg 


Thr 


Gly 




J. III. 


945 










950 










955 












Gly 


Ala 


Gin 


biy 


Ser 


Arg 


Gly 


Glu 


Lys 


Gly Asp Arg 


Gly 


Leu 


Thr 


Gly 










965 










970 










975 




Pro 


Gin 


Gly 


Arg 
980 


Asp 


Gly 


Pro 


Pro 


Gly 
985 


Glu 


Glu 


Gly 


Pro 


Gin 
990 


Gly 


Leu 


Arg 


Gly 


Glu 


Arg 


Gly 


Ala 


Pro 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 


Gly 


He 


Arg 






995 










1000 








1005 






Gly 


Arg 


Ser 


Gly 


Pro 


Gin 


Gly 


Ser 


Asn 


Gly Val 


Gin 


Gly 


Pro 


Arg 


Gly 




1010 








1015 








1020 








Pro 


Arg 


Gly 


Thr 


Lys 


Gly 


Arg 


Thr 


Gly 


He 


Gin 


Gly 


Leu 


Thr 


Gly 


He 


1025 








1030 








1035 








1040 


Glu 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 


Gly 


He 


Gin 


Gly 


Lys 


Glu 


Gly 


Arg 


Met 










1045 








1050 








1055 


Gly Lys 


He 


Gly 


His 


Arg 


Gly 


Glu 


Lys 


Gly 


Asp 


Lys 


Gly 


Asp 


Arg 


Gly 








1060 








1065 








1070 




Glu 


Gin 


Gly 


He 


Ala 


Gly Ala 


Asp 


Gly 


Glu 


Lys 


Gly 


Pro 


Arg 


Gly 


Leu 






1075 








1080 








1085 






Arg 


Gly 


He 


Arg 


Gly 


Pro 


He 


Gly 


Ala 


Pro 


Gly 


Lys 


Pro 


Gly 


Thr 


Glu 




1090 








1095 








1100 








Gly Val 


Arg 


Gly 


Pro 


Arg 


Gly 


Val 


Arg 


Gly 


Val 


Pro 


Gly 


Tyr 


Pro 


Gly 


1105 








1110 








1115 








1120 


Ala 


Gin 


Gly 


Glu 


Leu 


Gly 


Pro 


Gin 


Gly 


Pro 


Thr 


Gly 


Pro 


Gin 


Gly 


Pro 










1125 








1130 








1135 


Ala 


Gly 


Pro 


Gin 


Gly 


Pro 


Met 


Gly 


Arg 


Thr 


Gly 


Asp 


Thr 


Gly 


Pro 


Met 








1140 








1145 








1150 




Gly 


Pro 


Pro 


Gly 


Ala 


Val 


Gly 


Pro 


Arg 


Gly 


Glu 


Lys 


Gly 


Gly 


Arg 


Gly 






1155 








1160 








1165 






Arg 


Lys 


Gly 


Lys 


Asn 


Gly 


Pro 


Lys 


Gly 


Ala 


Asp 


Gly 


Lys 


Asp 


Ala 


Val 




1170 








1175 








1180 








Asn 


He 


He 


Gin 


Lys 


Tyr 


Ser 


He 


Thr 


His 


Ala 


Arg 


Ala 


Glu 


He 


Met 


1185 








1190 








1195 








1200 


Trp Glu 


Gly 


Asn 


Glu 


He 


Gly 


Glu 


Ala 


Tyr 


He 


Gly 


Arg 


Ser 


Tyr 


Gly 










1205 








1210 








1215 


Thr 


Asp 


Thr 


He 


Pro 


Val 


Met 


He 


Glu 


Asn 


Arg 


He 


Gly 


Met 


Thr 


Asn 








1220 








1225 








1230 




Glu 


Asp 


Lys 


Lys 


Asn 


Glu 


Tyr 


Cys 


He 


Gin 


Val 


Met 


Thr 


Met 


His 


Ser 






1235 








1240 








1245 






He 


Thr 


Thr 


Arg 


Gly Arg 


Thr 


Ser 


Gly 


Val 


Phe 


Val 


Val 


Ser 


Asn 


Lys 




1250 








1255 








1260 








Thr 


Asp 


Tyr 


He 


Leu 


Leu 


Val 


Thr 


Leu 


Leu 


Met 


Pro 


Glu 


Ser 


Val 


Ser 


1265 








1270 








1275 








1280 


Cys 


Arg 


Thr 


Asp 


Val 


Ser 


Thr 


Asn 


Ala 


Arg 


Ser 


Val 


Asn 


Ala 


Val 


Arg 










1285 








1290 








1295 


Glu 


Arg 


Glu 


Ser 


Lys 


Ser 


Tyr 


Arg 


Phe 


He 


Arg 


Pro 


Ser 


Asp 


Gin 


Ser 



1300 1305 1310 
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lie Gly Thr His Ser Arg Ser Lys lie Ala Val Val Met Tyr Pro Asp 

1315 1320 1325 

Ala Ser Met Ser Tyr Ser Val Asp Thr Leu Asp Ala Asp Val Ala Arg 

1330 1335 1340 

Arg Glu Thr Thr Ser Val Leu Leu Leu Ala Glu Thr lie His Gly Glu 
1345 1350 1355 1360 

Lys Asp Arg Gly Phe Tyr Ala Asp Arg Gly Thr Val Gly Arg Leu Met 

1365 1370 1375 

Val Pro Pro Thr Glu Glu Glu Leu Leu Val Leu Gin Xaa 
1380 1385 



<210> 162 
<211> 201 
<212> DNA 
<213> SHRIMP 



<400> 162 

atggccgtaa cagagatacc ttgtggtact 
gagttgatac ttgtaacagc agaagcagaa 
gcagcaatca taggggctgt agtggtgcaa 
gcagaggtag agttgatgtg a 

<210> 163 
<211> 64 
<212> PRT 
<213> SHRIMP 



cggaacattg cagaagaaga cgtagaatta 60 
gtccgagagg cgatggcagc agctctggcg 120 
atcggcagag tgttggatga agtcgtggca 180 

201 



<400> 163 

Met Ala Val Thr Glu lie Pro Cys Gly Thr Arg Asn lie Ala Glu Glu 

15 10 15 

Asp Val Glu Leu Glu Leu lie Leu Val Thr Ala Glu Ala Glu Val Arg 

20 25 30 

Glu Ala Met Ala Ala Ala Ala Ala lie lie Gly Ala Val Val val Gin 

35 40 45 

lie Gly Arg Val Leu Asp Glu Val Val Ala Ala Glu Val Glu Leu Met 
50 55 60 



<210> 164 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 164 

atgcacatgt 

atatctatag 

gcctaccctg 

cactttaact 

ggccaaggaa 

ttacttggag 

ttgtatgcaa 

tctccaggtc 

aaaccaacag 

gaagacatcg 

gatctgttta 



ggggggttta 
ttgtaaccaa 
ttgaatctga 
ttgtaaacgg 
cgtccgattc 
acggagacca 
gagtatacaa 
taaatgctac 
ttttgaagaa 
atgatacaaa 
aggttgggga 



cgccgctata 
catagaactt 
aataataaac 
cacattacaa 
tgaactggta 
cacactaaat 
taatacaaag 
aggaagggaa 
aaataggatc 
taggcattat 
aaaataa 



ctggcgggtt 
aacaagaaat 
ttgaccatta 
accaggaact 
aaaaagaaag 
gtaaacaaag 
agggatataa 
ttttcagcta 
aacacacttg 
ctgttaagta 



tgacattgat 
tggacaagaa 
acggtgttgc 
atggaaaggt 
gagacataat 
ccgaatctaa 
cagtggactc 
acaaatttgt 
tgtttggagc 
tgcgattttc 



actcgtggtt 
ggataaagac 
tagaggaaac 
atatgtagct 
cctcacatct 
agaattagaa 
tgtttcactg 
attatatttc 
aacgtttgac 
tcctggcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 165 
<211> 204 
<212> PRT 
<213> SHRIMP 
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<400> 165 




























Met 


riXS 


Met 


Trp 


Gly 


Val 


Tyr 


Ala 


Aid 


i le 


T at* 

Leu 


Ala 


Gly 


Leu 


i nr 


Leu 


1 








5 










1 U 










1 o 




lie 


Leu 


vai 


Val 


He 


Ser 


He 


val 


val 

Zo 


Thr 


Asn 


He 


Glu 


Leu 

JU 


Asn 


Lys 


Lys 


Leu 


Asp 


Lys 


Lys 


Asp 


Lys 


Asp 
4 0 


Ala 


Tyr 


Pro 


Val 


Glu 
45 


Ser 


Glu 


He 


lie 


Asn 
OO 


Leu 


Thr 


He 


Asn 


0>iy 
55 


va jl 


Arg 


Asn 


His 


Phe 
60 


Asn 


Pne 


val 


Asn 


Gly 


Thr 


Leu 


Gin 


Thr 


Arg 


Asn 


Tyr 


Gly 


Lys 


Val 


Tyr 


Val 


Ala 


Gly 


Gin 


65 










70 










75 










80 


Gly 


Thr 


Ser 


Asp 


Ser 


Glu 


Leu 


Val 


Lys 


Lys 


Lys 


Gly Asp 


He 


He 


Leu 










85 










90 










95 




Thr 


Ser 


Leu 


Leu 


Gly Asp 




Asp 


nis 


ini 


Leu 


Asn 


Val 


Asn 


Lys 










100 










105 










110 






Glu 


Ser 


Lys 
115 


Glu 


Leu 


Glu 


Leu 


Tyr 
120 


Ala 


Arg 


Val 


Tyr 


Asn 
125 


Asn 


Thr 


Lys 


Arg 


Asp 
130 


lie 


Thr 


Val 


Asp 


Ser 
135 


Val 


Ser 


Leu 


Ser 


Pro 
140 


Gin 


Ala 


Thr 


Gly 


Arg 


Glu 


Phe 


Ser 


Ala 


Asn 


Lys 


Phe 


Val 


Leu 


Tyr 


Phe 


Lys 


Pro 


Thr 


Val 


145 










150 










155 










160 


Leu 


Lys 


Lys 


Asn 


Arg 


He 


Asn 


Thr 


Leu 


Val 


Phe 


Gly Ala 


Thr 


Phe 


Asp 










165 










170 










175 




Glu 


Asp 


He 


Asp 
180 


Asp 


Thr 


Asn 


Arg 


His 
185 


Tyr 


Leu 


Leu 


Ser 


Met 
190 


Arg 


Phe 


Ser 


Pro 


Gly Asn 


Asp 


Leu 


Phe 


Lys 


Val 


Gly 


Glu 


Lys 











195 200 



<210> 166 
<211> 12543 
<212> DNA 
<213> SHRIMP 



<400> 166 

atgtttaagg 

gatcacttaa 

ctctccaaca 

ggggaacaaa 

atggggcatc 

cttctgtatg 

aaattgataa 

ggcatgtaca 

aatgagcgtt 

actggaaagc 

agtgcagaga 

gcaacagacg 

aaggtaatat 

gaaaatgtgg 

atgcccagaa 

attgatattg 

gcgttcccct 

aacgaaaaca 

tgcaaaacac 

atggccaata 

ccttacagaa 

aagtttctgg 

gagaagatgg 

aatgaggaag 

aacaagcctc 



caaatgtgct 
taaaatgtgc 
aactgcccga 
ttctagtcac 
tagctgtaca 
ctggatgtaa 
tgtacagtca 
aagtaaaaaa 
cttgccagaa 
ttctcaggga 
aactatgtgt 
tgccagtgtt 
tctacccaac 
tgttgttaaa 
tcgtcgaacg 
gcgggagtga 
atcaccatgc 
tggtgacttc 
tggaaagttt 
ccgacgattg 
acaaaagttt 
cagaaggagg 
cgtgtaaagg 
gaggatctgg 
ctccagcccc 



caacctagga 
caaccagatg 
gtacgacaac 
tgataact tg 
agatagagtc 
tgttgttgaa 
acctgcaact 
ggttccagaa 
tgaaaatagg 
acttattatt 
gactcctgcc 
taaggttgcc 
catccttgaa 
gtcaatctta 
tgttgaaagt 
taactggaa t 
gcagttggtg 
tctgggtgtc 
cgagaaaaag 
gagaagaaag 
gattatggat 
tggcttgtta 
attggacgat 
agggaaatct 
taaacaaatc 



ggaggcaagt 
aaagaggaac 
agacgtctac 
accaagaatg 
ggcggagacg 
tatgacacag 
ttaaaggatg 
attactggtg 
cgcatggatg 
atgatgaggc 
ttcagagagt 
ttcatcacca 
gaggaacgtc 
aaggtacaac 
atgatcaaga 
cttcccggac 
ggagaaaaga 
gtgaaggcag 
tgtctatatc 
atcctttttt 
caagatttct 
ttgaccaaga 
tctggagatg 
ggcggcggtt 
cctccccttg 



ttcttgaaag 
caacgactct 
ctcttctcct 
gcaaccctct 
gcagtgtcaa 
tgaacagagg 
ttgcaaaatc 
atcaatttct 
aagaaggacc 
tttatgagga 
ttctgggttg 
atgcctcttt 
ttgcagctgt 
ttgaattact 
aaactgttgc 
attgtaaagt 
agaacattct 
atagagccga 
tcgaaaacct 
ctgaacttgg 
gtacaattgg 
caaatgctac 
gagacgacga 
ctggagacga 
ctgccaatgt 



tgatgtgcga 
acgcatctgt 
tctcaacgaa 
ggtcaaacaa 
ccccaacaat 
aaacgatgga 
caagaagaaa 
ggacaagctg 
ccacgttgga 
agaaacttct 
tggaagaaca 
gatggggctc 
ttcagacaca 
gagtgaatgt 
ctgtttcaag 
gagcgacact 
ctccatcagc 
agagtggatg 
catgggttct 
gccagaaatg 
catgtgttac 
cctattaaag 
ggaagaagat 
gaacaacatc 
ctataatagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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atcatcaatg 
ttctttctga 
aaagccatga 
aggaatttga 
caaattgttg 
caacaaagac 
tggcaattta 
cctcctccac 
tcaacaacac 
gatgaaaacc 
acattcgcca 
gctgagcggt 
ataattcgtg 
gcaattaacc 
tgcagaatgg 
agcaaattgt 
acatttaaaa 
gcccccatgg 
ttggcagaca 
agaaaatgtg 
gaaagcgatg 
ggaggtgcaa 
ggcgaatttg 
aacgtacaag 
agtttatgtc 
ttgtgtgatg 
attaatagaa 
gaaattgtca 
cctccctttt 
aacaatacca 
ctgaaagatc 
agtgatgtta 
tctattttag 
ctatcttcag 
attaacacgt 
gtatcagata 
caagatttag 
gtgtttaatc 
gatgcagcaa 
aaaatgatgg 
gagttctttt 
ctgtggacaa 
ggaagtttag 
aatgtggacg 
gcattttttg 
aatgacgaca 
gacaacgaca 
aacgtaatta 
tgggttgaag 
atggctgcaa 
gtcgctatgt 
ttgcgtctat 
tggtgcagct 
ggtagatttt 
attggacaag 
tcgtttgatt 
tctcacgtcc 
aatttggtta 
gaaaaacgtg 
gactctgtgc 
gtcagtgtta 



acgatgataa 
gtgacattga 
gcaagaagaa 
ctggcgttaa 
tgagatatga 
tcaagaataa 
cccacaaaga 
ctactcaaag 
cagatatggt 
agatagcttc 
cgggtgctga 
tttgtgcgct 
aagattttga 
gagaacttag 
gagttgcagc 
gcaaaattaa 
cgtttgatta 
agactttcca 
agaagtcgga 
acaacgacga 
aagaaagtga 
caggtgatta 
gttctagtga 
atgatgtgtt 
atcgccaaaa 
tactaacatt 
gatctttgag 
agcctaatgt 
gttcttgtgc 
tgtctaacga 
tgactgtagc 
ctgcagcttt 
ataataagag 
cattaccttt 
ttagacctag 
agagtgttaa 
agtctgttgt 
aatcgctttc 
attatttcaa 
aagaatgtat 
caaatttgat 
ttctagagac 
ttaaggattg 
aatctggcgc 
gtaaagatct 
ataaggactg 
ttagatctgt 
agaaattgaa 
acgaatatta 
cagaagaggt 
atcgtatcat 
ctgtggataa 
gtacaggcaa 
acgatattga 
ttgcaagtaa 
ttaatggatt 
taatgggacg 
gtgtttcctc 
ttgggaaggt 
gtgcttcttc 
aatctttgga 



gcttgaccag 
caactctcct 
ttgtgttgaa 
caccatcaag 
gaacagaaac 
cacatgggat 
atcactaaaa 
tgaagaagaa 
ttctagtcta 
tttggaaaat 
caagaacgac 
gtacaacatc 
atttgaagga 
catatatgta 
atacgaaaat 
gtatgatgaa 
tgatgaatct 
acgacttttt 
gaagagggag 
cgtttctgct 
tgaagagagt 
ttatggtggt 
tgatgaaaat 
tagggatgtt 
gtatgtttca 
ggacaattct 
aatgaggaat 
caatagtgaa 
ttctcttaat 
aaagtgcata 
tctccgtttc 
aggatttgtg 
gtgggatgcc 
gggtatctat 
aagtttagcc 
taatggtttc 
gctgggttct 
tgagaaggaa 
ggaccataac 
catgcgtaca 
aacaagatac 
tgttagggag 
ggacggtagt 
cgtatttgag 
ggacacaaat 
gaaagcgcag 
tgaaaatttc 
gataaaaccc 
tccacgaaac 
tctcaagaca 
gatgcaggga 
aatggtacca 
gggagattct 
agaagatcct 
tgatattgtc 
tgatactagt 
gcgaagcagg 
tttggccgac 
aatgaaacgt 
aaatgatttg 
cgagttccgc 



attgtgtgtt 
cttctggcaa 
cgcgtgaaac 
tttgacacaa 
ctgaaattgc 
gcactatcaa 
cctcctacaa 
gctgctgctg 
gaagaaggtg 
atcaagaagc 
accatattcg 
acttctcatc 
ggttttgaga 
cccaaaagtg 
tccatggaga 
agcacaatgg 
gacaagtctt 
gccagcgtca 
aaactctatc 
cgccagatcc 
gacgatgagg 
gatgatgaag 
gtaccatctg 
aatttcatca 
actgtcattg 
gcagctgaat 
tgggtagtgc 
gacggaaccg 
aacttcaaga 
aaactactcc 
aacacaatgg 
aaaaaggaca 
ataaaacagt 
gaaaatgtga 
agattggcat 
ttttctggta 
acagttgttg 
cttaaagaaa 
gtttcaaaac 
gctttgaatg 
ggttctggaa 
tgctttaata 
gacatgctaa 
ctatcagaat 
ttggatgctg 
gtcgccaagg 
ataaattcaa 
acccctagca 
aagtctactt 
gaaatgtccc 
gaaagtgtgc 
ctcattcgat 
cccaaaaagg 
ctccactttt 
attgaagaaa 
aattggttgc 
cttctttcta 
aaaaatagcg 
attgcaaggc 
ttggattgtt 
gccaagactc 



tcttcaagag 
tggagtttct 
cagaaacaaa 
tcatgccttt 
ccagagatac 
agggcaaatt 
ttgaagaatt 
ctgctgctgc 
caacatccac 
tcttgtccat 
cgtggacagt 
cagaagagta 
agtttagaca 
tgcttgaaaa 
ggatcaagaa 
tctacgaatt 
ttggaccaat 
agtctgataa 
aacaaaaaca 
tgaacaatgt 
agaattatgg 
atgattgtta 
acaatgctag 
aaacattcaa 
tggaggaaat 
ctggtgatat 
cttttactat 
ctaacagcaa 
gcgattctcc 
caattcctag 
catgtgaaag 
aagttaatgg 
gtaaattggc 
tcagtgaaga 
gttcatcggg 
tttgggcatt 
atcctctaaa 
aacgtcaaca 
ttaacatata 
gaaaaacaag 
ccaatagtcc 
acagccttcc 
atttaaaggc 
ttctgggcgt 
atacgtggga 
cctatgagtt 
gcaacttgct 
atgatgttag 
tgcgcagtcg 
tctcttgtgt 
gagaaattgc 
gctttaaaat 
ccgacgctag 
accgttttgc 
tgacccgaaa 
aatttatcac 
ggcccctttc 
aaaaaagcaa 
ttgttttagt 
gtattttgga 
gccaagaact 



aaaacacggt 
tcttccccag 
gaacattatc 
cgccatcctt 
ggacattcta 
tgcagaaatg 
ggagagtatt 
atcaacatca 
ttcgtcatct 
catcacgtca 
cgttactctt 
ctatcagcaa 
catgtgcgac 
gcaatctgtc 
caagacaaac 
gaacaacgat 
gtatgaatgc 
ggaagctgtt 
agaatatcta 
ggccagtaac 
agctgccaag 
tggttttcta 
tagtattaat 
ctttagatcc 
ggagaaaaat 
tttgaaagaa 
gcctgtacga 
caacaatatc 
ccttagtagt 
ttctaaacat 
gagatacttt 
taatattagg 
aggaaaatgt 
taacaaactc 
aggtgatggt 
gtgtgcgaat 
accaactaaa 
gatgtgttta 
cgagtgcttt 
taatgattct 
agcttctaga 
tattgattgg 
aggtgtgtca 
tagcgcgagg 
atgtcttttg 
tgctctcaaa 
aacaaataac 
gcatcaaatt 
ggcagagtgg 
gctagcaatg 
aactgctcct 
cactagcaag 
cataaaagaa 
cgcatatgtt 
gattctcatg 
ctatagattt 
actagttaag 
tgatatgtat 
aaaagcggca 
cgttaatgat 
gcaagagact 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
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agaatcgata 
gtagaaaagt 
caagctaaag 
aacaaggagg 
cgatcaaaga 
tctagtcggg 
cagatacaca 
ggctatgtac 
gtgaaagtaa 
gagtatgctt 
aggtggaata 
actgatactg 
gttggcgagg 
tctaataata 
gaaggaagtg 
aagaagcaag 
cgtcccttga 
acacgctatt 
tacattaaga 
ccctggaaga 
tcaattcaag 
gatttattgt 
aagattgtta 
tttgaccaga 
tataaaagtt 
gttttgctta 
cgtcacatta 
cactttgaaa 
tcagtatacc 
ttccctctta 
gatccttctc 
gttttgtctt 
tgttcagtgt 
tcttgttatc 
tgtcccaact 
ttaagtacca 
cattctggtc 
gatcaaggtg 
atgttaattt 
attaagaagg 
tccttcttta 
aagaagtgt t 
tttactgatt 
gagtctactc 
tttgacaatg 
tcagactaca 
aatccagtgt 
acaaatgcag 
gctagaacgt 
tgtattagcg 
cgccttgaag 
aaacacgtta 
tccatggaag 
aaaaataccg 
gaagaaaccg 
gaagaaaaca 
agttct tctc 
attttatacg 
tcctcctcct 
aagaggaagc 
tgt tacatct 



ctaactacaa 
cctctagaat 
taatggacgc 
ataatagtgg 
aatctggcgg 
gatcatcatc 
tggacgccag 
ttaaccgaat 
tttctgacat 
tgtacggtgt 
aggcagttga 
gaagcgttta 
gaggaaatgg 
ataatgaagc 
attgctcgtt 
gaaagggcag 
gaacttttat 
tctccaatca 
atatggccct 
actttaacaa 
atttggagaa 
ctctttatga 
acgattctaa 
ctttctccca 
tgggtaagat 
aaaaattgct 
gcgttttgtg 
tgacaaagaa 
attctggttt 
tgaatctgca 
ccgaaaaaat 
catgtttaga 
tgtttacacc 
ggcaacaaga 
ttgtttcatc 
actctgatgg 
tatatgctgg 
gtacagataa 
ctatgcggtc 
ataattttaa 
actcccctgt 
gcgatttctt 
gggcctgtat 
tccaaattag 
agtcagaaca 
acaagcagaa 
gtcctgccaa 
gaactgctct 
ttattagtaa 
ttgataataa 
ggataaattt 
ttttgaattc 
gcactacaat 
actctgtcag 
atgaaaatga 
cggaaaatac 
taaaacctcc 
aacttgctaa 
cctcctcttc 
aaaaggatga 
catgtgttga 



tctggttagt 
catcaacact 
taacgaagaa 
tgcaaaacaa 
aggaggtttt 
cgtagttgat 
aaacgaagaa 
taagaacatg 
tgagaacttt 
cactgaaact 
ggaagaaact 
tgatatgaac 
tgggtgtggg 
taaccagacc 
ttggtttaac 
tgttttggca 
ccagtacaag 
tatttctgac 
gggagatgag 
taactctaat 
aaaagactct 
tgattctcta 
agacgcgtac 
gcaatacttt 
gtggaactgt 
gtttaaaaat 
taactgggga 
taatgctcca 
cctgagtgtg 
ccgcactttc 
ttctgcagct 
tcccgagggt 
agggacaaat 
caatgattac 
agaaattgaa 
ccataagcgc 
cattgatact 
aacttgcctt 
tgtcattaat 
ctttttagca 
gcagggtaaa 
attgaacaaa 
tgtttcatca 
aaacggggga 
aggcgagttg 
tattttttcg 
gaatatgctc 
catttctaaa 
tggtacagca 
caggtacttt 
ggtcacggac 
tttattctca 
gggtagagga 
taatatgaac 
ggatgatgat 
tagcgttgta 
atcaattttt 
agccactagc 
ttcttcaaaa 
tgttaattct 
ccaaaaaact 



aattcctgta 
aacatttctt 
gcatttatcg 
atgacaggaa 
aataacgcgg 
gaggactcga 
gacagagaaa 
atcacacaaa 
tttaagattt 
gcgttaagtg 
aacaagataa 
atcatatgtc 
ggtggatcat 
aatgaaattt 
ttctacaaga 
aatacaggcc 
gggaaaggtt 
tcgtactggt 
gacaagtcaa 
agcagtagta 
ttaaagaatg 
actacttcca 
gttatcttgg 
acccaccaaa 
aacaatggta 
ctaaacattt 
tgtattcatc 
tgtggcgtta 
gaagactatg 
tctgctaaat 
agtttggcta 
aatttctgta 
ataagacgag 
tgtttcattg 
attgtcagta 
gtgcttcgtg 
ggctgcgcag 
ctgcaagaag 
gggaaaagtc 
ggaacagata 
tttgtcgctc 
ggcacaggag 
tcaaagggta 
tgtttcagcg 
ttccacgata 
gaacaagcat 
aaaagagcca 
attatggctg 
attcccttca 
ttgattgatg 
atgtatacaa 
accgaattta 
ctttgtttaa 
aaaaactttt 
gacgaaaatg 
aagtacgaac 
attgctgatg 
gattgtgaaa 
catagtagta 
acaactacag 
ggcatgccca 



caactgccca 
tccacaacat 
atccttctct 
agggtggaag 
gtggttttta 
gatcccgcac 
gtggcttgtt 
atcagataaa 
gtgttccttt 
ctggaatgga 
gaaaagagtg 
ctggcgatta 
cctcatcagg 
ctgaagacca 
aagttgtcaa 
atgagggtag 
ttgctgaaac 
ctcaagttat 
aaaagaagtt 
attcatcggt 
ttcctatggg 
ctgaaaaatt 
gctcttctaa 
aaatatccaa 
tgtcccctaa 
tgtggattaa 
ccaatagttc 
ccgattcaaa 
gccaactact 
ctaaagacaa 
aggctgttta 
cgtcttggat 
gaggagattt 
gtaaagaaga 
tccttaaaac 
tcatcaatta 
acgatgaaga 
attcaatgga 
tggacgaatc 
agggtttcta 
ctagaggaac 
gagtatttgc 
agaataacaa 
agcgattaac 
gatactgtcc 
acaaatgctc 
aaaatatacg 
aagttgaaaa 
ggttagctga 
gtacttatct 
ggtgtaaatt 
tttctgctgc 
tcgaacacgt 
ggtctatggc 
aagaagatga 
ctgtatcaaa 
aagattatat 
cggcttcttc 
gtagtagtag 
cccttcatgc 
gaatggatgt 



actcgctgct 
tcctgccggt 
agaagagatt 
taatagggga 
caatgatgat 
gggtttcagc 
ttcgtacgat 
taatgacatt 
ctcgaaaaaa 
tgccattgaa 
ccgcgatttg 
catgtcttct 
tcaccttctc 
attgaaacat 
ttcttcggaa 
aatagttggc 
aaaggtcctg 
gccaatttgt 
tggaaagaga 
gaaatatgtt 
ttatgatgaa 
agaaaacatt 
ccaatcttca 
tatcaacacc 
gaaccagata 
attgtacgag 
aaagaatagc 
ccctccactt 
gaaggacact 
taattcctct 
cgcccgtgaa 
cactaatagt 
tttcaacaaa 
aactaagaaa 
tgccgtgttt 
caataaggac 
tgatgatgac 
tgctaagagg 
tagtctggcc 
cttggataat 
aaaaatcttt 
ccgcatattc 
gaaggcgatc 
tccttcaatg 
cgattttttg 
atttttagct 
tctttgtatt 
aatgggaaat 
aaacactgcc 
tctaggagga 
aaaggcagaa 
tctagcttcg 
atcttatatg 
tgaagatcag 
ggatgagaac 
aacagcattc 
atttttatca 
ctcctcctcc 
tagtaataag 
gctaagaaaa 
cgtgtatctt 



5220 
5280 
5340 
5400 
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ttacgtggcc taatgaactt tggaggtatg tgcacagcca ttgcttcagg agatggagag 8880 
aaagcccacc acatggttca aactctttgt tccgttgcct tgaatattgc cacaaaaact 8940 
gctgttgtat ttgtgggaac aaagggaaac aatttaaaga ccactcttgt cgatttgtgt 9000 
aagaggactt ggttcgaaag gtttacaaat attaacgtta cggctctcaa taatgctggg 9060 
gattcttctt catctactca agcaaatttg gcttcctttg ctggcaaaaa gggtatcgtg 9120 
attattgacg aagttggcca ccaaggttca tttggctcaa agaagtcatc gagcgaagac 9180 
gataaggacg agtctgcttc cagaagtggt aatgtcgatt ttggcggcag tgggggagaa 924 0 
atgaatagtg ttgacatcaa tgaagcaaga aacgcgtracg gtgacggagg aaactctaag 9300 
atcgttttta gtaatattaa ccgtttaatg accgaatcaa aactaaaagt atgcgaccaa 9360 
gagtacgatt ttatttctga actcaaacac gaaaagaata ggaaaaatgc ttgtaatgat 9420 
acaaaaaaac gtaaaagagg tggcgaaatt gaagatgaag gggtagaatg tgaggaaatt 94 80 
gagagaaacg acggtaaaaa tgacgaaaat ggtgtccgca taaaggatcc tatcaatatt 954 0 
tctttctttg ccagaaaagc tcactggtgg aattgttctt ctggtgttgt ttcaactacc 9600 
tttaaagaga aaaatattgt gtataatatg ctccatagag gcgccatgcc tttctctatt 9660 
aaagactgca cagattctcc ttggcttaac gaaaccgatg ctgtgtacag gcactgtaag 9720 
aaacctatcg aatatgaagg caaattctct aaatccgaag taaaaactgc cttgaagtgc 9780 
atattaggta aatttggatc caaaatatgc gataacgaat catttgaatc cattattgat 9840 
gaaaactgcc aggttaataa tcttcattcc tggaacgatt gtaaagaaga tatagatgaa 9900 
tggaatgaga aatttatgag caagaacaag aaaaataagc agaatatgaa aattgaagac 9960 
aaagttgatg cgattatgaa cataatacag aaaaacaatg gccttttgaa atggaataca 10020 
tcatttgatc gcgacggttc gcctgtttta gtctgcaacc ctgccactga aaggttttcc 10080 
gagatgatca cgtcatcttt gtctgcccag gacatgctcg aaattaaaaa atacctgggg 10140 
gataattgcc tgtcgactaa cgggggcgtt aaaaagtcag ttattgatgg taatacatcc 10200 
gcccctggtg tattgattgc gtaccattgc gtttacactg gaaaaatttc tgatgatctt 10260 
tcaaaaacaa actacccagt tttacttccc cctcccaaga aacaacactt tgtagctgtc 10320 
gatgatgctg ccgagaaggc tcttctgggc cctactctgt cgaatattaa tattgacagt 10380 
attcgtaaca tcaagactat ttctcgcaag ttatcgtcca taataaagga cccagaagct 104 40 
gcaaaactat tagtggatag agatctggac tttatgaaca tgtacgaacg ttatgacgca 10500 
tctttattcg atgtggttaa gaaaccgtca aaatatagtt ttcctggatt tacttctgat 10560 
ggttcagtgg ttctgtcaac ttccacttct gactgcgaaa atgtcctttc ttgcttaaag 10620 
aagcgtattg agaaggataa gatgtcggct aagaattctg gatcctttat tcgcatgtgt 10680 
atggataaaa atttgctttc tgacgaaaag gacgactctt cctctaattc atcaaaaaat 10740 
acgtcttctt tacctaaaac agacgataac agcagtgata ttgcaaactt tttaagtgtt 10800 
tttggagaaa ataggcaaca aagttcacaa ttttcctttg cttctaactc tagtggtgga 10860 
ggagacagta ataaagaggc ttgtttcaat gtcgatacgc ctaaaagaag gcaattggtg 10920 
tctgcactcc aaaaacataa cagtgacgga tcttcttcca ttattaccga aatcgctaag 10980 
gccatccccc aaaagaatga tgtgtccagt tctataacca aacatatgct tccgggccaa 11040 
tttccctcca gtttattaaa gaatatgact tcaccccaaa acagcgtcat gattagaggc 11100 
attttccaac agggcgcaaa aagcagcatt actgtgtctc ctattatgat gtccaacagt 11160 
tacatcttct cattctttgt cgacgaagct atgtctaaaa gacttattgt ttttccttgt 11220 
gatacaacat ttgtctttga gaataagaat gaggatgtta agaaaattat cgggcttttg 11280 
gatagaggaa tgaagtatat ccattcatct ttaatgatgg aaagatgtat caaatttgga 11340 
aagcatggaa taaaacagcg tcaacacgaa tttaatcatc acaagaaagc atggaatgac 114 00 
tttagtgggc attccagtga caacaaaaag aaggatagaa ttagtgacgt atcatccgtt 114 60 
cttccatcag ttttaatgaa aaacttgatc agaaacaagg tcctagaatt gagagacgta 11520 
aaatcagttt cccgtttaga ggagaatact aacacctttt tccatttgta cacttcaatg 11580 
agtctttgtg ctaaggctgc aacaaattac ggagaatctt cttcttcttc tgctactatc 11640 
actgaagtag aagaagacaa tagttgtgat gctgaggaac aacaattgcg ccgtaagaaa 11700 
ccagcaaatt acgagtccat gtgcaataaa ttaccatcac ctcttcaaat gtgccagatt 11760 
aatcctaaat ctttgaatac aatggctatg aatattgctc gttctagaca aggcgcatgg 11820 
gctcaactaa atagcatgct caattcagtt ttatttgtag aaatgccctt tgttaagacc 11880 
acaagattct ttgggagaga ttttaacatc aagatgcact ctcctgccac aaagaatagg 11940 
ccagccatca attttgacaa ttgtattggt atgagtttac caaatcctga tatggatgtt 12000 
gtaggatatg ataaggaagg ggaattaata ggagttggtt cttcactaac aaaacatttg 12060 
tgtgatgcct ggggttcaat ggatgttagg gatttgatgt attcttgcca ccatttacac 12120 
atgttgtttg aaatggctct ccagtatact gaatgtaaga ggagactttc ctcattaaaa 12180 
acactaaaga gtgataaaac tggtgttgat tatgttgctg taatgttagc ctgtatggtg 12240 
taccaattga tggttagtaa tctgaaatac cctgtgtttt tgtcctcttc atcccacaag 12300 
agggcaaata cagaggatat tgcggatgag aatcaagttt cttcactatc tgttcccatg 12360 
tttctggcaa tggttgtgaa taaaccgttg catgctctta gacactctac caatttagct 12420 
cttcctaatg cttctcaaaa atctgaccat tctgatattg tcaagtacat tgtaatgaac 12480 
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caatggggtt tgagacttaa ccctgattat ctgtgcccaa actgtgtaaa acatgttctt 12540 
tga 12543 
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900 905 910 

Ala Lys Gly Gly Ala Thr Gly Asp Tyr Tyr Gly Gly Asp Asp Glu Asp 

915 920 925 

Asp Cys Tyr Gly Phe Leu Gly Glu Phe Gly Ser Ser Asp Asp Glu Asn 

930 935 940 

Val Pro Ser Asp Asn Ala Ser Ser lie Asn Asn Val Gin Asp Asp Val 
945 950 955 960 

Phe Arg Asp Val Asn Phe lie Lys Thr Phe Asn Phe Arg Ser Ser Leu 

965 970 975 

Cys His Arg Gin Lys Tyr Val Ser Thr Val lie Val Glu Glu Met Glu 

980 985 990 

Lys Asn Leu Cys Asp Val Leu Thr Leu Asp Asn Ser Ala Ala Glu Ser 

995 1000 1005 

Gly Asp lie Leu Lys Glu lie Asn Arg Arg Ser Leu Arg Met Arg Asn 

1010 1015 1020 

Trp Val Val Pro Phe Thr Met Pro Val Arg Glu lie Val Lys Pro Asn 
1025 1030 1035 1040 

Val Asn Ser Glu Asp Gly Thr Ala Asn Ser Asn Asn Asn He Pro Pro 

1045 1050 1055 

Phe Cys Ser Cys Ala Ser Leu Asn Asn Phe Lys Ser Asp Ser Pro Leu 

1060 1065 1070 

Ser Ser Asn Asn Thr Met Ser Asn Glu Lys Cys He Lys Leu Leu Pro 

1075 1080 1085 

He Pro Ser Ser Lys His Leu Lys Asp Leu Thr Val Arg Phe Asn Thr 

1090 1095 1100 

Met Ala Cys Glu Arg Arg Tyr Phe Ser Asp Val Thr Ala Ala Leu Gly 
1105 1110 1115 1120 

Phe Val Lys Lys Asp Lys Val Asn Gly Asn He Arg Ser He Leu Asp 

1125 1130 1135 

Asn Lys Arg Trp Asp Ala He Lys Gin Cys Lys Leu Ala Gly Lys Cys 

1140 1145 1150 

Leu Ser Ser Ala Leu Pro Leu Gly He Tyr Glu Asn Val He Ser Glu 

1155 1160 1165 

Asp Asn Lys Leu He Asn Thr Phe Arg Pro Arg Ser Leu Ala Arg Leu 

1170 1175 1180 

Ala Cys Ser Ser Gly Gly Asp Gly Val Ser Asp Lys Ser Val Asn Asn 
1185 1190 1195 1200 

Gly Phe Phe Ser Gly He Trp Ala Leu Cys Ala Asn Gin Asp Leu Glu 

1205 1210 1215 

Ser Val Val Leu Gly Ser Thr Val Val Asp Pro Leu Lys Pro Thr Lys 

1220 1225 1230 

Val Phe Asn Gin Ser Leu Ser Glu Lys Glu Leu Lys Glu Lys Arg Gin 

1235 1240 1245 

Gin Met Cys Leu Asp Ala Ala Asn Tyr Phe Lys Asp His Asn Val Ser 

1250 1255 1260 

Lys Leu Asn He Tyr Glu Cys Phe Lys Met Met Glu Glu Cys He Met 
1265 1270 1275 1280 

Arg Thr Ala Leu Asn Gly Lys Thr Ser Asn Asp Ser Glu Phe Phe Ser 

1285 1290 1295 

Asn Leu He Thr Arg Tyr Gly Ser Gly Thr Asn Ser Pro Ala Ser Arg 

1300 1305 1310 

Leu Trp Thr He Thr Val Arg Glu Cys Phe Asn Asn Ser Leu Pro He 

1315 1320 1325 

Asp Trp Gly Ser Leu Val Lys Asp Trp Asp Gly Ser Asp Met Leu Asn 

1330 1335 1340 

Leu Lys Ala Gly Val Ser Asn Val Asp Glu Ser Gly Ala Val Phe Glu 
1345 1350 1355 1360 

Leu Ser Glu Phe Leu Gly Val Ser Ala Arg Ala Phe Phe Gly Lys Asp 

1365 1370 1375 

Leu Asp Thr Asn Leu Asp Ala Asp Thr Trp Glu Cys Leu Leu Asn Asp 
1380 1385 1390 
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Asp Asn Lys Asp Trp Lys Ala Gin Val Ala Lys Ala Tyr Glu Phe Ala 

1395 1400 1405 

Leu Lys Asp Asn Asp lie Arg Ser Val Glu Asn Phe lie Asn Ser Ser 

1410 1415 1420 

Asn Leu Leu Thr Asn Asn Asn Val lie Lys Lys Leu Lys lie Lys Pro 
1425 1430 1435 1440 

Thr Pro Ser Asn Asp Val Arg His Gin lie Trp Val Glu Asp Glu Tyr 

1445 1450 1455 

Tyr Pro Arg Asn Lys Ser Thr Leu Arg Ser Arg Ala Glu Trp Met Ala 

1460 1465 1470 

Ala Thr Glu Glu Val Leu Lys Thr Glu Met Ser Leu Ser Cys Val Leu 

1475 1480 1485 

Ala Met Val Ala Met Tyr Arg lie Met Met Gin Gly Glu Ser Val Arg 

1490 1495 1500 

Glu lie Ala Thr Ala Pro Leu Arg Leu Ser Val Asp Lys Met Val Pro 
1505 1510 1515 1520 

Leu lie Arg Cys Phe Lys lie Thr Ser Lys Trp Cys Ser Cys Thr Gly 

1525 1530 1535 

Lys Gly Asp Ser Pro Lys Lys Ala Asp Ala Ser lie Lys Glu Gly Arg 

1540 1545 1550 

Phe Tyr Asp lie Glu Glu Asp Pro Leu His Phe Tyr Arg Phe Ala Ala 

1555 1560 1565 

Tyr Val lie Gly Gin Val Asn Asp lie Val lie Glu Glu Met Thr Arg 

1570 1575 1580 

Lys lie Leu Met Ser Phe Asp Phe Asn Gly Phe Asp Thr Ser Asn Trp 
1585 1590 1595 1600 

Leu Gin Phe lie Tyr Phe Ser His Val Leu Met Gly Arg Arg Ser Arg 

1605 1610 1615 

Leu Leu Ser Arg Pro Leu Ser Leu Val Lys Asn Leu Val Ser Val Ser 

1620 1625 1630 

Ser Leu Ala Asp Lys Asn Ser Glu Lys Ser Asn Asp Met Tyr Glu Lys 

1635 1640 1645 

Arg Val Gly Lys Val Met Lys Arg lie Ala Arg Leu Val Leu Val Lys 

1650 1655 1660 

Ala Ala Asp Ser Val Arg Ala Ser Ser Asn Asp Leu Leu Asp Cys Cys 
1665 1670 1675 1680 

lie Leu Asp Val Asn Asp Val Ser Val Lys Ser Leu Asp Glu Phe Arg 

1685 1690 1695 

Ala Lys Thr Arg Gin Glu Leu Gin Glu Thr Arg lie Asp Thr Asn Tyr 

1700 1705 1710 

Asn Leu Val Ser Asn Ser Cys Thr Thr Ala Gin Leu Ala Ala Val Glu 

1715 1720 1725 

Lys Ser Ser Arg lie lie Asn Thr Asn lie Ser Phe His Asn lie Pro 

1730 1735 1740 

Ala Gly Gin Ala Lys Val Met Asp Ala Asn Glu Glu Ala Phe lie Asp 
1745 1750 1755 1760 

Pro Ser Leu Glu Glu lie Asn Lys Glu Asp Asn Ser Gly Ala Lys Gin 

1765 1770 1775 

Met Thr Gly Lys Gly Gly Ser Asn Arg Gly Arg Ser Lys Lys Ser Gly 

1780 1785 1790 

Gly Gly Gly Phe Asn Asn Ala Gly Gly Phe Tyr Asn Asp Asp Ser Ser 

1795 1800 1805 

Arg Gly Ser Ser Ser Val Val Asp Glu Asp Ser Arg Ser Arg Thr Gly 

1810 1815 1820 

Phe Ser Gin He His Met Asp Ala Arg Asn Glu Glu Asp Arg Glu Ser 
1825 1830 1835 1840 

Gly Leu Phe Ser Tyr Asp Gly Tyr Val Leu Asn Arg He Lys Asn Met 

1845 1850 1855 

He Thr Gin Asn Gin He Asn Asn Asp He Val Lys Val He Ser Asp 

1860 1865 1870 

He Glu Asn Phe Phe Lys lie Cys Val Pro Phe Ser Lys Lys Glu Tyr 
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1875 1880 1885 

Ala Leu Tyr Gly Val Thr Glu Thr Ala Leu Ser Ala Gly Met Asp Ala 

1890 1895 1900 

lie Glu Arg Trp Asn Lys Ala Val Glu Glu Glu Thr Asn Lys lie Arg 
1905 1910 1915 1920 

Lys Glu Cys Arg Asp Leu Thr Asp Thr Gly Ser Val Tyr Asp Met Asn 

1925 1930 1935 

lie He Cys Pro Gly Asp Tyr Met Ser Ser Val Gly Glu Gly Gly Asn 

1940 1945 1950 

Gly Gly Cys Gly Gly Gly Ser Ser Ser Ser Gly His Leu Leu Ser Asn 

1955 1960 1965 

Asn Asn Asn Glu Ala Asn Gin Thr Asn Glu He Ser Glu Asp Gin Leu 

1970 1975 1980 

Lys His Glu Gly Ser Asp Cys Ser Phe Trp Phe Asn Phe Tyr Lys Lys 
1985 1990 1995 2000 

Val Val Asn Ser Ser Glu Lys Lys Gin Gly Lys Gly Ser Val Leu Ala 

2005 2010 2015 

Asn Thr Gly His Glu Gly Arg He Val Gly Arg Pro Leu Arg Thr Phe 

2020 2025 2030 

He Gin Tyr Lys Gly Lys Gly Phe Ala Glu Thr Lys Val Leu Thr Arg 

2035 2040 2045 

Tyr Phe Ser Asn His Asp Ser Tyr Trp Ser Gin Val Met Pro lie Cys 

2050 2055 2060 

Tyr He Lys Asn Met Ala Leu Gly Asp Glu Asp Lys Ser Lys Lys Lys 
2065 2070 2075 2080 

Phe Gly Lys Arg Pro Trp Lys Asn Phe Asn Asn Asn Ser Asn Ser Ser 

2085 2090 2095 

Ser Asn Ser Ser Val Lys Tyr Val Ser He Gin Asp Leu Glu Lys Lys 

2100 2105 2110 

Asp Ser Leu Lys Asn Val Pro Met Gly Tyr Asp Glu Asp Leu Leu Ser 

2115 2120 2125 

Leu Tyr Asp Asp Ser Leu Thr Thr Ser Thr Glu Lys Leu Glu Asn He 

2130 2135 2140 

Lys He Val Asn Asp Ser Lys Asp Ala Tyr Val lie Leu Gly Ser Ser 
2145 2150 2155 2160 

Asn Gin Ser Ser Phe Asp Gin Thr Phe Ser Gin Gin Tyr Phe Thr His 

2165 2170 2175 

Gin Lys He Ser Asn He Asn Thr Tyr Lys Ser Leu Gly Lys Met Trp 

2180 2185 2190 

Asn Cys Asn Asn Gly Met Ser Pro Lys Asn Gin He Val Leu Leu Lys 

2195 2200 2205 

Lys Leu Leu Phe Lys Asn Leu Asn He Leu Trp He Lys Leu Tyr Glu 

2210 2215 2220 

Arg His Val Leu Cys Asn Trp Gly Cys He His Pro Asn Ser Ser Lys 
2225 2230 2235 2240 

Asn Ser His Phe Glu Met Thr Lys Asn Asn Ala Pro Cys Gly Val Thr 

2245 2250 2255 

Asp Ser Asn Pro Pro Leu Ser Val Tyr His Ser Gly Phe Leu Ser Val 

2260 2265 2270 

Glu Asp Tyr Gly Gin Leu Leu Lys Asp Thr Phe Pro Leu Met Asn Leu 

2275 2280 2285 

His Arg Thr Phe Ser Ala Lys Ser Lys Asp Asn Asn Ser Ser Asp Pro 

2290 2295 2300 

Ser Pro Glu Lys He Ser Ala Ala Ser Leu Ala Lys Ala Val Tyr Ala 
2305 2310 2315 2320 

Arg Glu Val Leu Ser Ser Cys Leu Asp Pro Glu Gly Asn Phe Cys Thr 

2325 2330 2335 

Ser Trp He Thr Asn Ser Cys Ser Val Leu Phe Thr Pro Gly Thr Asn 

2340 2345 2350 

He Arg Arg Gly Gly Asp Phe Phe Asn Lys Ser Cys Tyr Arg Gin Gin 
. 2355 2360 2365 
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Asp Asn Asp Tyr Cys Phe lie Gly Lys Glu Glu Thr Lys Lys Cys Pro 

2370 2375 2380 

Asn Phe Val Ser Ser Glu lie Glu lie Val Ser lie Leu Lys Thr Ala 
2385 2390 2395 2400 

Val Phe Leu Ser Thr Asn Ser Asp Gly His Lys Arg Val Leu Arg Val 

2405 2410 2415 

lie Asn Tyr Asn Lys Asp His Ser Gly Ala Gly lie Asp Thr Gly Cys 

2420 2425 2430 

Ala Asp Asp Glu Asp Asp Asp Asp Asp Gin Gly Gly Thr Asp Lys Thr 

2435 2440 2445 

Cys Leu Leu Gin Glu Asp Ser Met Asp Ala Lys Arg Met Leu lie Ser 

2450 2455 2460 

Met Arg Ser Val lie Asn Gly Lys Ser Leu Asp Glu Ser Ser Leu Ala 
2465 2470 2475 2480 

lie Lys Lys Asp Asn Phe Asn Phe Leu Ala Gly Thr Asp Lys Gly Phe 

2485 2490 2495 

Tyr Leu Asp Asn Ser Phe Phe Asn Ser Pro Val Gin Gly Lys Phe Val 

2500 2505 2510 

Ala Pro Arg Gly Thr Lys lie Phe Lys Lys Cys Cys Asp Phe Leu Leu 

2515 2520 2525 

Asn Lys Gly Thr Gly Gly Val Phe Ala Arg lie Phe Phe Thr Asp Trp 

2530 2535 2540 

Ala Cys lie Val Ser Ser Ser Lys Gly Lys Asn Asn Lys Lys Ala lie 
2545 2550 2555 2560 

Glu Ser Thr Leu Gin lie Arg Asn Gly Gly Cys Phe Ser Leu Thr Pro 

2565 2570 2575 

Ser Met Phe Asp Asn Glu Ser Glu Gin Gly Glu Leu Phe His Asp Arg 

2580 2585 2590 

Tyr Cys Pro Asp Phe Leu Ser Asp Tyr Asn Lys Gin Asn lie Phe Ser 

2595 2600 2605 

Glu Gin Ala Tyr Lys Cys Ser Phe Leu Ala Asn Pro Val Cys Pro Ala 

2610 2615 2620 

Lys Asn Met Leu Lys Arg Ala Lys Asn lie Arg Leu Cys lie Thr Asn 
2625 2630 2635 2640 

Ala Gly Thr Ala Leu lie Ser Lys lie Met Ala Glu Val Glu Lys Met 

2645 2650 2655 

Gly Asn Ala Arg Thr Phe lie Ser Asn Gly Thr Ala lie Pro Phe Arg 

2660 2665 2670 

Leu Ala Glu Asn Thr Ala Cys lie Ser Val Asp Asn Asn Arg Tyr Phe 

2675 2680 2685 

Leu lie Asp Gly Thr Tyr Leu Leu Gly Gly Arg Leu Glu Gly lie Asn 

2690 2695 2700 

Leu Val Thr Asp Met Tyr Thr Arg Cys Lys Leu Lys Ala Glu Lys His 
2705 2710 2715 2720 

Val lie Leu Asn Ser Leu Phe Ser Thr Glu Phe lie Ser Ala Ala Ser 

2725 2730 2735 

Ser Met Glu Gly Thr Thr Met Gly Arg Gly Leu Cys Leu lie Glu His 

2740 2745 2750 

Val Ser Tyr Met Lys Asn Thr Asp Ser Val Ser Asn Met Asn Lys Asn 

2755 2760 2765 

Phe Trp Ser Met Ala Glu Asp Gin Glu Glu Thr Asp Glu Asn Glu Asp 

2770 2775 2780 

Asp Asp Asp Glu Asn Glu Glu Asp Glu Asp Glu Asn Glu Glu Asn Thr 
2785 2790 2795 2800 

Glu Asn Thr Ser Val Val Lys Tyr Glu Pro Val Ser Lys Thr Ala Phe 

2805 2810 2815 

Ser Ser Ser Leu Lys Pro Pro Ser lie Phe He Ala Asp Glu Asp Tyr 

2820 2825 2830 

He Phe Leu Ser He Leu Tyr Glu Leu Ala Lys Ala Thr Ser Asp Cys 

2835 2840 2845 

Glu Thr Ala Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
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2850 2855 2860 

Ser Lys His Ser Ser Ser Ser Ser Ser Ser Asn Lys Lys Arg Lys Gin 
2865 2870 2875 2880 

Lys Asp Asp Val Asn Ser Thr Thr Thr Ala Leu His Ala Leu Arg Lys 

2885 2890 2895 

Cys Tyr lie Ser Cys Val Asp Gin Lys Thr Gly Met Pro Arg Met Asp 

2900 2905 2910 

Val Val Tyr Leu Leu Arg Gly Leu Met Asn Phe Gly Gly Met Cys Thr 

2915 2920 2925 

Ala lie Ala Ser Gly Asp Gly Glu Lys Ala His His Met Val Gin Thr 

2930 2935 2940 

Leu Cys Ser Val Asn lie Ala Thr Lys Thr Ala Val Val Phe Val Gly 
2945 2950 2955 2960 

Thr Lys Gly Asn Asn Leu Lys Thr Thr Leu Val Asp Leu Cys Lys Arg 

2965 2970 2975 

Thr Trp Phe Glu Arg Phe Thr Asn lie Asn Val Thr Ala Leu Asn Asn 

2980 2985 2990 

Ala Gly Asp Ser Ser Ser Ser Thr Gin Ala Asn Leu Ala Ser Phe Ala 

2995 3000 3005 

Gly Lys Lys Gly lie Val lie lie Asp Glu Val Gly His Gin Gly Ser 

3010 3015 3020 

Phe Gly Ser Lys Lys Ser Ser Ser Glu Asp Asp Lys Asp Glu Ser Ala 
3025 3030 3035 3040 

Ser Arg Ser Gly Asn Val Asp Phe Gly Gly Ser Gly Gly Glu Met Asn 

3045 3050 3055 

Ser Val Asp lie Asn Glu Ala Arg Asn Ala Tyr Gly Asp Gly Gly Asn 

3060 3065 3070 

Ser Lys lie Val Phe Ser Asn lie Asn Arg Leu Met Thr Glu Ser Lys 

3075 3080 3085 

Leu Lys Val Cys Asp Gin Glu Tyr Asp Phe lie Ser Glu Leu Lys His 

3090 3095 3100 

Glu Lys Asn Arg Lys Asn Ala Cys Asn Asp Thr Lys Lys Arg Lys Arg 
3105 3110 3115 3120 

Gly Gly Glu lie Glu Asp Glu Gly Val Glu Cys Glu Glu lie Glu Arg 

3125 3130 3135 

Asn Asp Gly Lys Asn Asp Glu Asn Gly Val Arg lie Lys Asp Pro lie 

3140 3145 3150 

Asn lie Ser Phe Phe Ala Arg Lys Ala His Trp Trp Asn Cys Ser Ser 

3155 3160 3165 

Gly Val Val Ser Thr Thr Phe Lys Glu Lys Asn lie Val Tyr Asn Met 

3170 3175 3180 

Leu His Arg Gly Ala Met Pro Phe Ser lie Lys Asp Cys Thr Asp Ser 
3185 3190 3195 3200 

Pro Trp Leu Asn Glu Thr Asp Ala Val Tyr Arg His Cys Lys Lys Pro 

3205 3210 3215 

lie Glu Tyr Glu Gly Lys Phe Ser Lys Ser Glu Val Lys Thr Ala Leu 

3220 3225 3230 

Lys Cys lie Leu Gly Lys Phe Gly Ser Lys lie Cys Asp Asn Glu Ser 

3235 3240 3245 

Phe Glu Ser lie lie Asp Glu Asn Cys Gin Val Asn Asn Leu His Ser 

3250 3255 3260 

Trp Asn Asp Cys Lys Glu Asp lie Asp Glu Trp Asn Glu Lys Phe Met 
3265 3270 3275 3280 

Ser Lys Asn Lys Lys Asn Lys Gin Asn Met Lys lie Glu Asp Lys Val 

3285 3290 3295 

Asp Ala lie Met Asn lie lie Gin Lys Asn Asn Gly Leu Leu Lys Trp 

3300 3305 3310 

Asn Thr Ser Phe Asp Arg Asp Gly Ser Pro Val Leu Val Cys Asn Pro 

3315 3320 3325 

Ala Thr Glu Arg Phe Ser Glu Met lie Thr Ser Ser Leu Ser Ala Gin 
3330 3335 3340 
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Asp Met Leu Glu He Lys Lys Tyr Leu Gly Asp Asn Cys Leu Ser Thr 
3345 3350 3355 3360 

Asn Gly Gly Val Lys Lys Ser Val He Asp Gly Asn Thr Ser Ala Pro 

3365 3370 3375 

Gly Val Leu He Ala Tyr His Cys Val Tyr Thr Gly Lys He Ser Asp 

3380 3385 3390 

Asp Leu Ser Lys Thr Asn Tyr Pro Val Leu Leu Pro Pro Pro Lys Lys 

3395 3400 3405 

Gin His Phe Val Ala Val Asp Asp Ala Ala Glu Lys Ala Leu Leu Gly 

3410 . 3415 3420 

Pro Thr Leu Ser Asn He Asn He Asp Ser He Arg Asn He Lys Thr 
3425 3430 3435 3440 

He Ser Arg Lys Leu Ser Ser He He Lys Asp Pro Glu Ala Ala Lys 

3445 3450 3455 

Leu Leu Val Asp Arg Asp Leu Asp Phe Met Asn Met Tyr Glu Arg Tyr 

3460 3465 3470 

Asp Ala Ser Leu Phe Asp Val Val Lys Lys Pro Ser Lys Tyr Ser Phe 

3475 3480 3485 

Pro Gly Phe Thr Ser Asp Gly Ser Val Val Leu Ser Thr Ser Thr Ser 

3490 3495 3500 

Asp Cys Glu Asn Val Leu Ser Cys Leu Lys Lys Arg He Glu Lys Asp 
3505 3510 3515 3520 

Lys Met Ser Ala Lys Asn Ser Gly Ser Phe He Arg Met Cys Met Asp 

3525 3530 3535 

Lys Asn Leu Leu Ser Asp Glu Lys Asp Asp Ser Ser Ser Asn Ser Ser 

3540 3545 3550 

Lys Asn Thr Ser Ser Leu Pro Lys Thr Asp Asp Asn Ser Ser Asp He 

3555 3560 3565 

Ala Asn Phe Leu Ser Val Phe Gly Glu Asn Arg Gin Gin Ser Ser Gin 

3570 3575 3580 

Phe Ser Phe Asn Ser Ser Gly Gly Gly Asp Ser Asn Lys Glu Ala Cys 
3585 3590 3595 3600 

Phe Asn Val Asp Thr Pro Lys Arg Arg Gin Leu Val Ser Ala Leu Gin 

3605 3610 3615 

Lys His Asn Ser Asp Gly Ser Ser Ser He He Thr Glu He Ala Lys 

3620 3625 3630 

Ala He Pro Gin Lys Asn Asp Val Ser Ser Ser He Thr Lys His Met 

3635 3640 3645 

Leu Pro Gly Gin Phe Pro Ser Ser Leu Leu Lys Asn Met Thr Ser Pro 

3650 3655 3660 

Gin Asn Ser Val Met He Arg Gly He Phe Gin Gin Gly Ala Lys Ser 
3665 3670 3675 3680 

Ser He Thr Val Ser Pro He Met Met Ser Asn Ser Tyr He Phe Ser 

3685 3690 3695 

Phe Phe Val Asp Glu Ala Met Ser Lys Arg Leu He Val Phe Pro Cys 

3700 3705 3710 

Asp Thr Thr Phe Val Phe Glu Asn Lys Asn Glu Asp Val Lys Lys He 

3715 3720 3725 

He Gly Leu Leu Asp Arg Gly Met Lys Tyr He His Ser Ser Leu Met 

3730 3735 3740 

Met Glu Arg Cys He Lys Phe Gly Lys His Gly He Lys Gin Arg Gin 
3745 3750 3755 3760 

His Glu Phe Asn His His Lys Lys Ala Trp Asn Asp Phe Ser Gly His 

3765 3770 3775 

Ser Ser Asp Asn Lys Lys Lys Asp Arg He Ser Asp Val Ser Ser Val 

3780 3785 3790 

Leu Pro Ser Val Leu Met Lys Asn Leu He Arg Asn Lys Val Leu Glu 

3795 3800 3805 

Leu Arg Asp Val Lys Ser Val Ser Arg Leu Glu Glu Asn Thr Asn Thr 

3810 3815 3820 

Phe Phe His Leu Tyr Thr Ser Met Ser Leu Cys Ala Lys Ala Ala Thr 
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3825 3830 3835 3840 

Asn Tyr Gly Glu Ser Ser Ser Ser Ser Ala Thr He Thr Glu Val Glu 

3845 3850 3855 

Glu Asp Asn Ser Cys Asp Ala Glu Glu Gin Gin Leu Arg Arg Lys Lys 

3860 3865 3870 

Pro Ala Asn Tyr Glu Ser Met Cys Asn Lys Leu Pro Ser Pro Leu Gin 

3875 3880 3885 

Met Cys Gin He Asn Pro Lys Ser Leu Asn Thr Met Ala Met Asn He 

3890 3895 3900 

Ala Arg Ser Arg Gin Gly Ala Trp Ala Gin Leu Asn Ser Met Leu Asn 
3905 3910 3915 3920 

Ser Val Leu Phe Val Glu Met Pro Phe Val Lys Thr Thr Arg Phe Phe 

3925 3930 3935 

Gly Arg Asp Phe Asn He Lys Met His Ser Pro Ala Thr Lys Asn Arg 

3940 3945 3950 

Pro Ala He Asn Phe Asp Asn Cys He Gly Met Ser Leu Pro Asn Pro 

3955 3960 3965 

Asp Met Asp Val Val Gly Tyr Asp Lys Glu Gly Glu Leu He Gly Val 

3970 3975 3980 

Gly Ser Ser Leu Thr Lys His Leu Cys Asp Ala Trp Gly Ser Met Asp 
3985 3990 3995 4000 

Val Arg Asp Leu Met Tyr Ser Cys His His Leu His Met Leu Phe Glu 

4005 4010 4015 

Met Ala Leu Gin Tyr Thr Glu Cys Lys Arg Arg Leu Ser Ser Leu Lys 

4020 4025 4030 

Thr Leu Lys Ser Asp Lys Thr Gly Val Asp Tyr Val Ala Val Met Leu 

4035 4040 4045 

Ala Cys Met Val Tyr Gin Leu Met Val Ser Asn Leu Lys Tyr Pro Val 

4050 4055 4060 

Phe Leu Ser Ser Ser Ser His Lys Arg Ala Asn Thr Glu Asp He Ala 
4065 4070 4075 4080 

Asp Glu Asn Gin Val Ser Ser Leu Ser Val Pro Met Phe Leu Ala Met 

4085 4090 4095 

Val Val Asn Lys Pro Leu His Ala Leu Arg His Ser Thr Asn Leu Ala 

4100 4105 4110 

Leu Pro Asn Ala Ser Gin Lys Ser Asp His Ser Asp He Val Lys Tyr 

4115 4120 4125 

He Val Met Asn Gin Trp Gly Leu Arg Leu Asn Pro Asp Tyr Leu Cys 

4130 4135 4140 

Pro Asn Cys Val Lys His Val Leu 
4145 4150 



<210> 168 
<211> 315 
<212> DNA 
<213> SHRIMP 



<400> 168 

atgtacctga 

aaaatgtacc 

acttcttcct 

atccagtact 

tcaaaggatc 

atatctcagt 



gccacattag 
atcataacaa 
ccatcaactg 
tcaacaaaac 
gcaactattc 
ggtga 



acaaacacct 
caataatcag 
ttcaagctgt 
tagcagaaat 
ttctttcgag 



ttggttgaag 
cactcctttg 
tcccgtgaaa 
aatactgcac 
tatgctgaaa 



aaagacgtgc tctcaccttc 60 
tgaattgcca gtgcaggcga 120 
ctttcaactc agtaaaggct 180 
atcatttcaa gatgccggct 240 



cggcagttgc tgctcacaat 



300 
315 



<210> 169 
<211> 104 
<212> PRT 
<213> SHRIMP 
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<400> 169 




























1 itr i_ 


^ y x 


Leu 


Ser 


HIS 


lie 


Arg 


Cain 


inr 


fro 


Lieu 


vai 


(jlU 


VjIU 


Arg 


Arg 


1 








5 










10 










15 




Ala 


Leu 


Thr 


Phe 
20 


Lys 


Met 


Tyr 


His 


His 
25 


Asn 


Asn 


Asn 


Asn 


Gin 
30 


His 


Ser 


Phe 


Val 


Asn 
35 


Cys 


Gin 


Cys 


Arg 


Arg 
40 


Thr 


Ser 


Ser 


Ser 


He 
45 


Asn 


Cys 


Ser 


Ser 


Cys 


Ser. Arg 


Glu 


Thr 


Phe 


Asn 


Ser 


Val 


Lys 


Ala 


He 


Gin 


Tyr 


Phe 




50 










55 










60 










Asn 


Lys 


Thr 


Ser 


Arg 


Asn 


Asn 


Thr 


Ala 


His 


His 


Phe 


Lys 


Met 


Pro 


Ala 


65 










70 










75 










80 


Ser 


Lys 


Asp 


Arg 


Asn 
85 


Tyr 


Ser 


Ser 


Phe 


Glu 
90 


Tyr 


Ala 


Glu 


Thr 


Ala 
95 


Val 


Ala 


Ala 


His 


Asn 


He 


Ser 


Gin 


Trp 



















100 



<210> 170 
<211> 3696 
<212> DNA 
<213> SHRIMP 



<400> 170 

atgagcaaga 

aatgcacctt 

gctgatacag 

agtggttcaa 

acactcaatg 

aaacactcca 

ggagattttg 

gaagctcttc 

atgcatggca 

gatgtgcgta 

actgcaccag 

gacatgagga 

tgtagtggca 

gacaacaaca 

aatatcgcaa 

tttagtgcct 

agtgaaggaa 

ttcaagtcta 

ggaagatata 

tgttgctcta 

ggtggtggtg 

ggaggaaatg 

gattttggaa 

agtgttgaag 

gtacactggg 

atgatgtacg 

ggcaataatg 

tgcggaaaca 

tatgatgcac 

atgtggttct 

tttagtttct 

ttcgagtttg 

agaaatgttg 

ccagtctttt 

ctatttggag 

caaaaattgg 

atcaaatctg 

tcaaacatga 



gttctagttc 
cctccaagat 
atactccaac 
tgggaagtgt 
tgattgctcc 
ttgaaagtgg 
tgaagcatat 
attcttccct 
ttaccaggga 
ttgccaatag 
aagaagaatt 
aggcgctagt 
gtaaccagag 
gatggcaact 
cctttattat 
tccccatctt 
gatcagagcc 
gggttttaaa 
acaaccacaa 
agcaaacctt 
gtatcagcaa 
ccgcctctac 
ttgaccatac 
ctacatctga 
ccaagatgtt 
cttgcagtgc 
caggagtgtg 
aggcagaata 
tccaagtgat 
atgtgcccaa 
ccaattcata 
gacagtgctt 
aagatgtgct 
gtggatcaga 
gacgagaaga 
gaactttgta 
tgtgtgcagc 
tcctctctat 



tactgtgaaa 
tgaactttta 
tgcaaaacct 
gaaatcggct 
aggcattcaa 
acctgttgtg 
gcgcaagacc 
gatgtacgtg 
tgaaaaattc 
tgacactgga 
catcatgaag 
ggagaatgat 
tgaattgtgc 
ttttgaatcc 
gaggcaatct 
gagggaaatc 
tgcaggattt 
tatggcacct 
gcgtagcgtg 
taatccacag 
actggctgta 
tctggcactg 
aaacctctgt 
tcaaggaaga 
tgctgaaaaa 
tgtgcctgat 
ctcgagggtc 
tgccttccta 
taacaataca 
tgatgccttg 
tacaggagga 
tgatttcatc 
tttggagacg 
tgaccagaag 
tgtggcagaa 
tgatgtacgt 
cgcctcttgg 
tgcaaaaatg 



tctgcatctt 
gaagatggat 
actggtctaa 
gtggctgata 
aatgctatag 
cgtgcaccag 
gaagtgtgcg 
tttaacaaca 
cccatcctca 
aaattatgtg 
acttggggac 
tgttggctac 
caagaagatg 
tttgacagaa 
atttccctct 
aaccaagaag 
gaaaagtatg 
ttagattttg 
ttcttgaatt 
cagcaacaac 
gtgactcaaa 
cacatgaacg 
gattgtaagg 
aaaaccaagt 
ctcttcaaaa 
gaaattgggg 
tctaccatcc 
aaagaaggaa 
gatctcacac 
gaagaagctg 
ggattattat 
aagaaattgg 
tcaaaaacct 
gaagtgcttc 
atgatgttta 
ctgagtctgc 
gctgcctcct 
gcattcacaa 



tttttaactc 
ggactaagaa 
gtatctcgtt 
gttgttctgg 
tttactacaa 
actgttcaga 
gaggcggcgg 
tgattcctgc 
tttttgtgtt 
ccaactcgta 
aaaaaccctt 
gcatcctcaa 
tgatcaattt 
gatcgtgcaa 
tcaagaacct 
aattgaatgt 
gcgacgctca 
gaagagttgt 
gtgcatatga 
aacaacaaca 
gggcccagtc 
cgtgtttcca 
gttgtacaaa 
tgtctcgcaa 
tgatgatcaa 
ctatttatgc 
tttctgatct 
agcacatgaa 
ctgaacaatc 
gcaaaatctt 
cactggacga 
tatcatgtct 
ccaacagata 
gagaagaatt 
tcgacttgga 
cagaaggagg 
gtgaagtacc 
agtattacca 



tttgatggaa 
agctgctgct 
gatggatatt 
catcatggct 
tgatttcgac 
gtgggaaggg 
cggtggcagt 
attcaagaaa 
caccgatgag 
cgattcggaa 
gactattctt 
tttctctaga 
ttctggatac 
tgtgcgcaag 
gaatgatcaa 
ttttattgaa 
acgtgaatct 
acagggagga 
tagtgccttt 
gtcttctagc 
aattacaggg 
gtcgttggac 
attgatggca 
gtatgctcgc 
ggaacaatca 
ctttgttaca 
gggaactgaa 
atctgctagc 
ctccatgttt 
ccaccagtca 
atacaagagg 
aaagattact 
ttttgccatc 
ggccagcgat 
gactgtgatt 
ttatgcagca 
ctccaacact 
agagcagaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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agcagttccg 
ggagagatcg 
aataagatag 
atcttgacaa 
cgacttgacc 
ctaaagatgg 
atgggcaaac 
actgcatcac 
gagatggcta 
attgctgttt 
tacaagctat 
tgctgcaaat 
tgtccctggt 
ctgatggatt 
acatctggtg 
tatttcttga 
aataagagac 
agtgatcttc 
aagacgacag 
aatgacctgg 
tttaatagtc 
tggaatactt 
acttccccat 
gttaatgtac 



agactgattt 
agaataatct 
gagttgacaa 
aggctggcga 
tatcagagaa 
aatatgataa 
caatggtcta 
ccacaaccct 
gtgatatcat 
cagctgcagc 
tgaatgaggg 
ccaagtacat 
gcaggtacgc 
tgattgaatg 
atgtgatgta 
aactcaagaa 
cagcgcctct 
ccgattcctt 
tcaatctaat 
acgggaaaga 
tatttgtaaa 
tcttgattca 
tttctgttgt 
ttgtccgtca 



ggatattatt 
ttccggtgta 
gaactttacc 
ctcaaagatt 
ggacttgaaa 
tgatgaaaaa 
tcactgggac 
caatcttttg 
tggagctctc 
gaaggctgtg 
caatattgat 
gatgtctact 
caacaaactc 
tggagcacct 
tgaagaactg 
cactgcaatt 
tcaggtcact 
gttggctgcc 
tggactcggg 
ccccttcatt 
cccatggttg 
tgtaaagaag 
tgtaatgaat 
aaaaatatgt 



ttgccctcta 
gtatttttga 
aataaactag 
ggggagaaat 
tatatatgca 
ttgtgcttga 
aataaattga 
aatgccaaga 
aatcttcccc 
gggattttgt 
attccagttg 
cttggtccgg 
gtcgcaatgg 
tctttggcta 
ggagagggtg 
gccgaaggaa 
tctccttctt 
attggcgagt 
gaagtaaagg 
tctctccagg 
gggtacaggc 
aatgatgtat 
gatggtagtg 
gtgtga 



ttcttgaagg 
ggtgcttaat 
aacacttttt 
atgagacgtt 
aaagatgtgg 
gatgcaaggg 
ccagagatcc 
aaattgatga 
caacagataa 
atgggaagac 
ctgtatgtgt 
acaagccgca 
ggagaggagg 
tggtggaaga 
aagagtttta 
a taaactgca 
caccgccaaa 
gtgcgatcga 
tagtggaaaa 
aatactgttc 
tcgatgagca 
ggaagttctt 
gtctgttgaa 



aactgccgat 
tacatgggcc 
ggctctgaga 
ccctgtacgt 
tgtaaaatct 
aaactacagg 
tcgtgccaaa 
caaagtaaag 
agacaatgaa 
ttgtcttctc 
agaatgtgat 
gaacagaaaa 
gaagaaactg 
agccatcagg 
cattatcgac 
acaaaataat 
gaaaatgagg 
gacaaaagaa 
tgttggacca 
atgggataaa 
atgggatgat 
gtgcaacaaa 
tgttgataat 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3696 



<210> 171 
<211> 1227 
<212> PRT 
<213> SHRIMP 



<400> 171 



Met 


Ser 


Lys 


Ser 


Ser 


Ser 


Ser 


Thr 


1 








5 








Ser 


Leu 


Met 


Glu 


Asn 


Ala 


Pro 


Ser 








20 










Gly 


Trp 


Thr 


Lys 


Lys 


Ala 


Ala 


Ala 






35 










40 


Lys 


Pro 


Thr 


Gly 


Leu 


Ser 


He 


Ser 




50 










55 




Gly 


Ser 


Val 


Lys 


Ser 


Ala 


Val 


Ala 


65 










70 






Thr 


Leu 


Asn 


Val 


He 


Ala 


Pro 


Gly 










85 








Asn 


Asp 


Phe 


Asp 


Lys 


His 


Ser 


He 








100 










Pro 


Asp 


Cys 


Ser 


Glu 


Trp 


Glu 


Gly 






115 










120 


Lys 


Thr 


Glu 


Val 


Cys 


Gly 


Gly 


Gly 




130 










135 




Ser 


Ser 


Leu 


Met 


Tyr 


Val 


Phe 


Asn 


145 










150 






Met 


His 


Gly 


He 


Thr 


Arg 


Asp 


Glu 










165 








Phe 


Thr 


Asp 


Glu 


Asp 


Val 


Arg 


He 








180 










Cys 


Ala 


Asn 


Ser 


Tyr 


Asp 


Ser 


Glu 






195 










200 


Met 


Lys 


Thr 


Trp 


Gly 


Gin 


Lys 


Pro 




210 










215 




Ala 


Leu 


Val 


Glu 


Asn 


Asp 


Cys 


Trp 


225 










230 







val 


Lys 


Ser 


Ala 


Ser 


Phe 


Phe 


Asn 




10 










15 




Ser 


Lys 


He 


Glu 


Leu 


Leu 


Glu 


Asp 


25 










30 






Ala 


Asp 


Thr 


Asp 


Thr 


Pro 


Thr 


Ala 










45 








Leu 


Met 


Asp 


He 


Ser 


Gly 


Ser 


Met 








60 










Asp 


Ser 


Cys 


Ser 


Gly 


He 


Met 


Ala 






75 










80 


He 


Gin 


Asn 


Ala 


He 


Val 


Tyr 


Tyr 




90 










95 




Glu 


Ser 


Gly 


Pro 


Val 


Val 


Arg 


Ala 


105 










110 






Gly 


Asp 


Phe 


val 


Lys 


His 


Met 


Arg 










125 








Gly 


Gly 


Gly 


Ser 


Glu 


Ala 


Leu 


His 








140 










Asn 


Met 


He 


Pro 


Ala 


Phe 


Lys 


Lys 






155 










160 


Lys 


Phe 


Pro 


He 


Leu 


He 


Phe 


Val 




170 










175 




Ala 


Asn 


Ser 


Asp 


Thr 


Gly 


Lys 


Leu 


185 










190 






Thr 


Ala 


Pro 


Glu 


Glu 


Glu 


Phe 


He 










205 








Leu 


Thr 


He 


Leu 


Asp 


Met 


Arg 


Lys 








220 










Leu 


Arg 


He 


Leu 


Asn 


Phe 


Ser 


Arg 






235 










240 
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Cys 


Ser 


Gly 


Ser 


Pne 


Ser 


Gly 


Tyr 










Arg 


Arg 


Ser 


Cys 






27 5 




Gin 


Ser 


lie 


Ser 




290 






Pro 


He 


Leu 


Arg 


305 








Ser 


Glu 


Gly 


Arg 


Gin 


Arg 


Glu 


Ser 








34 0 


Phe 


Gly 


Arg 


Val 










Ser 


Val 


Phe 


Leu 




37 0 






Gin 


Thr 


Phe 


Asn 


385 








Gly 


Gly 


Gly 


Gly 


Ser 


lie 


Thr 


Gly 








420 


Asn 


Ala 


Cys 


Phe 






435 




Leu 


Cys 


Asp 


Cys 




450 






Thr 


Ser 


Asp 


Gin 


465 








Val 


His 


Trp 


Ala 


Lys 


Glu 


Gin 


Ser 








500 


Gly 


Ala 


He 


Tyr 






515 




Arg 


Val 


Ser 


Thr 




530 






Ala 


Glu 


Tyr 


Ala 


545 








Tyr 


Asp 


Ala 


Leu 


Ser 


Ser 


Met 


Phe 








580 


Ala 


Gly 


Lys 


lie 






595 




Gly 


Gly 


Gly 


Leu 




610 






Gin 


Cys 


Phe 


Asp 










Arg 


Asn 


Val 


Glu 


Tyr 


Phe 


Ala 


He 








660 


Leu 


Arg 


Glu 


Glu 






675 




Ala 


Glu 


Met 


Met 




690 






Thr 


Leu 


Tyr 


Asp 


705 








He 


Lys 


Ser 


val 



Asn Gin Ser Glu 
245 

Asp Asn Asn Arg 

Asn Val Arg Lys 
280 

Leu Phe Lys Asn 
295 

Glu He Asn Gin 
310 

Ser Glu Pro Ala 
325 

Phe Lys Ser Arg 

Val Gin Gly Gly 
360 

Asn Cys Ala Tyr 
375 

Pro Gin Gin Gin 
390 

He Ser Lys Leu 
405 

Gly Gly Asn Ala 

Gin Ser Leu Asp 
440 

Lys Gly Cys Thr 
455 

Gly Arg Lys Thr 
470 

Lys Met Phe Ala 
485 

Met Met Tyr Ala 

Ala Phe Val Thr 
520 

He Leu Ser Asp 
535 

Phe Leu Lys Glu 
550 

Gin Val He Asn 
565 

Met Trp Phe Tyr 

Phe His Gin Ser 
600 

Leu Ser Leu Asp 
615 

Phe He Lys Lys 
630 

Asp Val Leu Leu 
645 

Pro Val Phe Cys 

Leu Ala Ser Asp 
680 

Phe He Asp Leu 
695 

Val Arg Leu Ser 
710 

Cys Ala Ala Ala 



Leu 


Cys 




(Z~\ ii 
ul U 










Trp 


bin 


Leu 


rne 


265 








Asn 


He 


Ala 


Thr 


Leu 


Asn 


Asp 


\j±Tl 








*a a r\ 


bill 


CalU 


Leu 


Asn 










Gly 


pne 


Glu 


Lys 










vaj_ 


Leu 


Asn 


Met 


345 








Gly 


Arg 


Tyr 


Asn 


Asp 


Ser 


Ala 


Pne 








JOU 


Gin 


Gin 


Gin 


Gin 










ax a 


val 


val 


Tnr 




410 






Ala 


Ser 


Thr 


Leu 


425 








Asp 


Phe 


Gly 


He 


Lys 


Leu 


Met 


Ala 








A £L fi 

4 bU 


Lys 


Leu 


Ser 


Arg 






47b 




Glu 


Lys 


Leu 


Phe 




490 






Cys 


Ser 


Ala 


Val 


505 








Gly 


Asn 


Asn 


Ala 


Leu 


Gly 


Thr 


Glu 








540 


Gly 


Lys 


His 


Met 






t c c 

555 




Asn 


Thr 


Asp 


Leu 




D /O 






val 


Pro 


Asn 


Asp 


585 








Phe 


Ser 


Phe 


Ser 


Glu 


Tyr 


Lys 


Arg 








tort 
62 0 


Leu 


Val 


Ser 


Cys 






O _5 13 




Glu 


Thr 


Ser 


Lys 




650 






Gly 


Ser 


Asp 


Asp 


665 








Leu 


Phe 


Gly 


Gly 


Glu 


Thr 


Val 


He 








700 


Leu 


Pro 


Glu 


Gly 






715 




Ser 


Trp 


Ala 


Ala 



Asp Val He Asn 
255 

Glu Ser Phe Asp 
270 

Phe He Met Arg 
285 

Phe Ser Ala Phe 

Val Phe lie Glu 
320 

Tyr Gly Asp Ala 
335 

Ala Pro Leu Asp 
350 

Asn His Lys Arg 
365 

Cys Cys Ser Lys 

Gin Ser Ser Ser 
400 

Gin Arg Ala Gin 
415 

Ala Leu His Met 
430 

Asp His Thr Asn 
445 

Ser Val Glu Ala 

Lys Tyr Ala Arg 
480 

Lys Met Met He 
495 

Pro Asp Glu He 
510 

Gly Val Cys Ser 
525 

Cys Gly Asn Lys 

Lys Ser Ala Ser 
560 

Thr Pro Glu Gin 
575 

Ala Leu Glu Glu 
590 

Asn Ser Tyr Thr 
605 

Phe Glu Phe Gly 

Leu Lys He Thr 
640 

Thr Ser Asn Arg 
655 

Gin Lys Glu Val 
670 

Arg Glu Asp Val 
685 

Gin Lys Leu Gly 

Gly Tyr Ala Ala 
720 

Ser Cys Glu Val 
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725 
























Pro 


Ser 


Asn 


Thr 
740 


Ser 


Asn 


Met 


He 


Leu 
745 


Ser 


He 


Ala 


Lys 


Met 
750 


Aid 


t*ne 


Thr 


Lys 


Tyr 


Tyr 


Gin 


Glu 


Gin 


Asn 


Ser 


Ser 


Ser 


Glu 


Thr 


Asp 


Leu 


Asp 




755 










760 










I DO 








lie 


lie 


Leu 


Pro 


Ser 


He 


Gly 


Thr 


Ala 


Asp 


Gly Glu 


lie 


GlU 


Asn 


Asn 




770 










775 










780 










Leu 


Ser 


Gly 


Val 


Val 


Phe 


Leu 


Arg 


Cys 


Leu 


He 


Thr 


Trp 


Ala 


Asn 


Lys 


785 








790 










795 










oUU 


lie 


Gly 


Val 


Asp 


Lys 


Asn 


Phe 


Thr 


Asn 


Lys 


Leu 


Glu 


His 


Phe 


Leu 


Ala 






805 










810 










Old 

ol t> 




Leu 


Arg 


He 


Leu 


Thr 


Lys 


Ala 


Gly 


Asp 


Ser 


Lys 


He 


Gly 


Glu 


Lys 


Tyr 






820 










825 










O O A 






Glu 


Thr 


Phe 
835 


Pro 


Val 


Arg 


Arg 


Leu 
840 


Asp 


Leu 


Ser 


Glu 


Lys 
845 


Asp 


Leu 


Lys 


Tyr 


He 


Cys 


Lys 


Arg 


Cys 


Gly 


Val 


Lys 


Ser 


Leu 


Lys 


Met 


Glu 


Tyr 


Asp 


850 










855 










860 










Asn 


Asp 


Glu 


Lys 


Leu 


Cys 


Leu 


Arg 


Cys 


Lys 


Gly Asn 


Tyr 


Arg 


Met 


Gly 


865 








870 










875 










880 


Lys 


Pro 


Met 


Val 


Tyr 


His 


Trp 


Asp 


Asn 


Lys 


Leu 


Thr 


Arg 


Asp 


Pro 


Arg 








885 










890 










one 




Ala 


Lys 


Thr 


Asp 


Thr 


Thr 


Leu 


Asn 


Leu 


Leu 


Asn 


Ala 


Lys 


Lys 


He 


Asp 






900 










905 










910 






Asp 


Lys 


Val 


Lys 


Glu 


Met 


Ala 


Ser 


Asp 


He 


He Gly Ala 


Leu 


Asn 


Leu 




915 










920 










925 








Pro 


Pro 
930 


Thr 


Asp 


Lys 


Asp 


Asn 
935 


Glu 


He 


Ala 


Val 


Ser 
940 


Ala 


Ala 


Ala 


Lys 


Ala 


Val 


Gly 


He 


Leu 


Tyr 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


Tyr 


Lys 


Leu 


Leu 


94 5 








950 










955 










960 


Asn 


Glu 


Gly 


Asn 


He 


Asp 


He 


Pro 


Val 


Ala 


Val 


Cys 


Val 


Glu 


Cys 


Asp 








965 










970 










975 




Cys 


Cys 


Lys 


Ser 


Lys 


Tyr 


Met 


Met 


Ser 


Thr 


Leu 


Gly 


Pro 


Asp 


Lys 


Pro 






980 










985 










990 






Gin 


Asn 


Arg 


Lys 


Cys 


Pro 


Trp 


Cys 


Arg 


Tyr 


Ala 


Asn 


Lys 


Leu 


Val 


Ala 






995 










1000 








1005 






Met 


Gly 


Arg 


Gly Gly 


Lys 


Lys 


Leu 


Leu 


Met 


Asp 


Leu 


He 


Glu 


Cys 


Gly 




1010 








1015 








1020 








Ala 


Pro 


Ser 


Leu 


Ala 


Met 


Val 


Glu 


Glu 


Ala 


He 


Arg 


Thr 


Ser 


Gly 


Asp 


1025 








1030 








1035 








1040 


Val 


Met 


Tyr 


Glu 


Glu 


Leu 


Gly 


Glu 


Gly 


Glu 


Glu 


Phe 


Tyr 


He 


He 


Asp 








1045 








1050 








1055 


Tyr 


Phe 


Leu 


Lys 


Leu 


Lys 


Asn 


Thr 


Ala 


He 


Ala 


Glu 


Gly 


Asn 


Lys 


Leu 






1060 








1065 








1070 




Gin 


Gin 


Asn 


Asn 


Asn 


Lys 


Arg 


Pro 


Ala 


Pro 


Leu 


Gin 


Val 


Thr 


Ser 


Pro 



1075 1080 1085 



Ser Ser Pro Pro Lys Lys Met Arg Ser Asp Leu Pro Asp Ser Leu Leu 

1090 1095 1100 

Ala Ala He Gly Glu Cys Ala He Glu Thr Lys Glu Lys Thr Thr Val 
1105 1110 1115 1120 

Asn Leu He Gly Leu Gly Glu Val Lys Val Val Glu Asn Val Gly Pro 

1125 H30 1135 

Asn Asp Leu Asp Gly Lys Asp Pro Phe He Ser Leu Gin Glu Tyr Cys 

1140 1145 1150 

Ser Trp Asp Lys Phe Asn Ser Leu Phe Val Asn Pro Trp Leu Gly Tyr 

1155 1160 1165 

Arg Leu Asp Glu Gin Trp Asp Asp Trp Asn Thr Phe Leu He His Val 

1170 1175 1180 

Lys Lys Asn Asp Val Trp Lys Phe Leu Cys Asn Lys Thr Ser Pro Phe 
1185 1190 1195 1200 

Ser val Val Val Met Asn Asp Gly Ser Gly Leu Leu Asn Val Asp Asn 
1205 1210 1215 
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Val Asn Val Leu Val Arg Gin Lys He Cys Val 
1220 1225 



<210> 172 
<211> 294 
<212> DNA 
<213> SHRIMP 



<400> 172 

atggacattt 

tttgtgaatg 

agagaagctg 

ggagatgtgg 

atggcggcag 



tggaagacat ctacaagagc gcgatcacgc tcgtactaca atcgcccgaa 60 

atgtaaaaca ggaagcttct caggtagttg aggggctaat accttcaatt 120 

tctttagacg gcttctagaa gaagaaagga aaaaacacga agacgaggtg 180 

aagataaaag acaagcagtg atagacaagg caaatacaat gattacaaca 240 

agtacctgga atctgtagat attttagaag agtttgggtt ttaa 294 



<210> 173 
<211> 93 
<212> PRT 
<213> SHRIMP 

<400> 173 

Met Asp He Asp He Tyr Lys Ser Ala He Thr Leu Val Leu Gin Ser 

15 10 15 

Pro Glu Phe Val Asn Asp Val Lys Gin Glu Ala Ser Gin Val Val Glu 

20 25 30 

Gly Leu He Pro Ser He Arg Glu Ala Val Phe Arg Arg Leu Leu Glu 

35 40 45 

Glu Glu Arg Lys Lys His Glu Asp Glu Val Gly Asp Val Glu Asp Lys 

50 55 60 

Arg Gin Ala Val He Asp Lys Ala Asn Thr Met He Thr Thr Met Ala 
65 70 75 80 

Ala Glu Tyr Leu Glu Ser Val Asp He Glu Phe Gly Phe 
85 90 



<210> 174 
<211> 1530 
<212> DNA 
<213> SHRIMP 



<400> 174 

atggcgtccg 

atcgagtcta 

ttgcataaaa 

gcaggcaacc 

aatttctata 

gatttgagga 

actttctccc 

tgcaagtaca 

gtcgaaaaga 

cacgatatag 

agttgtcatg 

aatatgtacc 

gttacacatt 

ccaaacgagg 

accaacaaat 

agaataggag 

acaaacagat 

atgtgttcaa 

tctgcctacg 



ggtttgctat 
tcaaaagtat 
atactccaaa 
cctgcttgga 
caaggaaagg 
atatcaacaa 
ctctatatag 
taattgtaac 
aactgttaag 
cacatcgatt 
accaatgctt 
ccacatttaa 
cggccgaaaa 
tataccttga 
tggatgctat 
acaaaagtac 
tcaatgaaga 
attattgtac 
ggccaacatc 



caagggaatc 
acggagaagc 
acatgaggtg 
aaaaacgtca 
tgctgaatct 
tatagtgaaa 
aa ttctcact 
tgctgattta 
tatgggtagt 
atcctacaaa 
agttgttcca 
aaatt ttgat 
aatggacggt 
cgccgcaagg 
cgatgaagaa 
gtgtctcgtg 
cacaccccta 
aatgagagat 
ttaccctatc 



gttaaaaact 
gaactcgctg 
gatgaaattg 
ttatttcttc 
gaatctgatt 
agagaagctt 
gatagactat 
ttgatgggtt 
attttagggg 
ggcctccgca 
gtgagtatgt 
caatgcatgg 
aagcatgaac 
aggaaatacc 
gcaagggaag 
tttgcattat 
tattctggta 
gagggtggtt 
atcttcaata 



atagaagaat 
aaggtgtata 
tgaataaaat 
aacatcattc 
ggctcaaaag 
taccccatga 
tcaatgcagc 
gtgggataac 
gcgaatcaat 
tagaaaatcc 
tagggaagat 
cattattttt 
gtaataaggt 
tagaagaaaa 
aatatggaaa 
ctgcacgaga 
cagaaagagg 
tcaggccccg 
ctttatatga 



accgtccatt 
tatcgtctct 
acgcctctca 
acaaatgagg 
actaccagaa 
caagtctttc 
aattcacaac 
caacaacaaa 
ggtaccttta 
tatagtgggt 
tttttcaagt 
gaatgcagtt 
catccatatg 
actagaggaa 
cgaaatagga 
ctttttcctc 
aatcagattc 
tttgatcatg 
tcaattcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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gtgcaatatt atccatgtgt ttctggagtt gttttatctt ttattggcga tgatcagtta 1200 

gcaccagaac cagaatcatt agtggacatt gttgtacgtt ctataaaaaa tccgtctatt 1260 

agaatttttt ctggtgatgg tgaaacagta taccaggatg gacgtagggt cgatgttggt 1320 

ggtgagggaa agaatcagaa gtttaaccga gaagagcgca ccattttaaa tgtattgagg 1380 

ataattaaag catataatga agaacgaact aaagaagatg aagatgaaga ggaagaagag 1440 

gaggaagaag aggaacaaca aacagcagca acagtgacag tagaaagtga ttgggatctc 1500 

tcactagaga ggggggagaa ttgggtgtag 1530 

<210> 175 
<211> 507 
<212> PRT 
<213> SHRIMP 



<400> 175 



Met 


Ala 


Ser 


Gly 


Phe 


Ala 


lie 


Lys 


Gly 


He 


Val 


Lys 


Asn 


Tyr 


Arg 


Arg 


1 








5 










10 










15 




He 


Pro 


Ser 


He 


He 


Glu 


Ser 


He 


Lys 


Ser 


He 


Arg 


Arg 


Ser 


Glu 


Leu 








20 










25 










30 






Ala 


Glu 


Gly 


Val 


Tyr 


He 


Val 


Ser 


Leu 


His 


Lys 


Asn 


Thr 


Pro 


Lys 


His 






35 










40 










45 








Glu 


Val 


Asp 


Glu 


He 


Val 


Asn 


Lys 


He 


Arg 


Leu 


Ser 


Ala 


Gly 


Asn 


Pro 




50 










55 










60 










Cys 


Leu 


Glu 


Lys 


Thr 


Ser 


Leu 


Phe 


Leu 


Gin 


His 


His 


Ser 


Gin 


Met 


Arg 


65 










70 










75 










80 


Asn 


Phe 


Tyr 


Thr 


Arg 


Lys 


Gly 


Ala 


Glu 


Ser 


Glu 


Ser 


Asp 


Trp 


Leu 


Lys 










85 










90 










95 




Arg 


Leu 


Pro 


Glu 


Asp 


Leu 


Arg 


Asn 


He 


Asn 


Asn 


He 


Val 


Lys 


Arg 


Glu 








100 










105 










110 






Ala 


Leu 


Pro 


His 


Asp 


Lys 


Ser 


Phe 


Thr 


Phe 


Ser 


Pro 


Leu 


Tyr 


Arg 


lie 






115 










120 










125 








Leu 


Thr 


Asp 


Arg 


Leu 


Phe 


Asn 


Ala 


Ala 


He 


His 


Asn 


Cys 


Lys 


Tyr 


He 




130 










135 










140 










He 


Val 


Thr 


Ala 


Asp 


Leu 


Leu 


Met 


Gly 


Cys 


Gly 


He 


Thr 


Asn 


Asn 


Lys 


145 










150 










155 










160 


Val 


Glu 


Lys 


Lys 


Leu 


Leu 


Ser 


Met 


Gly 


Ser 


He 


Leu 


Gly 


Gly 


Glu 


Ser 








165 










1 /o 










1 /o 




Met 


Val 


Pro 


Leu 


His 


Asp 


He 


Ala 


His 


Arg 


Leu 


Ser 


Tyr 


Lys 


Gly 


Leu 








180 










185 










190 






Arg 


He 


Glu 


Asn 


Pro 


He 


Val 


Gly 


Ser 


Cys 


His 


Asp 


Gin 


Cys 


Leu 


Val 






195 










200 










205 








Val 


Pro 


Val 


Ser 


Met 


Leu 


Gly 


Lys 


He 


Phe 


Ser 


Ser 


Asn 


Met 


Tyr 


Pro 




210 










215 










220 










Thr 


Phe 


Lys 


Asn 


Phe 


Asp 


Gin 


Cys 


Met 


Ala 


Leu 


Phe 


Leu 


Asn 


Ala 


Val 


225 








230 










235 










240 


Val 


Thr 


His 


Ser 


Ala 


Glu 


Lys 


Met 


Asp 


Gly 


Lys 


His 


Glu 


Arg 


Asn 


Lys 










245 










250 










255 




Val 


He 


His 


Met 


Pro 


Asn 


Glu 


val 


Tyr 


Leu 


Asp 


Ala 


Ala 


Arg 


Arg 


Lys 








260 










265 










270 






Tyr 


Leu 


Glu 


Glu 


Lys 


Leu 


Glu 


Glu 


Thr 


Asn 


Lys 


Leu 


Asp 


Ala 


He 


Asp 






275 










280 










285 








Glu 


Glu 


Ala 


Arg 


Glu 


Glu 


Tyr 


Gly 


Asn 


Glu 


He 


Gly 


Arg 


He 


Gly 


Asp 




290 










295 










300 










Lys 


Ser 


Thr 


Cys 


Leu 


Val 


Phe 


Ala 


Leu 


Ser 


Ala 


Arg 


Asp 


Phe 


Phe 


Leu 


305 










310 










315 










320 


Thr 


Asn 


Arg 


Phe 


Asn 


Glu 


Asp 


Thr 


Pro 


Lys 


Gly 


Thr 


Glu 


Arg 


Gly 


He 










325 










330 










335 




Arg 


Phe 


Met 


Cys 


Ser 


Asn 


Tyr 


Cys 


Thr 


Met 


Arg 


Asp 


Glu 


Gly 


Gly 


Phe 








340 










345 










350 






Arg 


Pro 


Arg 


Leu 


He 


Met 


Ser 


Ala 


Tyr 


Gly 


Pro 


Thr 


Ser 


Tyr 


Pro 


He 






355 










360 










365 








He 


Phe 


Asn 


Thr 


Leu 


Tyr 


Asp 


Gin 


Phe 


Asn 


Val 


Gin 


Tyr 


Tyr 


Pro 


Cys 
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vet J. 


Se r 




vax 


385 








Glu 


Pro 


Glu 


Ser 


Ser 


lie 


Arg 


lie 








420 


Arg 


Arg 


val 


Asp 






435 




Glu 


Glu 


Arg 


Thr 




450 






Glu 


Glu 


Arg 


Thr 


465 








Glu 


Glu 


Glu 


Gin 


Asp 


Leu 


Ser 


Leu 



500 



375 

Val Leu Ser Phe 
390 

Leu Val Asp lie 
405 

Phe Ser Gly Asp 

Val Gly Gly Glu 
440 

lie Leu Asn Val 
455 

Lys Glu Asp Glu 
470 

Gin Thr Ala Ala 
485 

Glu Arg Gly Glu 



380 

lie Gly Asp Asp 
395 

Val Val Arg Ser 
410 

Gly Glu Thr Val 
425 

Gly Lys Asn Gin 

Leu Arg lie lie 
460 

Asp Glu Glu Glu 
475 

Thr Val Thr Val 

490 
Asn Trp Val 
505 



Gin Leu Ala Pro 
400 

lie Lys Asn Pro 
415 

Tyr Gin Asp Gly 
430 

Lys Phe Asn Arg 
445 

Lys Ala Tyr Asn 

Glu Glu Glu Glu 
480 

Glu Ser Asp Trp 
495 



<210> 176 
<211> 246 
<212> DNA 
<213> SHRIMP 



<400> 176 

atgacttgtc cagaaatctc 
gctgacccag gtggcctcag 
aaaaactttt caaaaatttt 
ccttatgtca agttcgaagg 
agctag 

<210> 177 
<211> 77 
<212> PRT 
<213> SHRIMP 



taaacacatt tctggaacag 
ctatcctttc aaccctcttt 
ttcagctcac tccagtttag 
gtggaccgct gggtcgaccc 



acagacgttt ctggaacacg 60 
ttacccttca tctccatctc 120 
ggggtggacc gctgactagg 180 
aacgtcagat tacagagagg 240 

246 



<400> 177 




























Met 


Thr 


Cys 


Pro 


Glu 


He 


Ser 


Lys 


His 


Gly 


Thr 


Asp 


Arg 


Arg 


Phe 


Trp 


1 








5 










10 










15 




Asn 


Thr 


Ala 


Asp 


Pro 


Gly 


Gly 


Leu 


Ser 


Tyr 


Pro 


Phe 


Asn 


Pro 


Leu 


Phe 








20 










25 










30 






Thr 


Leu 


His 


Leu 


His 


Leu 


Lys 


Asn 


Phe 


Ser 


Lys 


He 


Phe 


Ser 


Ala 


His 






35 










40 










45 








Ser 


Ser 


Leu 


Gly 


Gly Gly 


Pro 


Leu 


Trp 


Tyr 


Val 


Lys 


Phe 


Glu 


Gly 


Trp 




50 










55 










60 










Thr 


Ala 


Gly 


Ser 


Thr 


Gin 


Arg 


Gin 


He 


Thr 


Glu 


Arg 


Ser 









65 70 75 



<210> 178 
<211> 738 
<212> DNA 
<213> SHRIMP 



<400> 178 

atggtttcca ccaggtctat 
tctcccacga ccagtaagag 
gaatcagtag aaacacaccc 
attgatgaaa atgttgatgg 
atggagagag aaagacttgc 
gccgcaacgg aagcgaggaa 



ggaagcaaaa gctgcagcag 
aaaggcggag gacctcactg 
gccgagtaag ctccccagag 
tgatgtgcag atcctcgcct 
cgaagccatg gtccgagaca 
agaaatagcc tctcgcctaa 



cagcaaaagc aaaagaagtt 60 
aaggaacaga agaagaagaa 120 
tcgatgaaga tgaagtctat 180 
catcaatcga agtcgccaga 240 
taaaaatcga ggaagaaaaa 300 
tttataaaga aatggtatat 360 
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cttttgcctc 
aacgaaatga 
tttgaattcc 
ggttcgagat 
atgtgtaata 
gttgaagtta 
atagtaaaac 



aactggaaaa 
ccattacaga 
ctatactgac 
tctacaacga 
aattcataga 
gtgtatcaag 
acacctag 



catgactaac 
ccgcacgttc 
tgatattgct 
tatgaagatt 
gtctgttgtg 
tgaacttgaa 



cgcctccgtc 
agtgatttgc 
ttccttgccc 
ggacctataa 
caaaaggtga 
gggtcacctt 



cgagatcact 
agatattcaa 
gtgaaaaatc 
cagcctacaa 
aggcagaaat 
tttgggattt 



tctcaggcac 420 
caaagtcact 480 
acgtgtcgag 540 
attgaatttg 600 
atccccattt 660 
caagcaaaga 720 
738 



<210> 179 
<211> 245 
<212> PRT 
<213> SHRIMP 



<400> 179 



Met 


Val 


Ser 


Thr 


Arg 


Ser 


Met 


Glu 


Ala 


Lys 


Ala 


Ala 


Ala 


Ala 


Ala 


Lys 


1 








5 










10 










15 




Ala 


Lys 


Glu 


Val 


Ser 


Pro 


Thr 


Thr 


Ser 


Lys 


Arg 


Lys 


Ala 


Glu 


Asp 


Leu 








20 










25 










30 






Thr 


Glu 


Gly 


Thr 


Glu 


Glu 


Glu 


Glu 


Glu 


Ser 


Val 


Glu 


Thr 


His 


Pro 


Pro 






35 










40 










A C 

HD 








Ser 


Lys 


Leu 


Pro 


Arg 


Val 


Asp 


Glu 


Asp 


Glu 


Val 


Tyr 


He 


Asp 


Glu 


Asn 




50 










55 










60 










Val 


Asp 


Gly 


Asp 


Val 


Gin 


He 


Leu 


Ala 


Ser 


Ser 


He 


Glu 


Val 


Ala 


Arg 


65 










70 










75 










80 


Met 


Glu 


Arg 


Glu 


Arg 


Leu 


Ala 


Glu 


Ala 


Met 


Val 


Arg 


Asp 


He 


Lys 


He 










85 










90 










95 




Glu 


Glu 


Glu 


Lys 


Ala 


Ala 


Thr 


Glu 


Ala 


Arg 


Lys 


Glu 


He 


Ala 


Ser 


Arg 








100 










105 










110 






Leu 


He 


Tyr 


Lys 


Glu 


Met 


Val 


Tyr 


Leu 


Leu 


Pro 


Gin 


Leu 


Glu 


Asn 


Met 






115 










120 










125 








Thr 


Asn 


Arg 


Leu 


Arg 


Pro 


Arg 


Ser 


Leu 


Leu 


Arg 


His 


Asn 


Glu 


Met 


Thr 




130 










135 










140 










He 


Thr 


Asp 


Arg 


Thr 


Phe 


Ser 


Asp 


Leu 


Gin 


He 


Phe 


Asn 


Lys 


Val 


Thr 


145 










150 










155 










160 


Phe 


Glu 


Phe 


Pro 


He 


Leu 


Thr 


Asp 


He 


Ala 


Phe 


Leu 


Ala 


Arg 


Glu 


Lys 










165 










170 










175 




Ser 


Arg 


Val 


Glu 


Gly 


Ser 


Arg 


Phe 


Tyr 


Asn 


Asp 


Met 


Lys 


He 


Gly 


Pro 








180 










185 










190 






He 


Thr 


Ala 


Tyr 


Lys 


Leu 


Asn 


Leu 


Met 


Cys 


Asn 


Lys 


Phe 


He 


Glu 


Ser 






195 










200 










205 








Val 


Val 


Gin 


Lys 


Val 


Lys 


Ala 


Glu 


He 


Ser 


Pro 


Phe 


Val 


Glu 


Val 


Ser 




210 






215 










220 










Val 


Ser 


Ser 


Glu 


Leu 


Glu 


Gly 


Ser 


Pro 


Phe 


Trp 


Asp 


Phe 


Lys 


Gin 


Arg 


225 










230 










235 










240 


He 


Val 


Lys 


His 


Thr 

























245 



<210> 180 
<211> 1221 
<212> DNA 
<213> SHRIMP 

<400> 180 

atgtctcaca tcaactctac 
accactacag cccctatgat 
gcttctgctg ttacaagtat 
ccacaaggta tctctgttac 
tctactggaa cgaggatgtc 
aagaacaagt ccaagtctaa 
cgtagcagtg acgaggacga 



ctctgctgcc acgacttcat 
tgctgccgcc agagctgctg 
caactctaat tctacgtctt 
ggccatgcct cccgtgccag 
ttctacaccc aatgtggatg 
gaagaaggat tcaaagagga 
accatctctt gttatcgacg 



ccaacactct gccgatttgc 60 
ccatcgcctc tcggacttct 120 
cttctgcaat gttccgagta 180 
cacttacatc tctgactgaa 240 
ttatacctgt tcctggcccc 300 
agaagaacca gaatggcaac 360 
acggttctgg aagacagtct 420 
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aagaacaaga 
gacactctcg 
attgacgcag 
gcactcactt 
acttcttacg 
acagttagca 
agtaaggtga 
gctagctgta 
tttgatttta 
ttttttgaga 
agaggcatct 
caacaacgcc 
acaacttctg 
caagaaatca 



aatattcttg 
ccccacctag 
ggctaagtaa 
ctggagttgg 
gagattgcga 
gaggcacatc 
tccataccgt 
tggcctacgc 
tggctatgac 
agcgcgataa 
ttacacgcac 
gtgaagctgc 
ctgccggagt 
gataccagta 



ggtcacatct 
gcccttcctt 
tccagtcact 
agataatgga 
aacacccata 
tcttctgcac 
taaaccaaga 
tgtggacctt 
tgcatgggca 
ggacatcgga 
cactgaaatg 
tgccgctgcc 
tggtgctact 
a 



cttgctacta 
cccacacccg 
cgccaaatca 
ctatatcctt 
cctggacctg 
agagaagagg 
aacagcaagt 
ctaaacaacc 
gcccgtcaac 
gaatatagga 
aataaacgcg 
gctgccaccg 
tcttctgcta 



ctacggctga 
aagaaggaaa 
ccggagaagt 
cccacttcac 
cttttgtcct 
cagaattctt 
actccaatat 
ataatatcac 
gttgtggaga 
ataaggtggt 
caaagattat 
gtgccaccgc 
ctactaactc 



aagaaacaac 
tatgtctgaa 
ttatagcgct 
ggttgctgac 
cgacgacggg 
gaatgatgga 
tcaacgtgcc 
ctctgaccaa 
aatggccaag 
ccaatacaac 
cctggaacaa 
ccctatccct 
tctcgaatat 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1221 



<210> 181 
<211> 402 
<212> PRT 
<213> SHRIMP 



<400> 181 



Met 


Ser 


His 


He 


Asn 


Ser 


Thr 


Ser 


Ala 


Ala 


Thr 


Thr 


Ser 


Ser 


Asn 


Thr 


1 








5 










10 










15 




Leu 


Pro 


He 


Cys 


Thr 


Thr 


Thr 


Ala 


Pro 


Met 


He 


Ala 


Ala 


Ala 


Arg 


Ala 








20 










25 










30 






Ala 


Ala 


He 


Ala 


Ser 


Arg 


Thr 


Ser 


Ala 


Ser 


Ala 


Val 


Thr 


Ser 


He 


Asn 






35 










40 










45 








Ser 


Asn 


Ser 


Thr 


Ser 


Ser 


Ser 


Ala 


Met 


Phe 


Arg 


Val 


Pro 


Gin 


Gly 


He 




50 










55 










60 










Ser 


Val 


Thr 


Ala 


Met 


Pro 


Pro 


Val 


Pro 


Ala 


Leu 


Thr 


Ser 


Leu 


Thr 


Glu 


65 










70 










75 










80 


Ser 


Thr 


Gly 


Thr 


Arg 


Met 


Ser 


Ser 


Thr 


Pro 


Asn 


Val 


Asp 


Val 


He 


Pro 










85 










90 










95 




Val 


Pro 


Gly 


Pro 


Lys 


Asn 


Lys 


Ser 


Lys 


Ser 


Lys 


Lys 


Lys 


Asp 


Ser 


Lys 








100 










105 










110 






Arg 


Lys 


Lys 


Asn 


Gin 


Asn 


Gly 


Asn 


Arg 


Ser 


Ser. 


Asp 


Glu 


Asp 


Glu 


Pro 






115 










120 










125 








Ser 


Leu 


Val 


He 


Asp 


Asp 


Gly 


Ser 


Gly 


Arg 


Gin 


Ser 


Lys 


Asn 


Lys 


Lys 




130 










135 










140 










Tyr 


Ser 


Trp 


Val 


Thr 


Ser 


Leu 


Ala 


Thr 


Thr 


Thr 


Ala 


Glu 


Arg 


Asn 


Asn 


145 










150 










155 










160 


Asp 


Thr 


Leu 


Ala 


Pro 


Pro 


Arg 


Pro 


Phe 


Leu 


Pro 


Thr 


Pro 


Glu 


Glu 


Gly 










165 










170 










175 




Asn 


Met 


Ser 


Glu 


lie 


Asp 


Ala 


Gly 


Leu 


Ser 


Asn 


Pro 


Val 


Thr 


Arg 


Gin 








180 










185 










190 






He 


Thr 


Gly 


Glu 


Val 


Tyr 


Ser 


Ala 


Ala 


Leu 


Thr 


Ser 


Gly 


Val 


Gly 


Asp 






195 










200 










205 








Asn 


Gly 


Pro 


Ser 


His 


Phe 


Thr 


Val 


Ala 


Asp 


Thr 


Ser 


Tyr 


Gly 


Asp 


Cys 




210 










215 










220 










Glu 


Thr 


Pro 


He 


Pro 


Gly 


Pro 


Ala 


Phe 


Val 


Leu 


Asp 


Asp 


Gly 


Thr 


Val 


225 










230 










235 










240 


Ser 


Arg 


Gly 


Thr 


Ser 


Leu 


Leu 


His 


Arg 


Glu 


Glu 


Ala 


Glu 


Phe 


Leu 


Asn 










245 










250 










255 




Asp 


Gly 


Ser 


Lys 


Val 


He 


His 


Thr 


Val 


Lys 


Pro 


Arg 


Asn 


Ser 


Lys 


Tyr 








260 










265 










270 






Ser 


Asn 


He 


Gin 


Arg 


Ala 


Ala 


Ser 


Cys 


Met 


Ala 


Tyr 


Ala 


Val 


Asp 


Leu 






275 










280 










285 








Leu 


Asn 


Asn 


His 


Asn 


He 


Thr 


Ser 


Asp 


Gin 


Phe 


Asp 


Phe 


Met 


Ala 


Met 




290 










295 










300 










Thr 


Ala 


Trp 


Ala 


Ala 


Arg 


Gin 


Arg 


Cys 


Gly 


Glu 


Met 


Ala 


Lys 


Phe 


Phe 


305 










310 










315 










320 



WO 01/38351 



268 



PCTAJS00/28888 



Glu 


Lys 


Arg 


Asp 


Lys 
325 


Asp 


lie 


Gly 


Glu 


Tyr 
330 


Arg 


Asn 


Lys 


Val 


Val 
335 


Gin 


Tyr 


Asrt 


Arg 


Gly 
340 


lie 


Phe 


Thr 


Arg 


Thr 
345 


Thr 


Glu 


Met 


Asn 


Lys 
350 


Arg 


Ala 


Lys 


lie 


lie 
355 


Gin 


Gin 


Gin 


Arg 


Arg 
360 


Glu 


Ala 


Ala 


Ala 


Ala 

365 


Ala 


Ala 


Ala 


Thr 


Gly 
370 


Ala 


Thr 


Ala 


Pro 


He 
375 


Pro 


Thr 


Thr 


Ser 


Ala 
380 


Ala 


Gly 


Val 


Gly 


Ala 


Thr 


Ser 


Ser 


Ala 


Thr 


Thr 


Asn 


Ser 


Leu 


Glu 


Tyr 


Gin 


Glu 


He 


Arg 


385 










390 










395 










400 


Tyr 


Gin 































<210> 182 
<211> 1617 
<212> DNA 
<213> SHRIMP 



<400> 182 

atggaagact 

gaaaattatg 

ctaaaggaag 

ggtaacgaaa 

acatccagtc 

aaatggaacg 

gctctagtta 

ctttcaaatg 

gggtgtgggt 

gtgcaacacg 

tacaatggct 

gtaccaatgg 

attggaaatt 

aatcaaaaat 

ctagttagcg 

gcctcagaag 

tccagatatg 

ttgtgcagaa 

gtctcttcgg 

tccataactg 

aaccatttca 

aggttgttga 

aatggaaaag 

ggcaaggaca 

cgagagcgtc 

ggggaagaag 

gaaggcgatt 



ttaaacaatt 
aacgggtact 
gacattttat 
gattgagaca 
tcccaaaaca 
aatcactacc 
atgacaagaa 
cggccgttaa 
tttctcctag 
gtggtacctc 
taagtatcgc 
ataaacttgg 
acatgtcatg 
tagataattt 
aaagaagtgt 
aacaaaggat 
acggcggtgc 
ccaatggaaa 
acgactacag 
gctcagatgc 
ttgcatccgg 
tggagatggt 
tatatcaaga 
taataaaaaa 
gcatgatttt 
aagaagtagc 
atgttgccct 



aaaagtaaaa 
attaacattc 
agttcgtctt 
attaacagga 
aggtagtttc 
aagcccattc 
ctttaaattt 
gaaatgtgat 
aaattgtacc 
ttctcgtgta 
aaaccccata 
attacttttc 
ccttttcaat 
cgaaaaggaa 
tctggaagaa 
ttctcttcga 
ctgtctggtc 
tggttccttt 
aaagagggac 
acctatatgc 
aaagggtaat 
cagagaggtt 
tggtgtagaa 
ggaagaaaca 
caacacagtt 
cacaagcagt 
tttgaacaag 



aatggtattt 
aaatcagtca 
agagacaagg 
gatcctacgc 
ttagaagatg 
caggaaatga 
tcacccctat 
tatatgataa 
cgtactctta 
tcagtgtatg 
gttggcagtt 
tacaacagca 
gctgcagttg 
atcagat ttg 
aaacttaaag 
gatgtgcata 
tttgcctttt 
tactctgcca 
gtggatgaac 
atcagagata 
gaaatatcat 
actggatcag 
ataaaagtga 
t taccagagg 
aaggcaattg 
ggaggaacaa 
gcatgcaaag 



gtttgtctgg 
agagtgtcag 
aagtactcca 
ttcagattgg 
aagaccctaa 
acaaaattgt 
acagaatcat 
tcacaacaga 
agaatatgga 
atatctgtga 
tttcaaatat 
cacacccgtc 
tatacacgct 
caaaaaatga 
aatccaaaaa 
aaaagtcctc 
ctgaccgaga 
cagaagaagg 
gtaggcccag 
gtatacgaaa 
tcatcgatcc 
acatcaaaat 
ttgacccctc 
aggaaaggaa 
agacgtacaa 
agagaaagag 
aaattaaagt 



ggaaaatacc 
gagaagtgag 
catcaagaac 
actaaaatac 
ttatggaaaa 
ggaagaaaag 
acatgaacgt 
cttcttagta 
acaagtgtta 
taggttaacg 
gtgcctaatt 
agctaaaagc 
agaaaagagt 
agtcaacctt 
gctatatgct 
aattgcatca 
tttctccttg 
aatcagatac 
attggtcatg 
ccattttaat 
tccgaatgaa 
cttcatggat 
ttctaaagaa 
gcgtctgcgc 
cgaggaacgt 
ggaggagaaa 
ttgttga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1617 



<210> 183 
<211> 534 
<212> PRT 
<213> SHRIMP 



<400> 183 

Met Glu Asp Phe Lys Gin Leu Lys Val Lys Asn Gly He Cys Leu Ser 

15 10 15 

Gly Glu Asn Thr Glu Asn Tyr Glu Arg Val Leu Leu Thr Phe Lys Ser 

20 25 30 

Val Lys Ser Val Arg Arg Ser Glu Leu Lys Glu Gly His Phe He Val 

35 40 45 

Arg Leu Arg Asp Lys Glu Val Leu His He Lys Asn Gly Asn Glu Arg 
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50 










55 




Leu 


Arg 


Gin 


Lieu 


Thr 


Gly 


Asp 


Pro 


65 










70 






Thr 


Ser 


Sex 


Leu 


Pro 


Lys 


Gin 


Gly 










85 








Asn 


Tvr 
x yi 


Gly 


Lys 


Lys 


Trp 


Asn 


Glu 








100 










Met 


Asn 




Tift 
X JLKS 


Val 


Glu 


Glu 


x*y & 






IIS 

X X %J 










i ?n 

X£r\J 


Ly s 




Oft 


Pro 

JTX Is 


Leu 


Tyr 


Arg 


lie 




130 










135 




Ala 


vox 


Lys 


Lys 


Cys 


Asp 


Tyr 


i IC L. 


145 










150 






Gly 




oiy 


pKp 


Ser 


Pro Arg 


Asn 










165 








m ii 


Gin 


Wa 1 
vdX 


Xieu 


Val 


Gin 


His 


uiy 








1 fin 
x o u 










Tvr* 


Asp 


He 




Asp Arg 


Leu 


X I1X 






195 










200 


c x. v 


Tip 


V d X 


uiy 


Ser 


Phe 


Ser 


Asn 




210 










215 






L€U 


oiy 


Leu 


Leu 


Phe 


Tyr 


Asn 


225 










230 






Tip 

lie 


f^l \/ 
uiy 


Asn 


Tyr 


Met 


Ser 


Cys 


Leu 










245 








L6U 




Lys 


Ser 


Asn 


Gin 


Lys 


Leu 








260 












Ala 


Lys 


Asn 


Glu 


Val 


Asn 


Leu 


















Lys 


Leu 


Lys 


Glu 


Ser 


Lys 


Lys 


Leu 














295 




X 1c 




Leu 


Arg Asp 


Val 


His 


Lys 












310 






Tyr 


Asp 


oxy 


Gly Ala 


Cys 


Leu 


vax 










325 








Ser 


Leu 


Leu 


Cys 


Arg 


Thr 


Asn 


vjxy 








340 












VJ± li 


vjxy 


lie 


Arg 


Tyr 


Val 


Cor 

del 




















Asp 




Arg 


Arg 


Pro 


Arg 


Leu 




O * U 










375 




nid 


ri u 


Tip 

X ic 


Cys 


He 


Arg 


Asp 


Oft 
OCX 


JO J 










390 






Xr IIS 




MX el 


Ser 


Gly 


Lys 


Gly 


Asn 










405 








Asn 




Arg 


Leu 


Leu 


Met 


Glu 










420 










-L X fcr 


Lys 


Tic 
X lc 


Phe 


Met 


Asp 


Asn 


bi y 
















a a n 

H H \J 


x xe 


Lys 


vai 


He 


Asp 


Pro 


Ser 


Ser 




450 










455 




Lys 


Glu 


Glu 


Thr 


Leu 


Pro 


Glu 


Glu 


465 










470 






Arg 


Arg 


Met 


He 


Phe 


Asn 


Thr 


Val 










485 








Glu 


Arg 


Gly 


Glu 


Glu 


Glu 


Glu 


Val 








500 










Arg 


Lys 


Arg 


Glu 


Glu 


Lys 


Glu 


Gly 






515 










520 


Lys 


Glu 


lie 


Lys 


Val 


Cys 








530 















60 



Thr 


Leu 


Gin 


lie 


Glv 


Leu 


Lys 


Tvr 






75 










80 


Ser 


Phe 


Leo 


Glu 


Asp 


Glu 


Asp 


Pro 


















Oft*- 
OCi. 


Leu 


Pro 


Ser 


Pro 


Php 


Gin 


uxu 


X V^/ 










X X V 






Ala 


Leu 


vax 


Asn 


Asp 


T.v/e 
xjy a 


Asn 


Pha 

rue 










X £. *J 








Tip 

X Xt3 


fix 0 


vjIU 


Arg 


Leu 


Ser 


Asn 


nla 








1 4 n 

X H \) 










lie 




Thr 

x nr 


Th r- 


Asp 


xr ne 


Leu 


vax 
















IOU 


vya 


Thr 

X ilX 


Arg 


Thr 


i^eu 


Lys 


Asn 






T *7n 
X / U 










1 "7 ^ 
X * 0 




uiy 


i nr 


Oft -r- 


Oft 

oer 


Arg 


vax 


C ft.T- 


vax 


xoo 










i on 






Tyr 


Asn 


oiy 


Leu 


otrx 


Tift 

X xe 


Jkla 
Ala 


Asn 










?n*% 








nc 1, 


Cys 


Leu 


Tift 
IXC 


VOX 


Pro 


Mfikt- 

irJe U 


Asp 


















Oft 


1 nr 


MX 5 


rto 


Oft r 

oex 


ax a 


Lys 


Oft y~ 

oer 
















n 


rile 


Asn 


J-\.-L d 


nio 


Val 


vax 


Tyr 


1 nr 


















Asp 


Asn 


Pha 


OX LI 


juys 


IjIU 


116 


Arg 


ZOO 










^ / u 






Leu 


vax 


Oft ^ 


Ser 


vax 


Leu 


V7lU 


bill 










/OD 








Tyr 


Ala 




0 ft 
oer 


laiU 


bill 


Gin 


Arg; 








-3 r\ r\ 










Lys 


Ser 


Ser 


Tift 

lie 


Ala 


Oft. V- 

ber 


Ser 


Arg 






■51 e 










o^U 


irne 


Aid 


rfie 




Asp 


Arg 


Asp 


r»> ft 




jJU 














Asn 


fix,/ 

(jxy 


Car- 

oer 


rne 


Tyr 


Oft y~ 

oer 


nla 


inr 


"3 y| c: 
















oci 


Asp 


Asp 


Tyr 


Arg 


Lys 


Arg 


Asp 


















vox 


Mot* 


Oft y* 


Tift 
IXC 


Thr 
1 ni 


vsxy 




Asp 








ion 










Tip 

x xe 


Arg 


Asn 


nis 


xrne 


Asn 


Asn 


rixs 






one 










4 ft A 




Tie* 


Q ft 


pho 

rile 


Tift 

x xe 


Asp 


Pro 


p rA 




.din 










iij 




Val 


A r*n 


Glu 


Val 


Thr 


Gly 


Ser 


Asp 


425 










430 






Lys 


Val 


Tyr 


Gin 


Asp 


Gly 


Val 


Glu 










445 








Lys 


Glu 


Gly 


Lys 


Asp 


He 


He 


Lys 








460 










Glu 


Arg 


Lys 


Arg 


Leu 


Arg 


Arg 


Glu 






475 










480 


Lys 


Ala 


He 


Glu 


Thr 


Tyr 


Asn 


Glu 




490 










4 95 




Ala 


Thr 


Ser 


Ser 


Gly 


Gly 


Thr 


Lys 


505 










510 






Asp 


Tyr 


Val 


Leu 


Asn 


Lys 


Ala 


Cys 



525 
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<210> 184 
<211> 1386 
<212> DNA 
<213> SHRIMP 



<400> 184 

atggactcat 

cttccgccca 

atggatatcg 

cccgacggct 

agtgtaccta 

aaccatagtg 

ttcttaagat 

gagagaacga 

cgtggaactg 

agaaaccaac 

gaaggtttta 

aacgaaactg 

aatatagatg 

atggactctt 

gattcatctg 

ataagacgtc 

ggaagaaatt 

aaaagagagg 

caagtgccca 

caccaccacc 

tccaacaatt 

cgtgatcgca 

tcaaggtata 

tattag 



ctgcatctgt 
gacgtgccac 
aacctatggg 
gttatggaca 
ctggaacgat 
cagaaaaaag 
attgtgatgt 
ttattgactc 
gtggatttgg 
aagaagagaa 
ttgacattcc 
gtgtgacata 
tcgtcagaga 
ctgactcatc 
actctgaatc 
atcagtatca 
ctgagaaaat 
acgaacccag 
aggactatta 
gccaccacca 
cttctgacgt 
actctcgatt 
caactttcga 



cgtgtttatg 
gcccggttct 
actggccaag 
aattgtgtca 
tgatgtggat 
tgtgccaatc 
cgaggaagaa 
tagttctaaa 
atctacagat 
caaaaaggaa 
ttttcttgaa 
ccctaatacg 
attggaagct 
cgattcttct 
atctgatgat 
ccggcgccag 
ggagatggac 
atacgaagaa 
ttgtgagcaa 
acaccgccac 
tactgcttac 
ctcagaaaga 
cccttataga 



agattcgccc 
gtcgcctacg 
atctctactg 
cgttctggga 
tataggggag 
agaaagggaa 
cagattgtgt 
aaggacaaca 
aacccaaatt 
gaattggaag 
ggtttcgaaa 
aatcaagatg 
gaatttagta 
tcttcttcct 
tcagaaggag 
ttgagttatt 
agagtaactc 
agagaaagat 
tacgaacact 
caacaccaga 
gtcaatgaaa 
cccaacaatg 
tatggcgcaa 



ctcccgggga 
acctatttcc 
gatatggaat 
tgacatggaa 
aattgaaagt 
ccagcattgc 
atattaatga 
aaaatcaagc 
ttactgaaac 
aaggggagat 
atatcctcgc 
tggaagaaaa 
gtggaattgg 
ctgactcatc 
gggataataa 
cggatgacgt 
acataaaaac 
atattcatcc 
acgacgcccc 
ggcactttaa 
attcccccac 
gcggttataa 
gaagagggcg 



ggaaactgca 


60 


ctctgaagaa 


120 


agacaagttt 


180 


gaacaacact 


240 




300 


ccagttgatt 


360 


aaccacggga 


420 


aagaagcgtg 


480 


caccgtctca 


540 


cgtagaaatg 


600 


agaacaaagc 


660 


agatactaaa 


720 


gagtggctcc 


780 


cgattcgtct 


840 


ggtccgaaga 


900 


caatggaggg 


960 


tgaacacata 


1020 


aagaagaatg 


1080 


tgctgctgct 


1140 


ccaaccccgc 


1200 


gaggccatgc 


1260 


ccggatcaac 


1320 


tggaggagta 


1380 




1386 



<210> 185 
<211> 457 
<212> PRT 
<213> SHRIMP 



<400> 185 



Met 


Asp 


Ser 


Ser 


Ala 


Ser 


Val 


Val 


Phe 


Met 


Arg 


Phe 


Ala 


Pro 


Pro 


Gly 


1 








5 










10 










15 




Glu 


Glu 


Thr 


Ala 
20 


Leu 


Pro 


Pro 


Arg 


Arg 
25 


Ala 


Thr 


Pro 


Gly 


Ser 
30 


Val 


Ala 


Tyr 


Asp 


Leu 
35 


Phe 


Pro 


Ser 


Glu 


Glu 
40 


Met 


Asp 


He 


Glu 


Pro 
45 


Met 


Gly 


Leu 


Ala 


Lys 
50 


He 


Ser 


Thr 


Gly 


Tyr 
55 


Gly 


He 


Asp 


Lys 


Phe 
60 


Pro 


Asp 


Gly 


Cys 


Tyr 


Gly 


Gin 


He 


Val 


Ser 


Arg 


Ser 


Gly 


Met 


Thr 


Trp 


Lys 


Asn 


Asn 


Thr 


65 










70 










75 










80 


Ser 


Val 


Pro 


Thr 


Gly 
85 


Thr 


He 


Asp 


Val 


Asp 
90 


Tyr 


Arg 


Gly 


Glu 


Leu 
95 


Lys 


Val 


He 


Leu 


Arg 
100 


Asn 


His 


Ser 


Ala 


Glu 
105 


Lys 


Ser 


Val 


Pro 


He 
110 


Arg 


Lys 


Gly 


Thr 


Ser 
115 


He 


Ala 


Gin 


Leu 


He 
120 


Phe 


Leu 


Arg 


Tyr 


Cys 
125 


Asp 


Val 


Glu 


Glu 


Glu 
130 


Gin 


He 


Val 


Tyr 


He 
135 


Asn 


Glu 


Thr 


Thr 


Gly 
140 


Glu 


Arg 


Thr 


He 


He 


Asp 


Ser 


Ser 


Ser 


Lys 


Lys 


Asp 


Asn 


Lys 


Asn 


Gin 


Ala 


Arg 


Ser 


Val 


145 










150 










155 










160 


Arg 


Gly 


Thr 


Gly 


Gly 
165 


Phe 


Gly 


Ser 


Thr 


Asp 
170 


Asn 


Pro 


Asn 


Phe 


Thr 

175 


Glu 
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Thr 


Thr 


Val 


Ser 








180 


Glu 


Glu 


Glv 


Glu 






195 




Leu 


Glu 


Glv 


Phe 




210 






Val 


Thr 


Tvr 


Pro 


225 








Asn 


He 


Asp 


Val 


Gly 


Ser 


Gly 


Ser 








260 


Ser 


Ser 


Asp 


Ser 






275 




Asp 


Asp 


Ser 


Glu 




290 






Gin 


Tyr 


His 


Ara 


305 








Gly Arg 


Asn 


Ser 


Thr 


Glu 


His 


He 








340 


Arg 


Tyr 


He 


His 






355 




Glu 


Gin 


1 Y r 


yJ -L U 




370 






His 


His 


Gin 


His 


385 








Ser 


Asn 


Asn 


Ser 


Trp 


Cys 


Arg 


Asp 








420 


Asn 


Arg 


He 


Asn 






435 




Ala 


Arg 


Arg 


Gly 




450 







Arg 


Asn 


Gin 


Gin 


He 


Val 


Glu 


Met 








200 


Glu 


Asn 


He 


Leu 






215 




Asn 


Thr 


Asn 


Gin 




230 






Val 


Ara 


Glu 


Leu 


245 








Met 


Asp 


Ser 


Ser 


Ser 


Asp 


Ser 


Ser 








280 


Glv 


Glv 


Asp 


Asn 






295 




Arg 


Gin 


Leu 


Ser 




310 






Glu 


Lys 


Met 


Glu 


325 








Lys 


Arg 


Glu 


Asp 


Pro 


Arg 


Arg 


Met 








360 


His 


Tyr 


Asp 


Ala 






375 




Arg 


His 


Gin 


His 




390 






Ser 


Asp Val 


Thr 


405 








Arg 


Asn 


Ser 


Arg 


Ser 


Arg 


Tyr 


Thr 








440 


Arg 


Gly Gly 


Val 






455 





Glu 


Glu 


Asn 


Lvs 


185 








Glu 


Gly 


Phe 


He 


Ala 


Glu 


Gin 


Ser 








220 


ASD 


Val 


Glu 


Glu 






235 




Glu 


Ala 


Glu 


Phe 




250 






Asp 


Ser 


Ser 


Asp 


265 








Asp 


Ser 


Ser 


Asp 


Lys 


Val 


Arg 


Arg 








300 


Tvr 


Ser 


Asp Asp 






315 




Met 


Asp 


Arg 


Val 




330 






Glu 


Pro 


Arg 


Tyr 


345 








Gin 


val 


Pro 


Lys 


Pro 


Ala 


Ala 


Ala 








380 


Gin 


Arg 


His 


Phe 






395 




Ala 


Tyr 


Val 


Asn 




410 






Phe 


Ser 


Pro 


Asn 


425 








Thr 


Phe 


Asp 


Pro 



Tyr 



Lys Glu Glu Leu 
190 

Asp lie Pro Phe 
205 

Asn Glu Thr Gly 

Lys Asp Thr Lys 
240 

Ser Ser Gly He 
255 

Ser Ser Ser Ser 
270 

Ser Glu Ser Ser 
285 

He Arg Arg His 

Val Asn Gly Gly 
320 

Thr His He Lys 
335 

Glu Glu Arg Glu 
350 

Asp Tyr Tyr Cys 
365 

His His His Arg 

Asn Gin Pro Arg 
400 

Glu Asn Ser Pro 
415 

Asn Gly Gly Tyr 
430 

Tyr Arg Tyr Gly 
445 



<210> 186 
<211> 1014 
<212> DNA 
<213> SHRIMP 



<400> 186 

atgtcctctt 

tgcacatcct 

gaacttggaa 

ttaaacaaaa 

ttctttcttc 

aaactggcca 

tccagaatga 

cgcaatatcc 

agagaacaga 

gaaacccctt 

ttaaagtatg 

atgcctactg 

aatcctagta 

gataaattat 

caacattgta 

agctctgatt 

tatttggtac 



ctcaaggttt 
cttccttgtc 
tacctcttgt 
gggacacaat 
gttttggaaa 
atgctatcgc 
tgacgactgc 
aaaaatttga 
taggtttaag 
tctttgcacc 
attctgagtc 
aggatgtaaa 
tgaggttagt 
ccaaggaaca 
tcattggagg 
gtggatttaa 
ttttattttt 



gaataataat 
tttagaggag 
tgaaggaaag 
acgtgtaatt 
gaagaagact 
cctatccatg 
aaagaagatc 
agatgtacat 
tgtattaggt 
ctttaataaa 
tgtttctatt 
gacatcaaga 
gtttcgtccc 
agactcggct 
gagaaatgta 
aaagatgtta 
tcatataatc 



atgtgcacca 
agtgtggatt 
gaagtactac 
ggtgacgagc 
tttaatccac 
ggtgttgtcc 
cctgttccag 
tatatagata 
aaaaatgatg 
tctaaaatag 
ataagagatg 
agttgtataa 
atgtactgga 
ttgattgaaa 
ttattgtatt 
actaatgatg 
tgtcctatta 



cagaaatctt 
atttagaaaa 
tggaatttgc 
aaggagacgt 
aaacaaaaat 
cagaacctgc 
aatcatacaa 
tcaattttca 
tccaaaagaa 
gaggtgaatg 
tgtttaattt 
acccttccca 
gaaattctaa 
agtatatggg 
gtataactgc 
aaataaaaca 
tacaatccaa 



gctgcccaaa 
ggattttgaa 
ctacaaaata 
atgtagcgtc 
gtggctagtg 
ctgcacgtgt 
aaatgttaat 
gtcctttgta 
gaagaaggaa 
catagaagat 
attgggtgaa 
caatgatacg 
gctggtcatg 
aggagaacat 
actatgtttt 
attgatatgg 
atga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1014 
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<210> 187 




























<211> 335 




























<212> PRT 




























<213> SHRIMP 


























<400> 187 




























Met 


Ser 


Ser 


Ser 


Gin 


Gin 


Asn 


Asn 


Met 


Cys 


Thr 


Thr 


Glu 


lie 


lieu 


Leu 


1 








5 










1 U 










lo 




Pro 


Lys 


Cys 


Thr 


Ser 


Ser 


Ser 


Leu 


Ser 


Leu 


GlU 


Glu 


Ser 


val 


Asp 


Tyr 








20 










25 










iu 






Leu 


Glu 


Lys 


Asp 


Phe 


Glu 


Glu 


Leu 


Gly 


i le 


Pro 


T a** 

Leu 


val 


GlU 


Gly 


Lys 






35 










40 










45 








Glu 


Val 


Leu 


Leu 


Glu 


Phe 


Ala 


Tyr 


Lys 


lie 


Leu 


Asn 


Lys 


Arg 


Asp 


Thr 




50 










55 










60 










lie 


Arg 


Val 


He 


Gly 


Asp 


Glu 


Gin 


Gly 


Asp 


val 


Cys 


Ser 


val 


rne 


Pne 


65 










70 










75 










80 


Leu 


Arg 


Phe 


Gly 


Lys 


Lys 


Lys 


Thr 


Pne 


Asn 


Pro 


Gin 


Thr 


Lys 


Met 


Trp 










85 










90 










95 




Leu 


Val 


Lys 


Leu 


Ala 


Asn 


Ala 


1 le 


Ala 


Leu 


Ser 


Met 


Gly 


val 


val 


Pro 








100 










105 










110 






Glu 


Pro 


Ala 


Cys 


Thr 


Cys 


Ser 


Arg 


Met 


Met 


Thr 


Thr 


Ala 


Lys 


Lys 


Tl A 

lie 






115 










120 










IOC 








Pro 


Val 


Pro 


Glu 


Ser 


Tyr 


Lys 


Asn 


val 


Asn 


Arg 


Asn 


Tim 

lie 


Gin 


Lys 


pne 




130 










135 










140 










Glu 


Asp 


Val 


His 


Tyr 


He 


Asp 


He 


Asn 


Phe 


Gin 


Ser 


Phe 


IT- «| 

Val 


Arg 


GlU 


145 










150 










155 










160 


Gin 


He 


Gly 


Leu 


Ser 


Val 


Leu 


Gly 


Lys 


Asn 


Asp 


Val 


Gin 


Lys 


Lys 


Lys 










165 










170 










175 




Lys 


Glu 


Glu 


Thr 


Pro 


Phe 


Phe 


Ala 


Pro 


Phe 


Asn 


Lys 


Ser 


Lys 


He 


Gly 








180 










185 










190 






Gly 


Glu 


Cys 


lie 


Glu 


Asp 


Leu 


Lys 


Tyr 


Asp 


Ser 


Glu 


Ser 


Val 


Ser 


lie 






195 










200 










205 








He 


Arg 


Asp 


Val 


Phe 


Asn 


Leu 


Leu 


Gly 


Glu 


Met 


Pro 


Thr 


Glu 


Asp 


Val 




210 










215 










220 










Lys 


Thr 


Ser 


Arg 


Ser 


Cys 


He 


Asn 


Pro 


Ser 


His 


Asn 


Asp 


Thr 


Asn 


Pro 


225 










230 










235 










240 


Ser 


Met 


Arg 


Leu 


Val 


Phe 


Arg 


Pro 


Met 


Xyr 


Trp 


Arg 


Asn 


Ser 


Lys 


Leu 










245 










250 










255 




Val 


Met 


Asp 


Lys 


Leu 


Ser 


Lys 


Glu 


Gin 


Asp 


Ser 


Ala 


Leu 


He 


Glu 


Lys 








260 










265 










270 






Tyr 


Met 


Gly 


Gly 


Glu 


His 


Gin 


His 


Cys 


He 


He 


Gly 


Gly 


Arg 


Asn 


Val 






275 










280 










285 








Leu 


Leu 


Tyr 


Cys 


He 


Thr 


Ala 


Leu 


Cys 


Phe 


Ser 


Ser 


Asp 


Cys 


Gly 


Phe 




290 










295 










300 










Lys 


Lys 


Met 


Leu 


Thr 


Asn 


Asp 


Glu 


He 


Lys 


Gin 


Leu 


He 


Trp 


Tyr 


Leu 


305 










310 










315 










320 


Val 


Leu 


Leu 


Phe 


Phe 


His 


He 


He 


Cys 


Pro 


He 


He 


Gin 


Ser 


Lys 












325 










330 










335 





<210> 188 
<211> 3627 
<212> DNA 
<213> SHRIMP 

<400> 188 

atgggtgtcc cagaagccaa aaaagtgtac 
gtcatcaagg agaaaactgg atatgaagat 
tgttcaggag aagaagattg cacaactagc 
atcaactcca agaacttcct ggattttgga 



gaaaatgcgt acggtgcaca gaatgggaga 60 

tgttacgatg acgaagacga cgaggattat 120 

tcacttctca aagccacttc tcttgccaac 180 

agaggcaaga aatcttcctc ttcttcacct 240 
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acctgtgact 
atgttgggca 
atcctgttta 
gttatcaggg 
aatttttact 
cacgagaaaa 
gaagttgatt 
aatcacaaaa 
atgaagaaga 
ttcaattctt 
cagatggagg 
ctcaagtcta 
gccatcaaga 
gcaaaatcaa 
tgcgagcgga 
atgaggatcc 
ttcaccgaag 
cctgggaaac 
atgtctggta 
cacaccaacg 
ctggtgaaat 
caatttaagg 
aacagtaatt 
agaacaaagc 
agtacactat 
tgtgagtacg 
tgtgtgggcg 
tgtgatgata 
tgtgataaga 
atagattttg 
gaaactgacg 
aatgtcatgt 
accttatcag 
gggtcggatg 
agtggtaatg 
agtgcacctg 
gagagccaca 
gatgtttttg 
ggaggaggaa 
actgaacaaa 
gatgatgtag 
gatctagaaa 
gactttttag 
aatatagaag 
cccgtgatgg 
cttgacaata 
gacgccaata 
ggcatcgacg 
gttatcgtgt 
atgcccataa 
agaactaatt 
aaaaagactc 
ttggacgatt 
actctcaata 
caattcatgc 
tcttcattct 
cccaattctc 

<210> 189 
<211> 1204 
<212> PRT 



acacactcga 
ggcaaattgc 
tcaatactac 
aagaagatac 
gcagggatga 
ttaaggctag 
ggctaactga 
tggacgacta 
ggaagttttt 
ccccgacagc 
gggtcatctt 
aaaaccttaa 
tgatgacgag 
ggaatgaaaa 
tgatgatgcg 
ctgttctaaa 
cagaggatga 
acttctccct 
acagaaaaga 
atgaaaatca 
ggaacctgat 
actctgaaac 
cttcgggtga 
caaatacgat 
tgattatgct 
acacatgggt 
gatgggagat 
gtgttggtaa 
aactagtact 
tacgagaccc 
ttagtttgaa 
catatttgac 
atatcagtat 
tagaatctga 
accctcaaaa 
ttactaaaaa 
ctgaaggtgg 
gtaataattt 
tagtatcgtc 
ataacagcac 
aggaggagga 
ctgacgactg 
aagctgtaac 
aggatgaaca 
atgaagaaaa 
taacgtacaa 
agagagaaga 
atgaaagatt 
atatgaacat 
tacatagtga 
ttatgtctgc 
ccagaatcaa 
ttaaagacaa 
aagagggaaa 
cagtgctcgg 
acctccagca 
ctcgtcctga 



catggttga t 
aaccaccatg 
aaatcaacag 
tatgcattca 
aaagaaggtt 
aattgaaaga 
acctccaatc 
tgagggtctc 
gactaacagt 
agttttgaaa 
gtactacgct 
agggaggtat 
agccaaacga 
ttgcctcaat 
tcttcctcaa 
gactacgggg 
tatcgtacgt 
actagaaaag 
atccaagagg 
aacactttct 
ggttggacat 
gttcttaaag 
aaatcaagca 
atataatttg 
aactaaagga 
aaggaaagaa 
ttttttgagt 
tatttctatg 
tcctttctgg 
gcttcagttg 
acacgcactt 
aatggggtcc 
acatctttcc 
tactgaagat 
atctacccgt 
attatcatcg 
tataattaat 
agatacgaat 
aataccaggt 
gagtgacttt 
ggaaaaaatg 
gttagatgag 
tgcagccacg 
ttcaagatgt 
gtgggcagaa 
tgacaatagt 
cgggaacaat 
ctctggtgtt 
gtgcggtcta 
attttgtggt 
tgcactagaa 
gcaaataaaa 
ggtatcgaca 
aataccgagt 
cgtggccata 
tcagcagcgg 
acagtaa 



ttgcccactt 
ttgaagggac 
ataatagacg 
cggatgcaga 
atttctgaat 
gtgcctatta 
gaagacatga 
gattactgga 
ttcttgttca 
tccagattca 
tttaggatga 
actgtactct 
cagatcagac 
aggaagacca 
ggtctgatgg 
agcaaatgca 
ttaacatcat 
aggaagatat 
tgggaagatt 
atgaatcttt 
cataaaaacg 
attggccatg 
aacgagttgg 
gcagaatcgc 
tcagactaca 
gttgcagtat 
gaacaagaag 
gggaatttgt 
acaatcaaat 
tgttttcctc 
catagagccc 
tttaa tcaac 
ggtgctaata 
cttatcccat 
aagaaaagta 
tcagttttcg 
gataggggaa 
ccagaagcat 
ctatctactg 
ttagatttct 
gaggaaggtg 
gcaaggaaag 
aacgaaa tga 
tctgtatcat 
attgttaacg 
ttattatcac 
agtaacgtgg 
ccttatagtg 
gaggataata 
aaaacagaag 
tacacaatgt 
cgcaagaact 
tgcacagaca 
gaaaacacta 
tctaagccat 
caagacgtat 



ataatgtatc 
agaaaaatat 
ttttacatga 
acaagaaaca 
tcttttctag 
taatcccttc 
tgatggctcc 
tcaacaaaca 
ggaacgtgcc 
aagatgcatt 
tccgagtgat 
ttacggatgg 
aagaaagaag 
acgatctact 
catctgcatt 
tgtacctttc 
gccttttgaa 
tggagtacga 
tattgaatgt 
tctcccacga 
tgtgtcgtct 
tcaagttctt 
gtggttttgc 
cgctcatgct 
atagtgcaat 
ttgaaaacac 
ctaggaagaa 
ctaaaagtaa 
ttttctatct 
caacacactt 
tggctgtaaa 
gtgtttttgg 
ataatgagag 
tttctaataa 
aggttaatgc 
aatcaattcg 
tattgactaa 
tgggagaaga 
agcagacgtc 
ttaaaaaatt 
aaaaagagga 
cgttcgagta 
cgtcatcgct 
caaaacttaa 
aattcgacaa 
gtctaagtgg 
tgctctatga 
taaaaaccat 
ctattgtgta 
aggataaaat 
tgcagtatat 
gggagaggat 
actataataa 
aatggctacc 
ggtcgcccct 
cactaactaa 



agatctggtc 
gggtcagatg 
cggttttaat 
tatttatgaa 
aaaatataaa 
atcccaagag 
tccagtttct 
cacagatgtt 
tactacttca 
ttttgctagc 
gaaaaatctc 
aaaggctcca 
taaggaaaag 
attctactcg 
actggacata 
taatgcatca 
tctggagact 
ttcttataat 
tttgaagcaa 
ttctgatgtg 
tactggtaca 
taggtgcatg 
agctaaaaga 
ttcacctgaa 
tgttagtaac 
gtactgtact 
taataaagac 
ttgtagaaaa 
gtctcaagct 
gattgatttg 
tgcagcggca 
tacaataaca 
taaaaataca 
taaacgtaaa 
aaccagaaaa 
tggatttttt 
agaaaggatt 
aaatggagga 
aatcttaaaa 
caatgaaatg 
agaggaggca 
taaagattct 
cgctaaaaat 
caataaacaa 
atgcatttct 
agttttaatg 
accagttcaa 
gaacctttta 
ccagcaatta 
atgcacagat 
gcctgaacta 
tccaaaagta 
acttttggca 
ctctcagggg 
cacgctgtgg 
cataactcct 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3627 
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<213> SHRIMP 



<400> 189 



171 tr l_ 


\j J- 


vol 


Pro 


1 








bin 


Asn 


(jiy 


Arg 








on 


Asp 


Asp 


QjIU 


Asp 










Thr 


Ser 


Ser 


Leu 




50 






Asn 


rne 


Leu 


Asp 


DO 








inr 


Cys 


Asp 


Tyr 


oer 


Asp 


Leu 


vai 








1UU 


fsi w 
oiy 


OJ.fl 


Lys 


Asn 






llo 




Gin 


Gin 


lie 


He 




130 






Glu 


Asp 


Thr 


Met 


IXC 








Asn 


rne 


Tyr 


Cys 


Arg 


Lys 


Tyr 


Lys 








loO 


lie 


lie 


lie 


Pro 






195 




Pro 


lie 


Glu 


Asp 




210 






Asp 


Asp 


Tyr 


GlU 


225 








Met 


Lys 


Lys 


Arg 


Pro 


Thr 


Thr 


Ser 








260 


Phe 


Lys 


Asp 


Ala 






275 




Tyr 


Ala 


Phe 


Arg 




290 






Asn 


Leu 


Lys 


Gly 


305 








A13 


lie 


Lys 


net 


Ser 


Lys 


GlU 


Lys 








340 


Thr 


Asn 


Asp 


Leu 






355 




Pro 


Gin 


Gly 


Leu 




370 






Val 


Leu 


Lys 


Thr 


385 








Phe 


Thr 


Glu 


Ala 


Asn 


Leu 


Glu 


Thr 








420 


lie 


Tyr 


Asp 


Ser 






435 




Trp 


Glu 


Asp 


Leu 




450 







Glu Ala Lys Lys 
5 

Val He Lys Glu 

Asp Glu Asp Tyr 
40 

Leu Lys Ala Thr 
55 

Phe Gly Arg Gly 
70 

Thr Leu Asp Met 
85 

Met Leu Gly Arg 

Met Gly Gin Met 
120 

Asp Val Leu His 
135 

His Ser Arg Met 
150 

Arg Asp Glu Lys 
165 

His Glu Lys He 

Ser Ser Gin Glu 
200 

Met Met Met Ala 
215 

Gly Leu Asp Tyr 
230 

Lys Phe Leu Thr 
245 

Phe Asn Ser Ser 

Phe Phe Ala Ser 
280 

Met He Arg Val 
295 

Arg Tyr Thr Val 
310 

Met Thr Arg Ala 
325 

Ala Lys Ser Arg 

Leu Phe Tyr Ser 
360 

Met Ala Ser Ala 
375 

Thr Gly Ser Lys 
390 

Glu Asp Asp He 
405 

Pro Gly Lys His 

Tyr Asn Met Ser 
440 

Leu Asn Val Leu 
455 



Val 


Tyr 


Glu 


Asn 




10 






Lys 


Thr 




Tyr 










Cys 


OCI 


fl^ \t 
uiy 


OX Li 


O A V 


Leu 


Ala 


Asn 








bU 


Lys 


T lie 

Lys 


Coy* 


oer 






"7 C 

1 D 




Val 


Asp 


Leu 


Pro 




90 






Gin 


lie 


Ala 


Thr 


1 \JO 








i xe 


Leu 


irne 


lie 


Asp 


Gly 


Phe 


Asn 








1 A A 

140 


Gin 


Asn 


Lys 


Lys 










Lys 


Val 


He 


Ser 




170 






Lys 


Ala 


Arg 


T "I „ 

lie 










Glu 


Val 


Asp 


Trp 


Pro 


pro 


val 


ser 








220 


Trp 


He 


Asn 


Lys 






235 




Asn 


Ser 


Phe 


Leu 




250 






Pro 


Thr 


Ala 


Val 


265 








Gin 


Met 


GlU 


Gly 


Met 


Lys 


Asn 


Leu 








JU0 


Leu 


Phe 


Thr 


Asp 










Lys 


Arg 


Gin 


He 




330 






Asn 


Glu 


Asn 


Cys 


345 








Cys 


Glu 


Arg 


Met 


Leu 


Leu 


Asp 


He 








380 


Cys 


Met 


Tyr 


Leu 






395 




Val 


Arg 


Leu 


Thr 




410 






Phe 


Ser 


Leu 


Leu 


425 








Gly 


Asn 


Arg 


Lys 


Lys 


Gin 


His 


Thr 








460 





Tyr 


Gly 


nla 






15 




Glu 


Asp Cys 


Tyr 










(Z 1 M 


Asp 


Cys 


i nr 


*» -J 








i ie 


Asn 


oer 


Lys 


oer 


ber 


Ser 


fro 








on 

ou 


inr 


Tyr 


Asn 


vai 






Q <i 
3D 




Thr 


Met 


Leu 


Lys 




1 1U 






Asn 


Tnr 


inr 


Asn 










vai 


He 


Arg 


V?1U 


nis 


He 


Tyr 


blu 








1 oil 


GlU 


Phe 


Phe 


Ser 










Glu 


Arg 


val 


Pro 
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Leu 


Thr 


Glu 


Pro 


o c 








Asn 


His 


Lys 


Met 


His 


Thr 


Asp 


Val 








24 0 


Phe 


Arg 


Asn 


Val 






255 




Leu 


Lys 


Ser 


Arg 




^ /o 






vai 


He 


Leu 


Tyr 


Zoo 








Leu 


Lys 


Ser 


Lys 


vji y 


Lys 


Ala 


Pro 








320 


Arg 


Gin 


Glu 


Arg 






335 




Leu 


Asn 


Arg 


Lys 




350 






Met 


Met 


Arg 


Leu 


365 








Met 


Arg 


He 


Pro 


Ser 


Asn 


Ala 


Ser 








400 


Ser 


Cys 


Leu 


Leu 






415 




Glu 


Lys 


Arg 


Lys 




430 






Glu 


Ser 


Lys 


Arg 


445 








Asn 


Asp 


Glu 


Asn 
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V? .X 1 1 


Thr 


Leu 


O 0 r 


1*1 e i» 


Asn 


Leu 


Phe 


Ser 


His 


Asp 


Ser 


Asp 


Val 


Leu 


Val 


n uj 










470 










475 










480 


T \/<5 


Trp 


Asn 


L u 
eu 


Met, 


Val 


Gly 


His 


His 


Lys 


Asn 


Val 


Cys 


Arg 


Leu 


Thr 










4 85 










490 










4 95 




uiy 


1 III 


OIU 


Jri 1C 


Lys 


ASp 


Oft v- 


O JL U 


TK t 

im 


Phe 


Leu 


Lys 


He 




His 


VOX 








500 
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510 






Lys 


trne 


it lie 


Arg 


C\i<s. 
t»ya 


ne t. 


Asn 


CTft t- 


Asn 


Ser 


Ser 


Gly Glu 


Asn 


c: 1 n 


nld 
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Leu 


oiy 


O J. y 


Pho 
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Lys 


Arg Arg 


TK v 
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Pro 


Asn 


TK r 
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Tyr 


Asn 


Leu 




Ol u 


Car 
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Leu 


Met 
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Pro 


OlU 
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TK v* 
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ji j 










^ sn 

J *J V 
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JwU 


Leu 


L6U 


i ie 


we t. 


Leu 


TK t* 

1 nx 


Lys 


oiy 


Oft 

oer 


ASp 


Tyr 


Asn 


Oft r- 


nidi 


Tift 

1 ie 


vai 










JD J 










0 1 V 










~j / 






Asn 


Cys 


fit 11 

bill 


Tyr 


ASp 


TK r- 

inr 


Trp 


vai 


Arg 


iiys 


olU 


vai 


nla 


vai 




































UXU 


Asn 


1X1 XT 


Tyr 


Cys 


Thr 

1 nr 


Cys 


vai 


uiy 


oiy 


Trp 


CalU 


Tift 

1 ie 


Pho 
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Leu 


Oft r- 

oer 






D :?D 
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Ala 


Arg 


Lys 


Asn 
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Lys 


ASp 


Cys 


Asp 


Asp 


Ser 


Vol 


ui y 
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0 1 j 




















Asn 
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Ser 
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Asn 


Leu 


CftT" 
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Lys 


Ser 
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Cys 
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Lys 


Cys 


Asp 
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Leu 
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Pho 
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irp 


inr 
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Lys 


Phe 


Pne 


Tyr 


Leu 


Oft r- 

oer 










01 j 










650 
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oin 
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lie 
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Asp 


Pro 


Leu 


Gin 


Leu 


Cys 


DK ft 

irne 


Pro 


Pro 


















D O -J 
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TK *- 

inr 


nlS 


Leu 


Tift 

1 ie 


Asp 


Leu 


Glu 
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Asp 


Val 


Ser 


Leu 


Lys 


HIS 


Aia 


Leu 
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Arg 


Ala 


vai 


Asn 


Ala 


Ala 


Ala 


Asn 


Val 


Met 


Ser 


Tyr 


Leu 


inr 


Met 
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*"t f\ rt 
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Ser 


Pne 


Asn 


oin 


Arg 


Val 


Pne 


oiy 


Thr 


lie 


Thr 


Thr 


Leu 


oer 
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/ LP J 
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Ti- 
ne 


Ser 


lie 


riiS 


Leu 


Ser 


Gly 


Ala 


Asn 


Asn 


Asn 


GlU 


Ser 


Lys 


Asn 
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oer 


Asp 
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Asp 


Thr 


olU 


Asp 


Leu 


lie 


Pro 


irne 


oer 


Asn 
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/ OU 






Asn 
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Arg 


Lys 


Ser 


Gly Asn 


Asp 


Pro 


Gin 


Lys 


Ser 


inr 


Arg 


i*ys 


Tarn 

Lys 
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"J c c 








Ser 


Lys 


vai 


Asn 


n 1 a 

Aia 


Thr 


Arg 


Lys 


Ser 


Ala 


Pro 


val 


Thr 


Lys 


Lys 


Leu 




770 
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~i O y\ 
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Ser 


Ser 


Oat- 

oer 


vai 


irne 


Glu 


Ser 


Tift 

lie 


Arg 


uiy 


rue 


Pne 


GlU 


C A V 

oer 


tllS 


i nr 


785 










790 










795 










DAA 
OUU 


Glu 


Gly 


01 y 


lie 


Tift 

lie 


Asn 


Asp 


Arg 


oiy 


lie 


Leu 


TK r- 


Lys 


V3lU 


Arg 


Tift 

1 ie 










AOS 

0 *j j 










810 










01 c. 




Asp 


Val 


XT fie 


biy 


Asn 


Asn 


Leu 


Asp 


TK -r 

inr 


Asn 


Pro 


Glu 


Ala 


Leu 


oiy 


olU 




























o^u 






Glu 


Asn 


wly 


uiy 


oiy 


Gly 


Gly 


Tift 

lie 


vai 


Ser 


Ser 


He 


Pro 


oiy 


Leu 


Ser 






ft ^ 










Q A f\ 
O fi U 










845 








Thr* Glu 


bin 


inr 




lie 


Leu 


Lys 


TK v* 

inr 


Glu 


Gin 


Asn 


Asn 


oer 


TK -r* 

inr 


O A V 

oer 




850 










855 










860 










Asp 


Phe 


Leu 


Asp 


Phe 


Phe 


Lys 


Lys 


Phe 


Asn 


Glu 


Met 


Asp 


Asp 


Val 


Glu 


865 










870 










875 










880 


Glu 


Glu 


Glu 


Glu 


Lys 


Met 


Glu 


Glu 


Gly 


Glu 


Lys 


Glu 


Glu 


Glu 


Glu 


Ala 










885 










890 










895 




Asp 


Leu 


Glu 


Thr 


Asp 


Asp 


Trp 


Leu 


Asp 


Glu 


Ala 


Arg 


Lys 


Ala 


Phe 


Glu 








900 










905 










910 






Tyr 


Lys 


Asp 


Ser 


Asp 


Phe 


Leu 


Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Thr 


Asn 


Glu 






915 










920 










925 








Met 


Thr 


Ser 


Ser 


Leu 


Ala 


Lys 


Asn 


Asn 


He 


Glu 


Glu 


Asp 


Glu 


His 


Ser 




930 










935 










940 










Arg 


Cys 


Ser 


Val 


Ser 


Ser 


Lys 


Leu 


Asn 


Asn 


Lys 


Gin 


Pro 


Val 


Met 


Asp 
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945 










950 










955 










960 


Glu 


Glu 


Lys 




Ala 


Glu 


He 


Val 


Asn 


Glu 


Phe 


Asp 


Lys 


Cys 


He 


Ser 
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970 










975 




Leu 


Asp 


Asn 


He 
980 


Thr 


Tyr 


Asn 


Asp 


Asn 
985 


Ser 


Leu 


Leu 


Ser 


Arg 
990 


Leu 


Ser 


Gly 


Val 


Leu 


Met 


Asp 


Ala 


Asn 


Lys 


Arg 


Glu 


Asp 


Gly Asn 


Asn 


Ser 


Asn 
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Val 


Val 


Leu 


Tyr 


Glu 


Pro 


Val 


Gin 


Gly 


He 


Asp 


Asp 


Glu 


Arg 


Phe 


Ser 




1010 








1015 








1020 
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Val 


Pro 


Tyr 


Ser 


Val 


Lys 


Thr 


Met 


Asn 


Leu 


Leu 


Val 


He 


Val 


Tyr 


1025 








1030 








1035 








1041 


Met 


Asn 


Met 


Cys 


Gly 


Leu 


Glu 


Asp 


Asn 


Thr 


He 


Val 


Tyr 


Gin 


Gin 


Leu 










1045 








1050 








1055 


Met 


Pro 


He 


He 


His 


Ser 


Glu 


Phe 


Cys 


Gly 


Lys 


Thr 


Glu 


Glu 


Asp 


Lys 








1060 








1065 








1070 




He 


Cys 


Thr 


Asp Arg 


Thr 


Asn 


Phe 


Met 


Ser 


Ala 


Ala 


Leu 


Glu 


Tyr 


Thr 






1075 








1080 








1085 






Met 


Leu 


Gin 


Tyr 


Met 


Pro 


Glu 


Leu 


Lys 


Lys 


Thr 


Pro 


Arg 


He 


Lys 


Gin 




1090 








1095 








1100 








He 


Lys 


Arg 


Lys 


Asn 


Trp 


Glu 


Arg 


He 


Pro 


Lys 


Val 


Leu 


Asp 


Asp 


Phe 


1105 








1110 








1115 








112! 


Lys 


Asp 


Lys 


Val 


Ser 


Thr 


Cys 


Thr 


Asp 


Asn 


Tyr 


Asn 


Lys 


Leu 


Leu 


Ala 










1125 








1130 








1135 


Thr 


Leu 


Asn 


Lys 


Glu 


Gly 


Lys 


He 


Pro 


Ser 


Glu 


Asn 


Thr 


Lys 


Trp 


Leu 








1140 








1145 








1150 




Pro 


Ser 


Gin 


Gly 


Gin 


Phe 


Met 


Pro 


Val 


Leu 


Gly 


Val 


Ala 


He 


Ser 


Lys 






1155 








1160 








1165 






Pro 


Trp 


Ser 


Pro 


Leu 


Thr 


Leu 


Trp 


Ser 


Ser 


Phe 


Tyr 


Leu 


Gin 


His 


Gin 




1170 








1175 








1180 








Gin 


Arg 


Gin 


Asp 


Val 


Ser 


Leu 


Thr 


Asn 


He 


Thr 


Pro 


Pro 


Asn 


Ser 


Pro 



1185 1190 1195 1200 

Arg Pro Glu Gin 



<210> 190 
<211> 414 
<212> DNA 
<213> SHRIMP 



<400> 190 

atggatggag 

gagacactta 

aattctattg 

ttccctactg 

attaagaaga 

ttgctgcaga 

ggtaaaaaca 



attcttcctc 
aggaacaagg 
agactgtgaa 
aatctataac 
ggaaactcgc 
ttccaagcac 
ccagactacg 



cttacagtta 
tgtgtgggaa 
agaggaagaa 
actctacaaa 
ttctggcaag 
tacaaccttt 
cagagtgtgt 



ttgagtgaat 
ttggccctag 
gactacactg 
caacaacagg 
tctccgagaa 
aaagcagctc 
aaaaattacg 



ccgaatttga 
aagtgttcaa 
ttcttcgatc 
aagaggaaga 
gtctctgtag 
cacgaagttc 
gcgcccatca 



ttacgtggtc 60 
cgaggtgtcc 120 
tagaaactac 180 
aagtacccct 240 
agagctgcgt 300 
ttctaggagg 360 
gtga 414 



<210> 191 
<211> 137 
<212> PRT 
<213> SHRIMP 



<400> 191 

Met Asp Gly Asp Ser Ser Ser Leu Gin Leu Leu Ser Glu Ser Glu Phe 

15 10 15 

Asp Tyr Val Val Glu Thr Leu Lys Glu Gin Gly Val Trp Glu Leu Ala 

20 25 30 

Leu Glu Val Phe Asn Glu Val Ser Asn Ser He Glu Thr Val Lys Glu 
35 40 45 
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GlU 


Glu 
50 


Asp 


Tyr 


i nr 


Val 


Leu 
55 


Arg 


Ser 


lie 


Thr 


Leu 


Tyr 


Lys 


Gin. 


Gin 


65 










70 






lie 


Lys 


Lys 


Arg 


Lys 
85 


.Leu 


Ala 


Ser 


Arg 


Glu 


Leu 


Arg 
100 


Leu 


Leu 


Gin 


He 


Ala 


Pro 


Arg 


Ser 


Ser 


Ser 


Arg Arg 






115 










120 


Val 


Cys 


Lys 


Asn 


Tyr 


Gly Ala 


His 




130 










135 





Ser 


Arg 


Asn 


Tyr 
60 


Phe 


Pro 


Thr 


Glu 


Gin 


Glu 


Glu 
75 


Glu 


Glu 


Ser 


Thr 


Pro 
80 


Gly 


Lys 
90 


Ser 


Pro 


Arg 


Ser 


Leu 
95 


Cys 


Pro 


Ser 


Thr 


Thr 


Thr 


Phe 


Lys 


Ala 


105 










110 






Gly 


Lys 


Asn 


Thr 


Arg 
125 


Leu 


Arg 


Arg 


Gin 

















<210> 192 
<211> 924 
<212> DNA 
<213> SHRIMP 



<400> 192 

atgtggtgtt 

cttttcagaa 

tgtcaagttc 

tggttccaga 

gtcgacccag 

ctatctcaaa 

agctcgaccg 

accagaaatg 

tttctggaga 

ttttactctt 

gggtggaccg 

atgtcagatt 

ttctggacca 

ccccctttta 

cacaaccacc 

tggagtagtc 



ctacacacct 
attttttcag 
cgagggtgga 
aacgtacatt 
aaatgagagg 
aacttttcaa 
aatgtcaagt 
taatagctcc 
ctggtcgacc 
cctccatctc 
ctagctcgac 
acaccagaaa 
ggtgtttctg 
aaaagggccc 
accaccacca 
ttctttgact 



ttcatatagt 
ggcactcgag 
cctctgggtc 
aacttttgtg 
gtataacctt 
aaattttttc 
tccgagggtg 
agaaacgtac 
cagaaatgag 
aaaaactttt 
cgaatgtcaa 
tgtaatagct 
gacgaagata 
acgtgtatat 
tggaagaaca 
ttag 



gagtttttta 
tttagggggt 
gggccaatgt 
tgtttctgga 
ctgttttcac 
agggcactcc 
gaccgctggg 
attaactttt 
agggtatacc 
caaaaatttt 
gttccgaggg 
ccagaaatag 
ttgcttgatg 
aatgactgta 
tctatccttc 



ctcctcttca 
ggaccgctag 
cagattacac 
acaggcgttt 
tagagaattt 
agtttagggg 
tcgggccaat 
gtgtgtttct 
cttctgtttt 
ttagggcgct 
tggacctctg 
aggaagtatc 
gcaaccctct 
ccacacctca 
aacaaaccct 



atctcaaaaa 
ctcgaccgaa 
cagaaattgt 
ctggagactg 
tactcttcct 
gtggaccgct 
gtcagattac 
ggaacaggcg 
cactagagaa 
ccagtttagg 
ggtcgggcca 
ttatggccat 
ccccctccct 
ttcaaaccat 
ctccagaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

924 



<210> 193 
<211> 305 
<212> PRT 
<213> SHRIMP 



<400> 193 



Met 


Trp 


Cys 


Ser 


Thr 


His 


Leu 


Ser 


Tyr 


Ser 


Glu 


Phe 


Phe 


Thr 


Pro 


Leu 


1 








5 










10 










15 




Gin 


Ser 


Gin 


Lys 
20 


Leu 


Phe 


Arg 


Asn 


Phe 
25 


Phe 


Arg 


Ala 


Leu 


Glu 
30 


Phe 


Arg 


Gly 


Trp 


Thr 

35 


Ala 


Ser 


Ser 


Thr 


Glu 
40 


Cys 


Gin 


Val 


Pro 


Arg 
45 


Val 


Asp 


Leu 


Trp 


Val 
50 


Gly 


Pro 


Met 


Ser 


Asp 
55 


Tyr 


Thr 


Arg 


Asn 


Cys 
60 


Trp 


Phe 


Gin 


Lys 


Arg 


Thr 


Leu 


Thr 


Phe 


Val 


Cys 


Phe 


Trp 


Asn 


Arg 


Arg 


Phe 


Trp 


Arg 


Leu 


65 










70 










75 










80 


Val 


Asp 


Pro 


Glu 


Met 
85 


Arg 


Gly 


Tyr 


Asn 


Leu 
90 


Leu 


Phe 


Ser 


Leu 


Glu 
95 


Asn 


Phe 


Thr 


Leu 


Pro 
100 


Leu 


Ser 


Gin 


Lys 


Leu 
105 


Phe 


Lys 


Asn 


Phe 


Phe 
110 


Arg 


Ala 


Leu 


Gin 


Phe 
115 


Arg 


Gly 


Trp 


Thr 


Ala 
120 


Ser 


Ser 


Thr 


Glu 


Cys 
125 


Gin 


Val 


Pro 


Arg 


Val 
130 


Asp 


Arg 


Trp 


Val 


Gly 
135 


Pro 


Met 


Ser 


Asp 


Tyr 
140 


Thr 


Arg 


Asn 


Val 
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lie Ala Pro Glu 
145 

Phe Leu Glu Thr 

Phe Thr Arg Glu 
180 

Phe Phe Arg Ala 
195 

Cys Gin Val Pro 
210 

Thr Arg Asn Val 
225 

Phe Trp Thr Arg 

Leu Pro Leu Pro 
260 

Cys Thr Thr Pro 
275 

Arg Thr Ser lie 
290 

Leu 
305 



Thr Tyr lie Asn 
150 

Gly Arg Pro Arg 
165 

Phe Tyr Ser Ser 

Leu Gin Phe Arg 
200 

Arg Val Asp Leu 
215 

lie Ala Pro Glu 
230 

Cys Phe Trp Thr 
245 

Pro Pro Phe Lys 

His Ser Asn His 
280 

Leu Gin Gin Thr 
295 



Phe 


Cys 


val 


Phe 






ICC 

loo 




Asn 


Glu 


Arg 


Val 




X /U 






Ser 


lie 


Ser 


Lys 


185 








Gly 


Trp 


Thr 


Ala 


Trp 


Val 


Gly 


Pro 








220 


lie 


Glu 


Glu 


Val 






235 




Lys 


lie 


Leu 


Leu 




250 






Lys 


Gly 


Pro 


Arg 


265 








His 


Asn 


His 


His 


Leu 


Ser 


Arg 


Lys 



300 



Leu Glu Gin Ala 
160 

Tyr Pro Ser Val 
175 

Thr Phe Gin Lys 
190 

Ser Ser Thr Glu 
205 

Met Ser Asp Tyr 

Ser Tyr Gly His 
240 

Asp Gly Asn Pro 
255 

Val Tyr Asn Asp 
270 

His His His Gly 
285 

Trp Ser Ser Leu 



<210> 194 
<211> 447 
<212> DNA 
<213> SHRIMP 



<400> 194 

atgacccacc 

accccatatc 

ccccacctta 

caga tctgga 

ctagcggtgt 

agaagagcaa 

aagctacaag 

ctaagttact 



ttgtccttct 
tctcacccca 
ttcactcgct 
ccccagtctt 
atataaggcg 
cacaacaagc 
aatggcctct 
tagttctgag 



tatcctatcc 
tctcacctat 
ccagtttcaa 
cccttttatc 
gcgcggccag 
actctctctc 
ccagcccccg 
gaactaa 



ttatccttgt 
acccccatct 
caccctgttc 
cctaaccggc 
gccagaagca 
cttctaccta 
ccgcaccaag 



<210> 195 
<211> 146 
<212> PRT 
<213> SHRIMP 



cccctgtcta 
cacccatcac 
ttgccgagcc 
accatttatg 
tcagttctct 
gaagagacct 
tccttacacc 



ccaccacctc 
ctctatattc 
aacccataac 
cccccaggcg 
gcaagccagc 
gccaatactc 
atgttggact 



60 

120 

180 

240 

300 

360 

420 

447 



<400> 195 



Met 


Thr 


His 


Leu 


Val 


Leu 


Leu 


He 


Leu 


Ser 


Leu 


Ser 


Leu 


Ser 


Pro 


Val 


1 








5 










10 










15 




Tyr 


His 


His 


Leu 
20 


Thr 


Pro 


Tyr 


Leu 


Ser 
25 


Pro 


His 


Leu 


Thr 


Tyr 
30 


Thr 


Pro 


He 


Ser 


Pro 
35 


He 


Thr 


Ser 


He 


Phe 
40 


Pro 


His 


Leu 


He 


His 
45 


Ser 


Leu 


Gin 


Phe 


Gin 
50 


His 


Pro 


Val 


Leu 


Ala 
55 


Glu 


Pro 


Thr 


His 


Asn 
60 


Gin 


He 


Trp 


Thr 


Pro 


Val 


Phe 


Pro 


Phe 


He 


Pro 


Asn 


Arg 


His 


His 


Leu 


Cys 


Pro 


Gin 


Ala 


65 










70 










75 










80 


Val 


Tyr 


He 


Arg 


Arg 
85 


Arg 


Gly 


Gin 


Ala 


Arg 
90 


Ser 


He 


Ser 


Ser 


Leu 
95 


Gin 


Ala 


Ser 


Arg 


Arg 
100 


Ala 


Thr 


Gin 


Gin 


Ala 
105 


Leu 


Ser 


Leu 


Leu 


Leu 
110 


Pro 


Arg 


Arg 


Asp 


Leu 
115 


Pro 


He 


Leu 


Lys 


Leu 
120 


Gin 


Glu 


Trp 


Pro 


Leu 
125 


Gin 


Pro 


Pro 
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Pro His Gin Val Leu Thr Pro Cys Trp Thr Leu Ser Tyr Leu Val Leu 

130 135 140 

Arg Asn 
145 



<210> 196 
<211> 339 
<212> DNA 
<213> SHRIMP 



<400> 196 

atgttggact 

actagctctc 

aagatcttcc 

attttctatg 

cctttactct 

cccgcctcac 



ctaagttact 
gccggtctga 
aattcataca 
ttatgctgaa 
ttgacagaaa 
ctgaacgctg 



tagttctgag 
tatgaagaaa 
aggtaagcac 
tattttgttg 
tctccaacca 
cagccgatct 



gaactaaagg 
cacttgctcc 
aagttctcac 
gttgaagtga 
gtacggagac 
cttggataa 



aactaacttc 
atctattcga 
tatacacttt 
aaaatattct 
tatggatgtt 



atacgtctcg 60 
ggagcacgag 120 
ggactttgaa 180 
aagtccaatt 240 
tcacaatggc 300 
339 



<210> 197 
<211> 110 
<212> PRT 
<213> SHRIMP 



<400> 197 

Met Leu Asp Ser Lys Leu Leu Ser 

1 5 - 

Ser Tyr Val Ser Thr Ser Ser Arg 
20 

Leu His Leu Phe Glu Glu His Glu 

35 40 
Lys His Lys Phe Ser Leu Tyr Thr 

50 55 
Met Leu Asn lie Leu Leu Val Glu 
65 70 
Pro Leu Leu Phe Asp Arg Asn Leu 
35 

Phe His Asn Gly Pro Asp Glu Arg 
100 



Ser 


Glu 


Glu 


Leu 


Lys 


Glu 


Leu 


Thr 




10 










15 




Arg 


Ser 


Asp 


Met 


Lys 


Lys 


His 


Leu 


25 










30 






Lys 


He 


Phe 


Gin 


Phe 


He 


Gin 


Gly 










45 








Leu 


Asp 


Phe 


Glu 


He 


Phe 


Tyr 


Val 








60 










Val 


Lys 


Asn 


He 


Leu 


Ser 


Pro 


He 






75 










80 


Gin 


Pro 


Val 


Arg 


Arg 


Leu 


Trp 


Met 




90 










95 




Cys 


Ser 


Arg 


Ser 


Leu 


Gly 






105 










110 







<210> 198 
<211> 3438 
<212> DNA 
<213> SHRIMP 



<400> 198 

atggccccgc 

ctatctcccg 

aaggcactgg 

gttaaagcca 

ccccaacgcc 

atgggagtgt 

ccctctggac 

gtgtggagtt 

aggacagtct 

gtgtctaatg 

gtggctccaa 

tcgacctcta 

ggcggcggaa 

ggggacaatc 



ctcacctgaa 
aaggcgccca 
tgagcttttg 
tggagttcca 
ccgtgactga 
ctgcaaactt 
aagaagtcga 
tgcttgaaga 
tagtttgcac 
tgaacgtact 
gtaataataa 
ataatagtaa 
gatcaaagaa 
atggcatgtt 



cgctgcagcc 
aacttcgtcc 
ccagaagacc 
aggagacgat 
gaggcagatg 
ctctccagtc 
ctcatcaact 
gtgtggtagt 
cctgtttata 
taaccagttg 
tagtgtccca 
caacatcagt 
agttacagcc 
tgccggtgcc 



gatctcttgg 
ccggctgctt 
cgcttcacca 
tttaactact 
ttcgccctta 
tctgatgacg 
tccgctcaaa 
ggctctaata 
atccaagtgt 
tttggacatg 
tcaactgttg 
aacaagcgtg 
acagccaaaa 
cctgttgatg 



ataaggtgat 
gtgttggtgc 
ccaacattgt 
ctgccttgtg 
tgaagagtga 
tcatcaaccc 
tttctggtat 
gtaatagttc 
tcaagttttt 
ttgtttttgg 
ttaacaacaa 
ttggtggtag 
atccctttaa 
ttaatttgga 



gtccggacct 
gaaggt tgtg 
gatgagagaa 
tgcaagtatg 
ggacgaagaa 
t tcaagcctc 
gtttcaaaac 
ccctgtctct 
ggtgactaaa 
atcacttgat 
caacaaaccc 
taataacagt 
taatgtagat 
tgactttgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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tttccccaag 
gatgaagatt 
tttaatgtgt 
ggatttttga 
cgtaactata 
tcaattaagg 
aaccctgatc 
aacttgatgg 
tatatgtttg 
gtatacgcaa 
cctgaatacg 
gagttatacg 
ggacagagtg 
attcacggtg 
tcaaaaattg 
agagtacaag 
agaacagagg 
gtaccttcta 
gaggaagtag 
gaagaaatgg 
ggtaatgaga 
aataaggatg 
agcagcagta 
gctgaaaaga 
gaaaggccat 
caacaactat 
attcctgcgc 
aagacaggat 
acttccccta 
aatgactttt 
tgcgacgtga 
tgcacaggtt 
tgctttacct 
tgcagttcta 
acaaatgagg 
aagtcccacc 
gaatcagagg 
catgttaaac 
aatgttaata 
agccataata 
aaccaacaat 
agttttagaa 
acaaagccag 
gtacagaggt. 



ttgaaactct 
tgagtaaaat 
tcatctctga 
ctgcatggga 
tcctcactcg 
ggactgttta 
tggacaagag 
cattcatggg 
caaagcaatt 
aggttgccgc 
aaaattgtca 
ccaaacaact 
cattcccatt 
acacaatcat 
aggacgaaaa 
ccgagaagac 
aagaattctt 
atgtggagga 
aaagggaaga 
aggaggagaa 
accaagaaga 
cagatagtga 
gtagtagtag 
agaaagaaga 
cagaagctgc 
caccatcatt 
ctgagcctat 
ttagttactc 
ctgaaccagc 
tggacatgtt 
ccgcaaaatt 
tcgatgatct 
ccccgattca 
ttaagaggtc 
ttgaaaagga 
gtggttcctc 
atgaaaggga 
attcccctaa 
gtacaacatg 
aaagccactc 
gctcaatcaa 
gttttagtcc 
ttaggcccta 
ctgcctga 



tacaagtaag 
gtgccgtaaa 
gattaacccc 
taagtttgta 
ctcaaactat 
ttataatgac 
ctggggtaac 
tattgacatt 
ccaacatccg 
aggattgaag 
gtttatcaag 
tctccgcaca 
tgcaaatatt 
tgaattgatg 
ca tea tcaag 
cctgtatttt 
caggcctaca 
agaggaagag 
aggatctgat 
ggaagaagaa 
agaacaacaa 
cagegacagt 
cagcagtagt 
ggaagtacct 
ttcctctccc 
ggacatactc 
cgttgctaca 
ttcattcgta 
agctgtgccc 
ggacggttta 
ctttgagggc 
tettaaagee 
cccttctagc 
tagggcaggt 
agcccctaaa 
tggttctgct 
tatgaaatca 
atatgatget 
catgtegtea 
tegtaaacac 
tactcaaaat 
taagaaagat 
taacaagaag 



agcaccatcc 
actgccctta 
tccaaatatg 
gagggggata 
gcctcagccg 
aagtcaaagt 
aagaatggga 
gtaaaggtgt 
gaagtggaag 
teggggaett 
tatgacacag 
ggacttaatc 
gtgacggtaa 
tacaagacaa 
gtgaatccag 
gagattgatt 
tctgttgttg 
gaagaagagc 
aaggaagatg 
caacaacaac 
caacaacaac 
gatagcagca 
agcagcagca 
gecaagatte 
aagagaatga 
cagactgeag 
acctcaccca 
agaggagatg 
gctgctgcca 
ectggegata 
atcaccctac 
accgagactg 
aactcagccc 
tcactttttg 
egtaagaage 
tcttcttctc 
acatcaaagg 
gtaaatagtg 
tcagatagtg 
tcttcttcct 
gtcaagaaga 
gagcttggtg 
cgtgataatg 



ctaaagaaga 
cccccctaga 
accgttcaat 
ctgctggcgt 
ccagggccgt 
ttctgttcca 
agaaacctag 
gcgctaaggg 
aacttgtgcc 
tgtttgatga 
aagggtgcaa 
aatacaataa 
catccgcctc 
aggatggcgt 
cagaagaaaa 
ccgatgatga 
ctgccccgac 
agatggaaga 
aeggagaege 
agecagaaga 
aaccagaaag 
gcagcagtag 
gtagtagcag 
agaagagaaa 
gagtagaaga 
ttgatgagat 
aggcagegae 
acctttcagt 
cttgcacttc 
tagtaatgea 
cagatggtac 
ataacattat 
ccagaaagga 
acactgatga 
acttgaaaaa 
attgtatgag 
ttcacaagtc 
atgtaaataa 
atgeagaage 
ccacaagtga 
ctgttgtaca 
atttcttgtc 
ttaacaccac 



ggtaaatgta 
aattcatacc 
gttttgcaag 
taaaegctte 
gtatgaagcg 
cgataatgtt 
actcccagct 
gattcaaaag 
tcctatggct 
ctgggacctg 
aaagcacagt 
actggaagag 
tagtgatgat 
aaagggagtc 
gaagaataat 
ggtgtgtgag 
aacacccctc 

agaggaggaa 

accagcacag 
agaaagcaat 
agaagaggag 
tagcagcagt 
tgaaaatgaa 
gaggctaagt 
agaacaacaa 
gatggaagaa 
acttgcactc 
agctggtaat 
cgatgttgga 
acctggcgaa 
tgataatgaa 
aaccaccaca 
tattgataat 
tgatagtgaa 
gaggegtaac 
tagtgatgaa 
accaaaagct 
ctcatacaac 
acagectaaa 
taagaaacag 
gtctccacct 
acgcaagcac 
taataatgta 



900 

960 

1020 

1080 

li40 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3438 



<210> 199 
<211> 1141 
<212> PRT 
<213> SHRIMP 



<400> 199 




























Met 


Ala 


Pro 


Pro 


His 


Leu 


Asn 


Ala 


Ala 


Ala 


Asp 


Leu 


Leu 


Asp 


Lys 


Val 


1 








5 










10 










15 




Met 


Ser 


Gly 


Pro 


Leu 


Ser 


Pro 


Glu 


Gly Ala 


Gin 


Thr 


Ser 


Ser 


Pro 


Ala 








20 










25 










30 






Ala 


Cys 


Val 

35 


Gly 


Ala 


Lys 


Val 


Val 
40 


Lys 


Ala 


Leu 


Val 


Ser 
45 


Phe 


Cys 


Gin 


Lys 


Thr 
50 


Arg 


Phe 


Thr 


Thr 


Asn 
55 


He 


val 


Met 


Arg 


Glu 
60 


Val 


Lys 


Ala 


Met 


Glu 


Phe 


Gin 


Gly 


Asp 


Asp 


Phe 


Asn 


Tyr 


Ser 


Ala 


Leu 


Cys 


Ala 


Ser 


Met 


65 










70 










75 










80 
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Pro 


Gin 


Arg 


Pro 


bXU 


ASp 


Ol U 


Glu 








100 


ASp 


v d J. 


lie 


Asn 






1 1 C 
11 J 




Ser 


X 111 


Cor 


aio 










liGU 




(Sin 

Ol u 




145 








Arg 


Thr 


Val 


Leu 


Leu 


vox 


X £11 


Lys 








180 


Hid 

n is 


Vol 


Vol 


Phe 






1 Z7»J 




V Ol 




oci 


Thr 




210 






Asn 


Cot- 


Asn 


Asn 


225 








Gly 


Gly 


Gly 


Arg 


Asn 


Asn 


vol 


ASp 








260 


Asp 


Vol 


Asn 


Leu 












Lys 


Ser 


1 in 










Sex 


Lys 


net 


Cys 


305 








Phe 


Asn 


Val 


Phe 


Met 


rile 


Cys 


Lys 








O H \J 


Asp 


inr 


Ala 


oJ.y 






"5 Q 




Asn 


Tyr 


/Via 


Ser 










inr 


vai 


Tyr 


Tyr 


385 








Asn 


Pro 


Asp 


Leu 


Arg 


Leu 


Pro 


Ala 








*l VJ 


Vdl 


cys 


TV 1 -s 

Ala 


Lys 






ft jj 




Hie 
tils 


riO 


fZl 11 
ulU 


v al 










vai 




Ala 


oiy 


465 








Pro 


Glu 


Tyr 


Glu 


Lys 


Lys 


His 


Ser 








500 


Tyr 


Asn 


Lys 


Leu 






515 




Val 


Thr 


Val 


Thr 




530 






lie 


Glu 


Leu 


Met 


545 








He 


Glu 


Asp 


Glu 



Val 


Thr 


Glu 


Arg 


85 








Met 


Gly 


Val 


Ser 


Pro 


Ser 


Ser 


Leu 








120 


Gin 


He 


Ser 


Gly 






135 




Gly 


Ser 


Gly 


Ser 




150 






Val 


Cys 


Thr 


Leu 


165 








Val 


Ser 


Asn 


val 


Gly 


Ser 


Leu 


Asp 








200 


Val 


Val 


Asn 


Asn 






215 




He 


Ser 


Asn 


Lys 




230 






Ser 


Lys 


Lys 


Val 


245 








Gly 


Asp 


Asn 


His 


Asp 


Asp 


Phe 


Val 








280 


He 


Pro 


Lys 


Glu 



295 

Arg Lys Thr Ala 
310 

He Ser Glu He 
325 

Gly Phe Leu Thr 

Val Lys Arg Phe 
360 

Ala Ala Arg Ala 
375 

Asn Asp Lys Ser 
390 

Asp Lys Ser Trp 
405 

Asn Leu Met Ala 

Gly He Gin Lys 
440 

Glu Glu Leu Val 
455 

Leu Lys Ser Gly 
470 

Asn Cys Gin Phe 
485 

Glu Leu Tyr Ala 

Glu Glu Gly Gin 
520 

Ser Ala Ser Ser 
535 

Tyr Lys Thr Lys 
550 

Asn lie He Lys 



Gin Met Phe Ala 
90 

Ala Asn Phe Ser 
105 

Pro Ser Gly Gin 

Met Phe Gin Asn 
140 

Asn Ser Asn Ser 
155 

Phe He He Gin 
170 

Asn Val Leu Asn 
185 

Val Ala Pro Ser 

Asn Asn Lys Pro 
220 

Arg Val Gly Gly 
235 

Thr Ala Thr Ala 
250 

Gly Met Phe Ala 
265 

Phe Pro Gin Val 

Glu Val Asn Val 
300 

Leu Thr Pro Leu 
315 

Asn Pro Ser Lys 
330 

Ala Trp Asp Lys 
345 

Arg Asn Tyr He 

Val Tyr Glu Ala 
380 

Lys Phe Leu Phe 
395 

Gly Asn Lys Asn 
410 

Phe Met Gly He 
425 

Tyr Met Phe Ala 

Pro Pro Met Ala 
460 

Thr Leu Phe Asp 
475 

He Lys Tyr Asp 
4 90 

Lys Gin Leu Leu 
505 

Ser Ala Phe Pro 

Asp Asp He His 
540 

Asp Gly Val Lys 
555 

Val Asn Pro Ala 



Leu 


Met 


Lys 


Ser 






95 




Pro 


Val 


Ser 


Asp 




110 






Glu 


Val 


Asp 


Ser 


125 








Val 


Trp 


Ser 


Leu 


Ser 


Pro 


Val 


Ser 








160 


Val 


Phe 


Lys 


Phe 






175 




Gin 


Leu 


Phe 


Gly 




190 






Asn 


Asn 


Asn 




205 








Ser 


Thr 


Ser 


Asn 


Ser 


Asn 


Asn 


Ser 








240 


Lys 


Asn 


Pro 








255 




Gly Ala 


Pro 


Val 




270 






Glu 


Thr 


Leu 




285 








Asp Glu Asp 


Leu 


Glu 


He 


His 


X 111 








320 


Tyr 


Asp 


Arg 








335 




Phe 


Val 


Glu 


v»iy 




350 






Leu 


Thr 


Arg 


061 


365 








Ser 


He 


Lys 


Gly 


His 


Asp Asn 


Vol 








4 00 


Gly Lys 


Lys 








415 




Asp 


He 


Val 


Lys 




430 






Lys 


Gin 


Phe 


CZl rt 
vjxi 1 


445 








Val 


Tyr 


Ala 


Lys 


Asp Trp Asp 


Leu 








480 


Thr 


Glu 


Gly 


Cys 






495 




Arg 


Thr 


Gin 


Gin 




510 






Phe 


Ala 


Asn 


He 


525 








Gly Asp 


Thr 


He 


Gly Val 


Ser 


Lys 








560 


Glu 


Glu 


Lys 


Lys 
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565 570 575 

Asn Asn Arg Val Gin Ala Glu Lys Thr Leu Tyr Phe Glu lie Asp Ser 

580 585 590 

Asp Asp Glu Val Cys Glu Arg Thr Glu Glu Glu Phe Phe Arg Pro Thr 

595 600 605 

Ser Val Val Ala Ala Pro Thr Thr Pro Leu Val Pro Ser Asn Val Glu 

610 615 620 

Glu Glu Glu Glu Glu Glu Glu Gin Met Glu Glu Glu Glu Glu Glu Glu 
625 630 635 640 

Val Glu Arg Glu Glu Gly Ser Asp Lys Glu Asp Asp Gly Asp Ala Pro 

645 650 655 

Ala Gin Glu Glu Met Glu Glu Glu Lys Glu Glu Glu Gin Gin Gin Gin 

660 665 670 

Pro Glu Glu Glu Ser Asn Gly Asn Glu Asn Gin Glu Glu Glu Gin Gin 

675 680 685 

Gin Gin Gin Gin Pro Glu Arg Glu Glu Glu Asn Lys Asp Ala Asp Ser 

690 695 700 

Asp Ser Asp Ser Asp Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
705 710 715 720 

Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Glu 

725 730 735 

Asn Glu Ala Glu Lys Lys Lys Glu Glu Glu Val Pro Ala Lys lie Gin 

740 745 750 

Lys Arg Lys Arg Leu Ser Pro Ser Glu Ala Ala Ser Ser Pro Lys Arg 

755 760 765 

Met Arg Val Glu Glu Glu Gin Gin Gin Gin Leu Ser Pro Ser Leu Asp 

770 775 780 

He Leu Gin Thr Ala Val Asp Glu Met Met Glu Glu He Pro Ala Pro 
785 790 795 800 

Glu Pro lie Val Ala Thr Thr Ser Pro Lys Ala Ala Thr Leu Ala Leu 

805 810 815 

Lys Thr Gly Phe Ser Tyr Ser Ser Phe Val Arg Gly Asp Asp Leu Ser 

820 825 830 

Val Ala Gly Asn Thr Ser Pro Thr Glu Pro Ala Ala Val Pro Ala Ala 

835 840 845 

Ala Thr Cys Thr Ser Asp Val Gly Asn Asp Phe Leu Asp Met Leu Asp 

850 855 860 

Gly Leu Pro Gly Asp He Val Met Gin Pro Gly Glu Cys Asp Val Thr 
865 870 875 880 

Ala Lys Phe Phe Glu Gly lie Thr Leu Pro Asp Gly Thr Asp Asn Glu 

885 890 895 

Cys Thr Gly Phe Asp Asp Leu Leu Lys Ala Thr Glu Thr Asp Asn lie 

900 905 910 

He Thr Thr Thr Cys Phe Thr Ser Pro He His Pro Ser Ser Asn Ser 

915 920 925 

Ala Pro Arg Lys Asp He Asp Asn Cys Ser Ser lie Lys Arg Ser Arg 

930 935 940 

Ala Gly Ser Leu Phe Asp Thr Asp Asp Asp Ser Glu Thr Asn Glu Val 
945 950 955 960 

Glu Lys Glu Ala Pro Lys Arg Lys Lys His Leu Lys Lys Arg Arg Asn 

965 970 975 

Lys Ser His Arg Gly Ser Ser Gly Ser Ala Ser Ser Ser His Cys Met 

980 985 990 

Ser Ser Asp Glu Glu Ser Glu Asp Glu Arg Asp Met Lys Ser Thr Ser 

995 1000 1005 

Lys Val His Lys Ser Pro Lys Ala His Val Lys His S r Pro Lys Tyr 

1010 1015 1020 

Asp Ala Val Asn Ser Asp Val Asn Asn Ser Tyr Asn Asn Val Asn Ser 
1025 1030 1035 1040 

Thr Thr Cys Met Ser Ser Ser Asp Ser Asp Ala Glu Ala Gin Pro Lys 
1045 1050 1055 
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Ser His Asn Lys Ser His Ser Arg Lys His Ser Ser Ser Ser Thr Ser 

1060 1065 1070 

Asp Lys Lys Gin Asn Gin Gin Cys Ser lie Asn Thr Gin Asn Val Lys 

1075 1080 1085 

Lys Thr Val Val Gin Ser Pro Pro Ser Phe Arg Ser Phe Ser Pro Lys 

1090 1095 1100 

Lys Asp Glu Leu Gly Asp Phe Leu Ser Arg Lys His Thr Lys Pro Val 
1105 1110 1115 1120 

Arg Pro Tyr Asn Lys Lys Arg Asp Asn Val Asn Thr Thr Asn Asn Val 

1125 1130 1135 

Val Gin Arg Ser Ala 
1140 



<210> 200 

<211> 1875 

<212> DNA 

<213> SHRIMP 

<400> 200 

atgtcctcga cgacctctcc atcgtctttg 
gacgaaaaag atgtcaagca agaagtctcg 
actgtatctt ttagtgataa cgatgaggac 
tcaaactttg atatgtttgg tgattcagat 
cctcctccct ctacaacaac accacttcct 
gatgaatgtg acgaagaagg agcagcagca 
tctatccctg tcgggatctc tgaagctgaa 
gaaattaaga aactcaaaaa gatgatgaaa 
gaacctcctg tcgcagcaga ttataaaaca 
cctagtatcg atatgtgcgg tgtcgcccct 
tccatagatg tgtatacttc ttcctatgta 
aagaaagggt ccaagaaatg tcaattcctt 
cacgagaatg tatgcatggc ccctcccggt 
gaacaaagat tcttggctga acatggagat 
agagtattga acaaagcttt ccccaatctg 
atgacattct atactaacct ttgttggata 
gatgatggag gaatacatgg cgatgcgtca 
gtagtgttcc acaagaggga agaacaaaag 
ctcacgcctg acactagtaa tatgggaaca 
ggaacaaata ctgatatgca agaattccaa 
acttcatcag tgggtgtgaa taccaaccca 
actcctttaa cgaatgatgt accagaattg 
ggaggtggaa attctagaat gagtgcaaat 
attcctacct gcttcacagg aggtgcgaat 
acgttcccct tagaatgtga cgaagatgat 
gaggataaag tctttcatcc attttatgag 
ccatcatata acaagggaca gacttgtaat 
tctcttgcaa agcgtagagg aacagtcccc 
ttttgtaata tggatgtatg tactgccatg 
ggacaatatt gtaatagatg tatcgtaagg 
tacattgttt gcagagacgc ttcatgtgat 
gacatggatg actga 

<210> 201 
<211> 620 
<212> PRT 
<213> SHRIMP 



tgggatgatg acgatgatga tgacgaagaa 60 
aaccgtcccc ccattttttc gtacatggaa 120 
gataacaagg gagaagaaga atgttttgga 180 
aacatgccat caacttctac tgcccctttc 240 
actcctcgat ccatcatgga tactgattcg 300 
gcatcagcac cgtctattgc cgcctcttct 360 
ttgaaaaaaa tggaaaagaa aaagaggaag 420 
gatcctctcc ctcacctata tgtaggagga 480 
agggcaaaca tttcccttta taaagttgac 540 
cctcaatttt gcgctgaatt gcccacccca 600 
tttcctcctc ccacacctgc catgcataat 660 
aaggggagaa aggctttgag gaaatggatt 720 
aaaaggggag gtgtattttt ggctcacttg 780 
gaatacaagg tcccaaggat gtttgtttca 840 
attgctcgtg cagacacact gtgcagtgat 900 
gttaatggag ttgtcgtatg ctttgataaa 960 
gagtatgcaa caggagaaaa ttttgatact 1020 
accaatggga gtgccagtaa gaagaggcgt 1080 
agcactgatg tgcaagaatt ccaaacgatg 114 0 
tcaatgggaa caaataccaa ccccatagag 1200 
cttcccaacc ctcccccaag attggtaatt 1260 
gacatgatgt ggctttattc gccttccaga 1320 
acaggaacat ctcccctgtc taacacccca 1380 
gtagtagtgc ctaatggatt tgtccctccc 1440 
ccaagtattc ccaattctta caattacgaa 1500 
tatatggcca aatatctatc ccctcttgtt 1560 
gttgtccagg agtggttcaa gggatccttc 1620 
aaattctgta gtaacatttc ccacgctttc 1680 
tgcaaatggg cgaagactgt aattagacat 1740 
aggtcatgta catccatgct cgcatatcac 1800 
gttcccaagt gcagggaaag ggttcgcaac 1860 

1875 



<400> 201 

Met Ser Ser Thr Thr Ser Pro Ser Ser Leu Trp Asp Asp Asp Asp Asp 
15 10 15 
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Asp 


Asp 


UlU 


fZ*\ ii 

OiU 










Pro 


Pro 


Tl A 
lie 












Asp 


Asn 


Lys 






50 






oiy 


Asp 


Ser 


Asp 


65 








Pro 


Ser 


Thr 


Thr 


Asp 


o6I 


Asp 


ulU 








1UU 




lie 


Aia 


nla 






1 1 c 




Leu 


Lys 


Lys 


Met 




1 JU 






Lys 






Lys 


145 








Pro 


Val 


Ala 


Ala 


vai 


Asp 


Fro 


C A -r- 








1 o u 


Aid 


ulU 


Leu 


Pro 






i?j 




Pfie 


Pro 


Pro 


fro 




o i n 






Cys 


uJ.n 


rne 


Leu 


225 








Asn 


Val 


Cys 


Met 


His 


Leu 


GlU 


Cjin 










Pro 


Arg 


Met 


Pne 






O ~7 C 




lie 


Ala 


Arg 


Ala 




o o n 






Leu 


Cys 


Trp 


lie 


305 








Gly 


Gly 


He 


His 


Asp 


Tnr 


vai 


vai 










Ala 


ber 


T • •» 

i.ys 


Lys 






ODD 




oer 


1 nr 


Asp 


vai 




OTA 






bin 


ValU 


rne 


c?in 


385 








Ser 


Val 


Gly 


Val 


Val 


He 


Thr 


Pro 








420 


Lys 


Pro 


Ser 


Arg 






435 




Thr 


Ser 


Pro 


Leu 




450 






Ala 


Asn 


Val 


Val 


465 








Glu 


Cys 


Asp 


Glu 


Glu 


Asp 


Lys 


Val 



Asp 


Glu 


Lys 


Asp 




Tyr 


Mo r* 










40 


ulU 


KaJ. U 


rt: l n 

vji. u 


Cys 






-J J 




Asn 


pje t. 


fro 






70 






inr 


Pro 


Leu 


fro 


85 








Cys 


Asp 


Glu 


Glu 


Ser 


Car 

oer 


oer 


T 1 c 

i ie 








ltU 


ulU 


Lys 


Lys 


Lys 






X -J -J 




Asp 


Pro 


Leu 


fro 




1 OU 






Asp 


Tyr 


Lys 


TV* v- 

mr 


165 








He 


Asp 


Met 


Cys 


TV* -r- 


Pro 


Ser 


lie 










inr 


Pro 


TV 1 -a 

Ala 


Met 






^1D 




Lys 


Gly 


Arg 


Lys 










Ala 


Pro 


Pro 


t»iy 


245 








Arg 


Phe 


Leu 


Ala 


Val 


Ser 


Arg 


Val 








280 


Asp 


Thr 


Leu 


Cys 






o o c 




Val 


Asn 


Gly 


val 




olu 






Gly 


Asp 


Ala 


Ser 


325 








Phe 


His 


Lys 


Arg 


Arg 


Arg 


Leu 


Tnr 










Cjin 


blU 


pne 


bin 










ber 




c^iy 


inr 










Asn 


Thr 


Asn 


Pro 


405 








Leu 


Thr 


Asn 


Asp 


Gly 


Gly 


Gly 


Asn 








440 


Ser 


Asn 


Thr 


Pro 






455 




Val 


Pro 


Asn 


Gly 




470 






Asp 


Asp 


Pro 


Ser 


485 








Phe 


His 


Pro 


Phe 



Val 


Lys 


Gin 


Glu 


25 








Ser 


Phe 


Ser 


Asp 




uiy 




Asn 










Thr 

X I IX. 




Thr 


AX a 






75 




TVi r- 


Pro 


Arg 












uiy 


Aid 


Ala 


7V 1 -> 
nla 


105 








Pro 


Val 


Gly 


He 


Arg 


Lys 


Pin 


Tip 

1 1c 










U i c 

nis 


Leu 


Tyr 


val 






155 




Arg 1 


Ala 


Asn 


He 




J. / V 








V d J. 


Ala 


Pro 


185 








Asp 


Val 


Tyr 


Thr 


rllS 


Asn 


Lys 


Lys 








220 


Aia 


Leu 


Arg 


Lys 






235 




Lys 


TV — — 

Axg 


Gly Gly 










G1U 


His 


Gly Asp 


265 








Leu 


Asn 


Lys 


Ala 


Ser 


Asp 


Met 


Thr 








300 


vai 


vai 


Cys 


Phe 






315 




GlU 


Tyr 


Ala 


Thr 




*a *s n 






ulU 


blU 


Gin 


Lys 


345 








Pro 


Asp 


Thr 


Ser 


mr 


wex. 


Gly Thr 








380 


Asn 


i nr 


Asn 


Pro 






395 




Leu 


Pro 


Asn 


Pro 




410 






Val 


Pro 


Glu 


Leu 


425 








Ser 


Arg 


Met 


Ser 


He 


Pro 


Thr 


Cys 








460 


Phe 


Val 


Pro 


Pro 






475 




He 


Pro 


Asn 


Ser 




490 






Tyr 


Glu 


Tyr 


Met 



Val Ser Asn Arg 
30 

Asn Asp Glu Asp 
45 

Phe Asp Met Phe 

Pro Phe Pro Pro 
80 

He Met Asp Thr 
95 

Ala Ser Ala Pro 
110 

Ser Glu Ala Glu 
125 

Lys Lys Leu Lys 

Gly Gly Glu Pro 
160 

Ser Leu Tyr Lys 
175 

Pro Gin Phe Cys 
190 

Ser Ser Tyr Val 
205 

Gly Ser Lys Lys 

Trp He His Glu 
240 

Val Phe Leu Ala 
255 

Glu Tyr Lys Val 
270 

Phe Pro Asn Leu 
285 

Phe Tyr Thr Asn 

Asp Lys Asp Asp 
320 

Gly Glu Asn Phe 
335 

Thr Asn Gly Ser 
350 

Asn Met Gly Thr 
365 

Asn Thr Asp Met 

He Glu Thr Ser 
400 

Pro Pro Arg Leu 
415 

Asp Met Met Trp 
430 

Ala Asn Thr Gly 
445 

Phe Thr Gly Gly 

Thr Phe Pro Leu 
480 

Tyr Asn Tyr Glu 
4 95 

Ala Lys Tyr Leu 
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500 










505 










510 






Ser 


Pro 


Leu 
515 


Val 


Pro 


Ser 


Tyr 


Asn 
520 


Lys 


Gly 


Gin 


Thr 


Cys 
525 


Asn 


Val 


Val 


Gin 


Glu 
530 


Trn 


Phe 


Lys 


Glv 


Ser 
535 


Phe 


Ser 


Leu 


Ala 


Lys 
540 


Arcr 


Ara 


Glv 


Thr 


Val 


Pro 


Lys 


Phe 


Cys 


Ser 


Asn 


He 


Ser 


His 


Ala 


Phe 


Phe 


Cys 


Asn 


Met 


545 










550 










555 










560 


Asp 


Val 


Cys 


Thr 


Ala 
565 


Met 


Cys 


Lys 


Trp 


Ala 
570 


Lys 


Thr 


Val 


He 


Arg 
575 


His 


Gly 


Gin 


Tyr 


Cys 
580 


Asn 


Arg 


Cys 


He 


Val 
585 


Arg 


Arg 


Ser 


Cys 


Thr 
590 


Ser 


Met 


Leu 


Ala 


Tyr 
595 


His 


Tyr 


He 


val 


Cys 
600 


Arg 


Asp 


Ala 


Ser 


Cys 
605 


Asp 


Val 


Pro 


Lys 


Cys 
610 


Arg 


Glu 


Arg 


Val 


Arg 
615 


Asn 


Asp 


Met 


Asp 


Asp 
620 











<210> 202 
<211> 2820 
<212> DNA 
<213> SHRIMP 



<400> 202 

atgtctgagc 

aaagaattcg 

tatgactcaa 

tctgataatg 

cctgtcaaga 

taccggaggt 

atgcaagatc 

ttagtcatgt 

acaacacacg 

aaatatgaaa 

gcagactctg 

cagtatttta 

aatacatggt 

gctccgtcca 

tctgagcaga 

ggtgtagaaa 

gtagtaggaa 

gcggctatgc 

aaaaatatga 

ctcaaattaa 

gctaaaagaa 

cccgagacgg 

ccaaaacacc 

aagagcggtt 

aaactggaag 

ttcttggaca 

cctctgatga 

tgcgctagat 

gcgagtgcta 

ttggaggagt 

tttgccgatc 

gtaatacaag 

atttttagcc 

gcaggaaatg 

aacgttttcc 

tcaaagattg 

aagtttgtct 

tttgggcagc 



cttcagttta 
ggcttttagt 
gcaaacttga 
aaaaaagagt 
accctgaggg 
acaagcaaag 
acgttgacca 
gggctgctgt 
tgcctggctt 
atgtggtcat 
gctcatgtta 
ctaatctttc 
ctgatgtagt 
gatctgaaat 
catctgctcg 
tagacccatc 
cagagaaaaa 
gccttcaagt 
ttaaaatgag 
ttggagtaac 
aggaaacggc 
tgacagaaga 
ctgcgccctt 
cttcttcctc 
aaaaggcgag 
ttttagaagg 
taggcgtgga 
ttgcctcctc 
tcacttctag 
ctgttataga 
tggcaactct 
tacttgggtt 
cagaggattt 
tagatgtggg 
ttcaaactga 
attttcttca 
acatacaaga 
gtataataac 



tgcttttatt 
acaaccagga 
ttgtgacgca 
aggagagtgc 
aaaaacattg 
agacccccat 
tgaaactgga 
cccttggata 
tgcattcaag 
ttcaaaggct 
tcgatctgga 
ttacaattta 
ttctgtattg 
ggatgaaatt 
ccagaaagca 
ttctgtggat 
atccgaccgc 
catgggtctg 
tagaaacaac 
tttggccata 
cattaataat 
agatgcactt 
tgtggacatt 
aatatggaac 
tgttttcgtt 
taaactgttt 
ccaaatattg 
agctctggag 
gttagccgta 
atttgtagtt 
tgccgcgtca 
gattttggat 
aaagaagcag 
tgtagctcaa 
agaaaacggg 
aaaatacttc 
agcagctcaa 
agctgctgat 



gacataaaag 
cagaaattag 
ttttcttgca 
aactttgccg 
cgccatttca 
actggtttgc 
aataaaatgt 
cgacctggag 
gaggatgacg 
tattgtgatt 
tggatgaaat 
gctaacccaa 
acagataacc 
atcaccaaga 
gagaatataa 
gctttaatgc 
ctcatgcgag 
gacgacagtc 
ccagaatatg 
aaaagatctg 
ggcgaacagc 
ttatttgcaa 
ttacactcac 
gatatacttt 
aaaaatcttg 
tcggacggat 
agagaagtca 
tctagtttgg 
caactggcag 
gccgcaagcc 
gccttaaccg 
cttgcgctag 
gttctagtgt 
ccagtcaccc 
gaagaaaaga 
cattctactc 
gaatacttgg 
gatagtgaca 



aaatagaaaa 
ctcctttcag 
taccttcaga 
aacacacctc 
cggcatgtgg 
cagtagccag 
gtgaatatct 
accttactga 
aaagggattc 
tctttaaaca 
ttgtccattt 
aaccttaca'a 
ctattgtaga 
agaagttcaa 
tacgttctca 
agtttgttaa 
tggcagacgc 
aatctagacg 
caagacattt 
ttttctctaa 
atagaagaag 
gagaaaatat 
cagatatcaa 
caagaatttc 
tggtgaaagt 
acgaatggga 
ttaaggcggc 
ttactggctt 
ccagaacgtt 
ttcggctagc 
ttattggaat 
ggttaggttg 
ttaggagaga 
ctgaagaaat 
aggaagaagg 
ctttgatggg 
gaggaagaac 
ccaccaccac 



tggttgggaa 
ggatatttct 
tatacttcat 
tgtctcattt 
tccagggtgt 
aggcgtcctg 
gaaccagagc 
aggttacaac 
aaaaagagtt 
gtattatgac 
aatgtttggg 
tttaactgga 
tgcgggtgct 
cgtattccct 
gtacggagat 
tagggaaggt 
tgttatggat 
attattgtta 
ttccagttct 
aggcgcttca 
cagatggtca 
cactgaagac 
ttcatctatc 
ttcgaccaga 
agtgagacag 
cgataatatt 
cagtaatatg 
tatagactcg 
ctccgtattc 
gatacaggca 
tgttatattc 
gtacgatcac 
gtttgcaaag 
cgtcgctatc 
agcacgaaaa 
aaagaaaagt 
aatgaacgct 
cacacaagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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ggaaggaggg atgacgaaac agtgactaag 

caaactctta aggattactc gtctgctgtt 

ggagaggaat gggtaagaaa tactgcccta 

gataacctat tcaagaaaac tgtttctctt 

ggaataggtg tattggtagc gtcccatatt 

gcttttgcgt tcgcaggtct cctagcattt 

atgaatgcta tgggtgtagt gaattcggac 

gatatcaaaa cagaccccag aagagtagga 

aaatacaaca tgattacaga tttcgtctct 



aaaatgagga gtattattct agaaacaggt 2340 

aactataacg cctcccgtct agattacgtg 2400 

aaagaagaga caaggagcaa cactactagt 24 60 

gctagtatgg ccggcgcatt tctggttcta 2520 

acacttttac gtttcaccaa tattggtcta 2580 

attgcactta tgagtatatc atatataaac 2640 

gcaatataca ggtctactgc tctagttgga 2700 

atggtccagc gccacgtagg tgtcggggct 2760 

ccaatgttag acgagatcga gagtgactaa 2820 



<210> 203 
<211> 931 
<212> PRT 
<213> SHRIMP 



<400> 203 
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Gly 


65 








Pro 


17-1 1 

val 


Lys 


Asn 


Gly 


Pro 


Gly 
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Pro 
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115 
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Lys 


Met 
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Val 


Pro 
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lie 


145 








His 


Val 


Pro 


Gly 


Arg 


Val 


Lys 


Tyr 








180 


Phe 


Lys 
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Tyr 






195 




Trp 


Met 


Lys 


Phe 
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Ser 


Tyr 


Asn 


Leu 


225 
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Ser 


Asp 


Val 


Gly 


Ala 


Ala 


Pro 








260 


Lys 


Phe 


Asn 


Val 






275 




Glu 
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lie 


He 




290 






Ser 


Ser 


Val 


Asp 


305 








Gly 


Thr 


Glu 


Lys 


Met 


Asp 


Ala 


Ala 



340 



ser 
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Tyr 


Aia 
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Lys 


Glu 
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Gly 
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Asp 


He 


Ser 
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Pro 


Ser 
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I u 
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Glu 


Gly 
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Tyr 


Arg 


Arg 


Tyr 
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Leu 


Met 


uin 
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Glu 


Tyr 


Leu 


Asn 






135 




Arg 


Pro 


Gly 


Asp 




150 






Phe 


Ala 


Phe 


Lys 


165 








Glu 


Asn 


Val 


Val 


Tyr 


Asp 


Ala 


Asp 
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Val 


His 


Leu 


Met 






215 




Ala 


Asn 


Pro 


Lys 




230 






Val 


Ser 


Val 


Leu 


245 








Ser 


Arg 


Ser 


Glu 


Phe 


Pro 


Ser 


Glu 








280 


Arg 


Ser 


Gin 


Tyr 






295 




Ala 


Leu 


Met 


Gin 




310 






Lys 


Ser 


Asp 


Arg 


325 








Met 


Arg 


Leu 


Gin 



Phe 


lie 


Asp 


lie 
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Leu 
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C A V 

oer 


ser 
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He 
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His 
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Asp 
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Asp 
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Asp 
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Val 
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Leu 
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Glu 


Gly 






155 




Glu 


Asp 


Asp 


Glu 




170 






He 


Ser 


Lys 


Ala 


185 








Ser 


Gly 


Ser 


Cys 


Phe 


Gly 


Gin 


Tyr 








220 


Pro 


Tyr 


Asn 


Leu 






235 




Thr 


Asp 


Asn 


Pro 




250 






Met 


Asp 


Glu 


He 


265 








Gin 


Thr 


Ser 


Ala 


Gly 


Asp 


Gly 


val 








300 


Phe 


Val 


Asn 


Arg 






315 




Leu 


Met 


Arg 


Val 




330 






Val 


Met 


Gly 


Leu 



345 



Lys 


G1U 


Tin 
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1 D 




Pro 
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ASp 
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<400> 204 

atggattcgt 

acaaacaaac 

agggcaagtc 

tgtataccca 

ttcgaatgga 

ctttctacaa 

aaagaagaag 

aatgtggaag 

ataattgttg 

ttctctaatc 

gctgctgaca 

tccgccctcc 

cagcgctcct 

ccgagtaatg 

gcgcaatata 

aacggccttg 

gaagtgaaag 

aaacatctcg 

aaaaaaatca 

tcgtgtccgc 

aatgaattct 

tctcaaaaaa 

atggagggaa 

tcattagcta 

ttttattttt 

tcagagacgg 

cctaatataa 

atcgaggaca 

caatgtatgg 

cgtggtaacc 

aatagtaaag 

ggaaattata 

gggatcgaac 

ttgtatggta 

aatagaatca 

gccgaaagtg 

gctgtttcat 

tggtggatcg 

aagaaatgtt 

aaaacttgtg 



tgactaatac 
caatcaccga 
ttttgaagtt 
agaacaagta 
gagatggaac 
aaaaatcgcc 
aatttgaaga 
atttcataaa 
atgacgaaga 
atctgaggcg 
ttgagaagaa 
agcaacagca 
catctgagaa 
gcttctcaga 
atggactcgt 
gtctgaaatg 
atgtgtggtg 
cattcgcaga 
ccgcctattt 
ttttgaaaag 
caggagttaa 
accccaacat 
ttataaccca 
cggctatggc 
ctgtatattt 
ataaggtaca 
gtgctcacca 
gacctatccg 
catgcaagga 
agcacttgtc 
tagattca tt 
catctaagtt 
cagtgtctac 
aagtcgttga 
gggtcttttt 
caaaaaatgc 
attttgagca 
aacttaatga 
ctaaagtaaa 
ataaatacaa 



agtaacactc 
acaagatgtg 
atattctgtt 
cacgaatgtg 
aaaagtacac 
tcggtcctca 
agaatttgaa 
tggtgatgga 
agaagagaat 
ccagactgcc 
ggacaaaaac 
acaaaaactg 
ggtcacctcc 
acagaccgag 
ggagctagac 
cagtatggaa 
tcagcctaaa 
atcgcctata 
ttgtttagac 
tggatcaaac 
aaatgacgat 
tgaaattgta 
tagaaaagat 
attttgccac 
accattctct 
tattggttca 
gaacaataac 
aaataataat 
aagatgcaca 
acatagtgta 
gaaaaaattg 
ttgtgataga 
ttctttctca 
cgttatcaat 
taactctgag 
attcaaggat 
aaacaaatta 
cttgaacatt 
tgatgcagaa 
aactgccaag 



ctcgtgaatg 
gaaaatactc 
ttcatcaaag 
caagaaatat 
agatcagttt 
ccttcccctc 
gatgatgaag 
gaagattcag 
gaagagggag 
gccgctgccg 
cacgcagtta 
ctccagcaac 
acacccaaca 
ctctttgttt 
atcttaccca 
actcccccag 
actagctttg 
cttcaaaggc 
gattctgtag 
tttcgagtgt 
gacacgtctt 
tcaaaattaa 
ttgttcgaga 
ccaaaagcta 
aaaataactc 
gacgcgatct 
aacaataata 
ataagcagaa 
aacaattgca 
aaaggggaag 
agcagagtac 
agctctatgt 
tcggacagtt 
agttattctg 
gaaaaagata 
atacttgttc 
tcctctaaag 
atgtttgaga 
tctttaaagg 
agggcaatta 



accgtcttgg 
ttaaccttaa 
aaatgcagtc 
tcgaagatgg 
caccaagttc 
ctccatcgat 
aaatatacga 
aagaggaaga 
aaaacaagta 
ccgccgccgc 
gtgcgcatga 
agcaacagca 
aattcaacaa 
gtttcgatgt 
ttgttgctga 
tgaaaccgtg 
aaaatgatgc 
ctagagattt 
acattaaaaa 
ccgaatattc 
caaacacttg 
atattgaatt 
cgggcatttt 
gagttcgaaa 
gcaaagaaac 
tttctccccc 
acaataatac 
aaatgaccat 
ctaacggtaa 
atttctttaa 
tgattcccgc 
gccatagctt 
ttgaaaagac 
ccataaaaac 
ataagactat 
acgaatgtaa 
atgggcatct 
aacacgtgga 
atatttttaa 
ttggagcaca 



aaatcatcga 
cagtctagaa 
ctattctgga 
actaattact 
ccctatacct 
gccttctatt 
aacagatgaa 
agaagaagat 
cgttctcgca 
agctgctgct 
ctacacactg 
acaacaccag 
gtttttactg 
ggataaaatt 
atacattatc 
caggagaaag 
tgtggaagat 
ccctatccct 
cccctggggt 
gcgtcatttt 
ctttatatac 
tgaggtaatg 
gagcgattct 
tgttgcattg 
tataaaatgt 
aagtgataat 
cagtgtgaat 
cacaaactac 
ctatcccgat 
gattttaaat 
tcctccctct 
cttttgtaga 
taaacttgtt 
ttcccataat 
cccctctaga 
taaagaacga 
atctaacaag 
agatttttac 
tgattttgaa 
agacccttct 



60 
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660 

720 

780 

840 
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1080 

1140 

1200 
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1680 

1740 

1800 
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1920 
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2220 
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acttctactc cctctaaaaa ggagaatggt atcactagga ttattagtac attatccgaa 2460 

tttcattcaa aagatgaagc tacagtaagt gcccttctcg acaaaacaat gctcttggga 2520 

tcgaggacaa taatgtctgg tgttagatgt gttatacgta acaatagtgt gttttcgggc 2580 

tttgaaaata agaacactaa taataattgg gaacttgaga ttagacacta tgtcatctct 264 0 

atgggaggtg ctgcagtgac aaagatttcc gatgaagatt tggaacaatt cacgcctgta 2700 

agaggtgctg tctctgtcac tacagcacct aatgataagc tacctgtagg ggcacatcag 2760 

acatggaagg atgaacaaac actaaaaaca aacactaaac gtaatagtct atatgactct 282 0 

tacaattcaa aaaggaataa tagggataat aataaaataa aaaatcgttc attaaaacta 2880 

tcagatttta attggagaac acccaatatc tctattcaag aatttaatgc aaataaagat 294 0 

gatgttaaca agaagaggta cgcagaagtc gtggcgtcag ctgctccaaa gtcaccttca 3000 

ccaacgagca gcagcagcag caacagcaac agcagcagcc ctcctctttc accgctctca 3060 

ccaacagtga agaatagtaa taataaacca ttgtatattc ctccccataa aagaatgacc 3120 

actactgctg tttga 3135 

<210> 205 
<211> 1036 
<212> PRT 
<213> SHRIMP 
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Cys 


Arg 


625 










630 






Phe 


Ser 


Ser 


Asp 


Ser 


Phe 


Glu 


Lys 










64d 








vai 


vai 


Asp 


va i 


Tl « 

lie 


Asn 


□er 


Tyr 








660 










Asn 


Arg 


Tl _ 

lie 


Axg 


vai 


Phe 


On A 

Pne 


Asn 






675 










680 


lie 


Pro 


Ser 


Arg 


Ala 


Glu 


Ser 


Ala 




690 










695 




vai 


nlS 


blU 


Cys 


Asn 


Lys 


Ol u 


Arg 


705 










710 






Lys 


Leu 


Ser 


Ser 


Lys 


Asp 


Gly 


His 










725 








Leu 


Asn 


Asp 


Leu 


Asn 


He 


Met 


Phe 








740 










Lys 


Lys 


Cys 


Ser 


Lys 


Val 


Asn 


Asp 






755 










7 60 


Asn 


Asp 


Phe 


Glu 


Lys 


Thr 


Cys 


Asp 




770 










775 




He 


He 


Gly 


Ala 


Gin 


Asp 


Pro 


Ser 


785 










790 






Asn 


Gly 


He 


Thr 


Arg 


He 


He 


Ser 



805 





JOU 










335 




Ir i- w 


T 1 p 
X -L c 


Leu 


will 


Arc[ 


Pro 


Arg 


Asp 


J *i .J 










350 






nla 


Tyr 


rne 


Cys 


Leu 


Asp 


Asp 


Oat- 










JOJ 










^» jr ^ 


rto 


l^eu 


Leu 


Lys 


Ser 


vaxy 








DO u 










oer 


Arg 


nlS 


pne 


Asn 




XT I lC 


Ser 






-J 17 -J 










400 


Ser 


Ser 


Asn 


x nr 


Cys 




Tic 
1 1c 


Tyr 




4 1 n 










/it 
j 




± xe 


Vdl 


oer 


Lys 


Leu 


Asn 


T I A 

lie 




4 ZD 
















Tl a 

i ie 


i nr 


jjys 


Asp 


Leu 


pne 


OlU 


inr 










4 4 c. 








Aia 


i nr 


Aia 




Aia 


pne 


Cys 


nis 








^ ^n 










rne 


Tyr 


rne 


oer 


vai 


Tyr 


Leu 


Pro 






*1 / D 










*i O \J 


inr 


i ie 


Lys 


Cys 


Ser 


GlU 


mr 


Asp 




a on 










^ 1?D 




l ie 


pne 


oer 


Pro 


pro 


oer 


Asp 


Asn 


DUD 










m n 






Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 










525 








Pro 


i ie 


Arg 


Asn 


Asn 


Asn 


i ie 


oer 








D4 U 










Gin 


cys 


Met 


Aia 


Cys 


Lys 


GlU 


Arg 






c. c 
DDD 










*i^n 


Asn 


Tyr 


Pro 


Asp 


Arg 


Gly 


Asn 


Gin 




5 /0 










ETC 

D /D 




GlU 


Asp 


pne 


pne 


Lys 


He 


Leu 


Asn 


DoD 










590 






Lys 


Leu 


Ser 


Arg 


Val 


Leu 


X 1 A 

lie 


Pro 










605 








Ser 


Lys 


Phe 


Cys 


Asp Arg 


Ser 


oer 


















Gly 


I le 


GlU 


Pro 


Val 


Ser 


mr 


Oa V 

oer 






DDD 










OflU 


Thr 


Lys 


Leu 


vai 


Leu 


Tyr 


Gly 


Lys 




oDU 










ODD 




C? A V- 

oer 


Aia 


i ie 


Lys 


Thr 


Ser 


til s 


Asn 


£ £ c 
DOD 










670 






C? A V 

oer 


G1U 


GlU 


T hp 

i»ys 


Asp Asn 


Lys 


inr 










685 








Lys 


Asn 


Ala 


pne 


Lys 


Asp 


He 


Leu 








/ uu 










Ala 


vai 


C A V- 

oer 


Tyr 


Phe 


Glu 


Gin 


Asn 






/ID 










720 


Leu 


Ser 


Asn 


Lys 


Trp 


Trp 


He 


Glu 




730 










735 




Glu 


Lys 


His 


Val 


Glu 


Asp 


Phe 


Tyr 


745 










750 






Ala 


Glu 


Ser 


Leu 


Lys 


Asp 


lie 


Phe 










765 








Lys 


Tyr 


Lys 


Thr 


Ala 


Lys 


Arg Ala 








780 










Thr 


Ser 


Thr 


Pro 


Ser 


Lys 


Lys 


Glu 






795 










800 


Thr 


Leu 


Ser 


Glu 


Phe 


His 


Ser 


Lys 




810 










815 
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Asp Glu 


Ala 


Thr 


Val 


Ser 


Ala 


Leu 


Leu 


Asp 


Lys 


Thr 


Met 


Leu 


Leu 


Gly 








820 










825 










830 






Ser 


Arg 


Thr 
835 


He 


Met 


Ser 


Glv 


Val 
840 


Arg 


Cys 


Val 


He 


Arg 
845 


Asn 


Asn 


Ser 


Val 


Phe 
850 


Ser 


Glv 


Phe 


Glu 


Asn 
855 


Lys 


Asn 


Thr 


Asn 


Asn 
860 


Asn 


Trp 


CZl n 
\J JL U 


Leu 


Glu 


He 


Arg 


His 


Tvr 


Val 


He 




Met 


Gly 


Gly Ala 


Ala 


Val 


Th r 
J. ill: 


Lys 


865 










870 










O "7 C 










880 


He 


Ser 


Asp 


Glu 


Asp 
885 


Leu 


Glu 


Gin 


Phe 


890 


Fro 


Val 


Arg 


Gly 


Al a 
t\d.d 

895 


VOX 




V Ct J. 


Thr 


Thr 


Ala 


Pro 


Asn 


Asp 


Lys 


Leu 


Pro 


Val 


Gly Ala 


Hie 


uin 








900 




















910 






Thr 


T rn 


Lys 
915 


Asp 


Glu 


Gin 


Thr 


Leu 
920 


Lys 


Th r 

X i IX 


Asn 


Thr 


Lys 
925 


Arg 


Asn 


Ser 


Leu 


Tyr 
930 


Asp 


Ser 


Tvr 


Asn 


Ser 
935 


Lys 


Arg 


Asn 


Asn 


Arg 
940 


Asp 


Asn 


Asn 


Lys 


He 


Lys 


Asn 


Arg 


Ser 


Leu 


Lys 


Leu 


Ser 


Asp 


Phe 


Asn 


Trp 


Arg 


Th r 

i nr 


Pro 


945 










950 










955 










960 


Asn 


He 


Ser 


He 


Gin 
965 


Glu 


Phe 


Asn 


Ala 


Asn 
970 


Lys 


Asp 


Asp 


Val 


Asn 
975 


Lys 


Lys 


Arg 


Tyr 


Ala 
980 


Glu 


Val 


Val 


Ala 


Ser 
985 


Ala 


Ala 


Pro 


Lys 


Ser 
990 


Pro 


Ser 


Pro 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Asn 


Ser 


Asn 


Ser 


Ser 


Ser 


Pro 


Pro 


Leu 






995 










1000 








1005 






Ser 


Pro 


Leu 


Ser 


Pro 


Thr 


Val 


Lys 


Asn 


Ser 


Asn 


Asn 


Lys 


Pro 


Leu 


Tyr 




1010 








1015 








1020 






He 


Pro 


Pro 


His 


Lys 


Arg 


Met 


Thr 


Thr 


Thr 


Ala 


Val 










1025 








1030 








1035 











<210> 206 
<211> 1082 
<212> DNA 
<213> SHRIMP 



<400> 206 

atgaagatat 

taccacgtga 

ctcctccact 

aacttgtgtc 

cacttggacg 

ggegegaage 

ccccacttaa 

cgcttggacg 

aagaacgtga 

gtgaagcagc 

cacttaaagg 

tgtctgttga 

gaegtaagag 

aagcagctcc 

aggaagaact 

aaggtgctct 

gtaagaggcg 

tttacgcacc 

aa 



gecagattte 
agcagctcct 
taaaggtgcg 
tgctgaagaa 
tgaaaagege 
agctccccca 
aggtgeaett 
taagaggege 
aaagcgcgaa 
tcctccactt 
tgcacttgga 
agaagaacgt 
gegegaagea 
cccacttaaa 
tgtgtctgct 
tggacgtaag 
cgaagcagca 
tgttgctttg 



ttcccctacc 
ccacttaaag 
cttggacgta 
gaacgtgaaa 
gaagcagctc 
cttaaaggtg 
ggaegtaaga 
gaagcagaat 
gcagctcccc 
aaaggtgcgc 
egtaagagge 
gaaaagcgcg 
gctcccccac 
ggtgctcttg 
gaagaagaac 
aggegegaag 
gcagcagcta 
cctctacatg 



ctcaccctga 
gtgcacttgg 
agaggegega 
agegegaage 
ccccacttaa 
cgcttggacg 
ggegegaage 
ccttggagga 
cacttaaagg 
ttggacgtaa 
gcgaagcaga 
aagcagctcc 
ttaaaggtgc 
gaegtaagag 
gtgaaaagcg 
cagctccccc 
tgcctccccc 
gagtatggaa 



gcatcccatt 
aegtaaaagg 
agcagaatcc 
agctccccca 
aggtgeaett 
tgaaaagege 
agctccccca 
agaacttgtg 
tgcacttgga 
gaggegegaa 
atccttggag 
cccacttaaa 
tcttggacgt 
gegegaagea 
cgaagcagct 
acttaaaggt 
tgaagacgat 
agcaccagaa 



ggagggcgtc 
cgtgaagcag 
ttggaggaag 
cttaaaggtg 
ggaegtaaga 
gaagcagctc 
cttaaaggtg 
tetgetgaag 
egtaaaagge 
gcagctcccc 
gaagaacttg 
ggtgctcttg 
aagaggcgcg 
gaatccttgg 
cccccactta 
gcacttggac 
ctcgacttct 
cctacaggtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1082 



<210> 207 
<211> 361 
<212> PRT 
<213> SHRIMP 
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<220> 

<221> VARIANT 

<222> (1) . . . (361) 

<223> Xaa = Any Amino Acid 



<400> 207 



Met 


Lys 


He 


Cys 


Gin 


He 


Ser 


Ser 


1 








5 








Leu 


Glu 


Gly 


Val 


Tyr 


His 


Val 


Lys 








20 










Leu 


Asp 


val 


Lys 


Gly Val 


Lys 


Gin 






JO 










40 


Asp 


Val 


Arg 


Gly Ala 


Lys 


Gin 


Asn 




c n 

jU 










DO 




Leu 


Lys 


Lys 


Asn 


Val 


Lys 


Ser 


Ala 


bo 










70 






His 


Leu 


Asp 


Val 


Lys 


Ser 


Ala 


Lys 










85 








Leu 


Asp 


Val 


Arg 


Gly Ala 


Lys 


Gin 








100 










Asp 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 


Leu 






115 










120 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 




130 










135 




Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 


145 










150 






Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 










165 








Asp 


Val 


Lys 


Gly 


Val 


Lys 


Gin 


Leu 








180 










Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 






195 










200 


Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 




210 










215 




Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 


225 










230 






Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 










245 








Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 








260 










Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 






275 










280 


Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 




290 










295 




Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


305 










310 






Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Gin 


Gin 










325 








lie 


Ser 


Thr 


Ser 


Phe 


Thr 


His 


Leu 








340 










Gly 


Lys 


His 


Gin 


Asn 


Leu 


Gin 


Val 






355 










360 



Pro 


Tnr 


Leu 


Thr 


Leu 


Ser 


T 1 a 

I le 


Pro 




1U 










-L o 




Gin 


Leu 


Leu 


His 


Leu 


Lys 


val 


His 


ZD 










30 






Leu 


Leu 


His 


Leu 


Lys 


Val 


Arg 


Leu 


















Pro 


Trp 


Arg 


Lys 


Asn 


Leu 


Cys 


Leu 








60 










Lys 


Gin 


Leu 


Pro 


His 


Leu 


Lys 


Val 






— 1 c 

/ O 












Gin 


Leu 


Pro 


His 


Leu 


Lys 


Val 


His 














95 




Leu 


Pro 


His 


Leu 


Lys 


Val 


Arg 


Leu 


105 










110 






Pro 


His 


Leu 


Lys 


Val 


His 


Leu 


Asp 










125 








His 


Leu 


Lys 


Val 


Arg 


Leu 


Asp 


Val 








14 0 










Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 






155 










160 


Leu 


Pro 


His 


Leu 


Lys 


Val 


His 


Leu 




170 










175 




Leu 


His 


Leu 


Lys 


Val 


Arg 


Leu 


Asp 


185 










190 






His 


Leu 


Lys 


Val 


His 


Leu 


Asp 


Val 










205 








Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 








220 










Leu 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 






235 










240 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 


Asp 




250 










255 




His 


Leu 


Lys 


Val 


Leu 


Leu 


Asp 


Val 


265 










270 






Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 










285 








Leu 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 








300 










Pro 


His 


Leu 


Lys 


Val 


His 


Leu 


Asp 






315 










320 


Gin 


Leu 


Cys 


Leu 


Pro 


Leu 


Lys 


Thr 




330 










335 




Leu 


Leu 


Cys 


Leu 


Tyr 


Met 


Glu 


Tyr 


345 










350 






Xaa 

















<210> 208 
<211> 816 
<212> DNA 
<213> SHRIMP 



<400> 208 

atggacgtcg agttcggttt cttccacggt ctgctctcca aggcccttct cccagatgaa 60 
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aaacatcaac 
ggctgctgct 
caactaatag 
tctaatcttt 
tcccattcta 
ttgcctcatt 
cttcaaccca 
gacgacaatg 
aaaactattg 
ggtaatgaat 
tacgacactg 
agcacgtgca 
aaacaagatg 



ccgttataag 
cgttctgtgg 
acgtttgttc 
caaccagttg 
gttttctgga 
ctttacgtgc 
tagaagaaga 
ttgatgatgg 
gagaaaaatc 
ttataaggat 
aagctaaatt 
catgtaagga 
gaatggataa 



gcgtctttgt 
aagaagagga 
ctttatagga 
ttctagacta 
tgtggtatat 
catttggaat 
ggatattggt 
tgaccaacaa 
aaccattgaa 
tcctttaaga 
tgttaaccac 
acgtcttatt 
accaacggac 



gcggatgatt 
acaggagaga 
actgtctcat 
caaaaaacgt 
ccaagtttga 
aaacaacttc 
tataaggatt 
gaacaaatga 
ctgataggca 
gaaactgcgt 
aaggactcta 
gatttttctg 
aagtag 



ctagaaataa 
gcaatactgc 
ctattggtac 
cagacagtta 
agaaaacaac 
caaagttgta 
atgttgtttc 
ttattgatga 
tgtataacaa 
tgcacgcaca 
tacctctatt 
agagacaact 



gggagaggat 
gtgccttgaa 
aattatcaat 
tgcggcatta 
tgaagacgta 
tgaaaaaact 
aattgaagac 
agaatcttat 
taacaagttt 
atctctgagg 
ttatgaaaac 
acaacaacta 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
816 



<210> 209 
<211> 269 
<212> PRT 
<213> SHRIMP 



<400> 209 



Met 


Asp 


Val 


Glu 


Phe 


Gly 


Phe 


Phe 


1 








5 








Leu 


Pro 


Asp 


Glu 


Lys 


His 


Gin 


Pro 








20 










Asp 


Ser 


Arg 


Asn 


Lys 


Gly 


Glu 


Asp 






35 










40 


Arg 


Gly 


Thr 


Gly 


Glu 


Ser 


Asn 


Thr 




50 










55 




Val 


Cys 


Ser 


Phe 


He 


Gly 


Thr 


Val 


65 










70 






Ser 


Asn 


Leu 


Ser 


Thr 


Ser 


Cys 


Ser 










85 








Tyr 


Ala 


Ala 


Leu 


Ser 


His 


Ser 


Ser 








100 










Leu 


Lys 


Lys 


Thr 


Thr 


Glu 


Asp 


Val 






115 










120 


Trp 


Asn 


Lys 


Gin 


Leu 


Pro 


Lys 


Leu 




130 










135 




Glu 


Glu 


Glu 


Asp 


He 


Gly 


Tyr 


Lys 


145 










150 






Asp 


Asp 


Asn 


Val 


Asp 


Asp 


Gly 


Asp 










165 








Glu 


Glu 


Ser 


Tyr 


Lys 


Thr 


He 


Gly 








180 










Gly 


Met 


Tyr 


Asn 


Asn 


Asn 


Lys 


Phe 






195 










200 


Leu 


Arg 


Glu 


Thr 


Ala 


Leu 


His 


Ala 




210 










215 




Ala 


Lys 


Phe 


Val 


Asn 


His 


Lys 


Asp 


225 










230 






Ser 


Thr 


Cys 


Thr 


Cys 


Lys 


Glu 


Arg 










245 








Gin 


Leu 


Lys 


Gin 


Asp 


Gly 


Met 


Asp 



2 60 



His 


Gly 


Leu 


Leu 


Ser 


Lys 


Ala 


Leu 




10 










15 




Val 


He 


Arg 


Arg 


Leu 


Cys 


Ala 


Asp 


25 










30 






Gly 


Cys 


Cys 


Ser 


Phe 


Cys 


Gly 


Arg 










45 








Ala 


Cys 


Leu 


Glu 


Gin 


Leu 


He 


Asp 








60 










Ser 


Ser 


He 


Gly 


Thr 


He 


He 


Asn 






75 










80 


Arg 


Leu 


Gin 


Lys 


Thr 


Ser 


Asp 


Ser 




90 










95 




Phe 


Leu 


Asp 


Val 


Val 


Tyr 


Pro 


Ser 


105 










110 






Leu 


Pro 


His 


Ser 


Leu 


Arg 


Ala 


He 










125 








Tyr 


Glu 


Lys 


Thr 


Leu 


Gin 


Pro 


He 








140 










Asp 


Tyr 


Val 


Val 


Ser 


He 


Glu 


Asp 






155 










160 


Gin 


Gin 


Glu 


Gin 


Met 


He 


He 


Asp 




170 










175 




Glu 


Lys 


Ser 


Thr 


He 


Glu 


Leu 


He 


185 










190 






Gly 


Asn 


Glu 


Phe 


He 


Arg 


He 


Pro 










2 05 








Gin 


Ser 


Leu 


Arg 


Tyr 


Asp 


Thr 


Glu 








220 










Ser 


He 


Pro 


Leu 


Phe 


Tyr 


Glu 


Asn 






235 










240 


Leu 


He 


Asp 


Phe 


Ser 


Gin 


Leu 


Gin 




250 










255 




Lys 


Pro 


Thr 


Asp 


Lys 









265 



<210> 210 
<211> 3813 
<212> DNA 
<213> SHRIMP 
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<400> 210 

atggtgctaa 

aaggaagatg 

attgcaagac 

tggaacgaaa 

gatatcatat 

gataaattgg 

gtggatgaaa 

aaggttatga 

acatctattt 

ctgaaaagcg 

cttccccacc 

gctggaacag 

tcctttagaa 

gaagaagaag 

ggacacaacg 

tatgactatt 

ggcacattca 

acagaattag 

tatggggagt 

gtggtagatt 

gtgtataaga 

ttggccgtta 

gaagagatgg 

cctgcagaag 

tctgtgttaa 

gtgttaaccg 

tttgaagcat 

aggaatactg 

ccccctagcc 

tccctgcaaa 

ggaggatttg 

tgtgatggag 

catgaaaaaa 

gtgcttcaga 

atctacgctg 

gggattgctc 

acggttgaag 

ctcaccacat 

gaaggactga 

ttaaggtctg 

aaaaatgaca 

tctttttcgt 

tttgctgcag 

ctcgacaaat 

agttatttga 

aaggctacag 

aactttggaa 

gccgctgtcg 

agccctaaag 

ggatgggcta 

gatgcatcat 

tcgaaatata 

ggatcttggg 

gatttgccta 

tcttccgttc 

gacgccgtgg 

atatttggag 

aaaactggcc 

gcagggtcgg 

cagactgaac 



ctctttcctg 
cggtgttggg 
ctccctcaat 
gtgggggaga 
ccttagtgga 
cagataaaac 
aattgacaat 
aaaaggaccc 
tattggacag 
ccgaaagagt 
atccgagcct 
tcatcgcatc 
aatggggtcc 
aagaagaaaa 
ggcaacatta 
tggcctacag 
atatggagga 
agtacaaaag 
ttttatgtta 
ctctagaaaa 
acactgctaa 
ggagtatggc 
aagaagaaga 
tagacttcct 
ataactttgt 
atgatactgt 
tatcttccca 
atatacaaag 
gtgtccgtgg 
cgtatggcgg 
aagataccga 
gaaaagtgct 
cagcagaaac 
ctgcaagtaa 
ttttgcgtga 
tcaaggataa 
gagtcaaggg 
atagtgcaga 
cagctgcaac 
atgcgaatct 
cctctcgttc 
cgtttgcaag 
agatgagaga 
tcgaagagac 
cagagaatcc 
attcagtatt 
aggttacaat 
ccctggtgca 
gtgtcatcag 
agccaaccaa 
tcttaactga 
aggcctacgc 
caacattcga 
aaggacaatc 
ttttgtccat 
tggggttggc 
tgcctcttgg 
tcattgtatt 
gcccgctaac 
aattcgaaga 



tacgactagg 
gaaccagttc 
agaatctttt 
aaaaattttc 
aagtgtatat 
atgtctaacc 
cttatcaggt 
caaaaaaata 
gttaaaaaca 
agaagtactt 
atttgataat 
agatatgcat 
ctgtggagat 
gaaacattcc 
tcataggccc 
gaatctccct 
tttacctttc 
gtttgtgcaa 
ctgtatcttc 
tagttccatg 
ttacaaaagg 
agaatttatc 
agaggaagaa 
ttcagtgcct 
ggagaacgat 
atcaggaaca 
tttattcctt 
aaccctattt 
aagcaatgtc 
tatagaagag 
catgtattcc 
aagtgtcacc 
ttttaagga t 
gaccggaaac 
aaatggcatt 
aacaaatcaa 
atttttcagc 
aaccttcaga 
agtggcagaa 
aggtctagaa 
tttattggaa 
ggacatgggc 
aacatacccc 
tataccagaa 
agaaagggct 
ggggaagaaa 
tgtagtaggg 
tgcgtccaga 
ttataaaatt 
gcacccgttc 
aaatggagca 
accaatctgt 
cgacccgcat 
cctctcctgc 
aggaaaggat 
gagcaaggca 
atttggtata 
ttcaatacta 
cctgaattgg 
cgggggagga 



cgcgtagcgt 
cccattttaa 
tctgcatcag 
gacatatctc 
gaacctgtat 
gcggctgcct 
agtgatggga 
gcagaatctc 
gccaagaaac 
catcggttga 
tttagtggag 
ttcaaattgg 
aaaactgaaa 
atatcaagat 
gaaagtgctt 
aatgagtaca 
cgccctttcg 
tcaacaaatc 
ggagcagatt 
atatcatttg 
ttggggaaga 
cgcactgaag 
gaagcggaag 
catttacgcc 
ctttctatat 
gatacggaca 
tcaagaatat 
tcaacgcacg 
aactttaata 
ttgcctgaaa 
ggagaggatg 
ttcaattgtg 
gatacggatt 
cttaacaaaa 
gagcgccccg 
ccccctcccc 
ggttttcgtg 
gatttaggcc 
acatccttct 
ttttcagagg 
gaaactaggg 
gtccaagtta 
gatgcagccc 
agtcaagtga 
ggcaaagaaa 
ctaggcaatg 
gcttctgtgg 
ggggcacatc 
gtggactttt 
agggaagaaa 
tatgtattcc 
ggaacaaaag 
tctgtattgc 
gataaaggga 
gttgcagagg 
atttcagctg 
gctgctacac 
ctagtggtca 
t ttggtgcaa 
aatcgttcaa 



ctagcaaggg 
agaaatcaaa 
tggaaaaaat 
agaatgaaga 
tttctaaatc 
ttgcagcact 
gtgtgcttca 
ttttaaataa 
ttctaagcag 
ataaactcaa 
gaaaaacatc 
ttgaacatat 
gcggggaaga 
tcgtgcttca 
ctgtttactt 
aattatcgtc 
cagtaccttc 
ttccccagct 
ggtacaaaca 
attctcagac 
aaagaaacgg 
cgcataaggc 
aagaagcgat 
gtaaaattcg 
tggtttctaa 
actttggttc 
tggatgaagt 
tgtctctgtc 
ataacgctgg 
atgtactagt 
ttgttgtcgt 
gtgataattt 
tagttgaacg 
aagcatattc 
gggacgattt 
ctgcaagaag 
acattttgga 
aaggcatagt 
ctgaaggttt 
acgccaaaac 
cattaagagc 
gtgccgattt 
ttgaacaaaa 
agaaactaaa 
ttaacgacac 
cagttacagt 
tggccgggtt 
tcaacgtcgt 
cttgtgcaga 
tagaccatgt 
ctgaagacgg 
atgctgctca 
cttgggtggc 
tgtccacttt 
ctatttttga 
taataaataa 
gcctcaatcc 
tactgatagt 
agaattcagc 
aaatagtatt 



gaatttctct 
caacttgtca 
attcagggaa 
agaatggatg 
acttaaacct 
agcttctgcc 
acgtacaaca 
tgaaaaatgg 
acgaggtgca 
ggaggctcct 
agcagtatct 
ttttaaggtc 
agaagatgag 
atttatgaac 
ttgtgattat 
aatgcatcct 
aacttataag 
aagtttcgac 
cctgggggat 
attgagtggt 
aatagccgat 
attgacagca 
ggaccaggag 
tcaagctgtt 
cttcaagaat 
tagtggagaa 
gcacattctt 
ggataaatct 
gaacatttct 
cggtttgtcc 
atgggatggt 
tatccagctc 
aataagagat 
aaggaagaac 
tacagaaaag 
tgccaagata 
gacgagggcg 
aaaagagacc 
agctgaaagt 
ggttgtattc 
aaacaatact 
agatgctgaa 
cttgaaagat 
gaaaatagac 
tgaactgtca 
gttgatgaac 
tttaggtcca 
ggaccacacc 
tagaaacacc 
tatcgcacta 
aggacccaaa 
aggagaatgt 
aagcatgaaa 
aaaggcagtt 
ggttgcagag 
ccccttgttc 
atccaactgg 
tcgcttcttt 
taaaaggaaa 
ggcagaaaag 



60 
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300 

360 
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540 
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720 

780 

840 

900 
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1200 
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3600 



WO 01/38351 



295 



PCT/US00/28888 



gacaacgcca atagtaaact tcaatcgagg aggaatgaaa ctgggcccat gagattagag 3660 

gagcttcctg ggcatgaaga tttgcgccca gttttcttcc ctgccacaac aaattattcc 3720 

aaatctgcca agattctggg ctacaaatct aaacccttca acgactttta tacaaaaata 3780 

ataaacacag acatcataaa aatggatagg taa 3813 



<210> 211 
<211> 1264 
<212> PRT 
<213> SHRIMP 
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His 
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Val 


Val 
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Ser 
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Glu 


45 
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Asn 


Glu 
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Glu 
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Trp 


Met 
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He 


Leu 
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Met 
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Lys 
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Trp 
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Leu 
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Arg 
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Asn 
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Val 


He 
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255 




He 
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Thr 
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He 
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400 
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Lys 


Asn 


Thr 


Ala 
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Glu 
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Ser 
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His 
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4 65 
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Asn 
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Phe 


Val 


Glu 
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Asp 
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Lys 


Asn 


Val 


Leu 
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Asp 


Asp 
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Phe 
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Glu 


Phe 


Glu 
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Asp 
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Val 


His 




530 
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Thr 
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Ser 
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Val 
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Val 
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Ser 


Asn 


Val 
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He 


Ser 


Ser 
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Thr 
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Gly 








580 










Val 


Leu 


Val 


Gly 


Leu 


Ser 


Gly 


Gly 
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Gly 


Glu 


Asp 


Val 


Val 


Val 
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Trp 
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Thr 


Phe 


Asn 


Cys 


Gly 
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Ala 
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64 5 








Arg 
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Val 
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Thr 


Ala 


Ser 
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Asn 


He 
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Gly 
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Thr 
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Gin 


Pro 


Pro 


Pro 


Pro 
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Glu 


Gly 


Val 


Lys 


Gly 


Phe 


Phe 


Ser 
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Leu 


Thr 


Thr 


Tyr 


Ser 


Ala 


Glu 


Thr 








740 










Val 


Lys 


Glu 


Thr 


Glu 


Gly 


Leu 


Thr 






755 
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Phe 


Ser 


Glu 


Gly 
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Ala 


Glu 


Ser 
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Leu 


Glu 
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Ser 


Glu 


Asp 


Ala 


Lys 
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Ser 
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Ser 


Leu 


Leu 


Glu 


Glu 


Thr 
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Ser 


Ser 
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Ala 
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Ala 


Ala 


Glu 
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Glu 
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Asp 


Phe 


Tyr 


Thr 


Lys 


He 


He 


Asn 


Thr 


Asp 


He 


He 


Lys 


Met 


Asp 


Arg 



1250 1255 1260 



<210> 212 
<211> 3744 
<212> DNA 
<213> SHRIMP 

<400> 212 

atgggcgcgc ctaccaacgc 
tatcttgtta accctggagc 
tctgactctt ttgtgtacaa 
tgtcccttta acgagccagc 
ctaattaaca gatctcgttt 
tttcgctatt ctggaggaag 



tgattttaca cgcacggtat 
tccttccgat agagaaaagt 
ctacaaggat gcagtggtga 
tcttcatgag cacatcatga 
tgtgtgtaac cctgttaaat 
tactccccag aacttaattt 



caggagtagc ttcttctctc 60 
tagttttggc ctcttcttat 120 
ccgctgaggc tcccaagtgg 180 
acagacttga aaaagctggt 24 0 
cggctggaga agtatgcgga 300 
tcccgattgg agcatcagag 360 
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ataaattggt 
aaagacgcca 
aagaatgcaa 
agatcagcgg 
gaggtaagaa 
catttctgta 
ttctggacga 
aatgaaacta 
gtaaaaaacc 
gctgccatca 
attaaaacat 
gggaatttgt 
aaagcaggca 
catttaatag 
cttgctcttt 
ctagaagtga 
gcctacaaga 
gctacaattt 
gtacctctcc 
gtggatatgg 
ttacatagaa 
agactaagag 
gaagaagagg 
gaagaagatg 
gagggtgaag 
tcctccactc 
aagggaaaaa 
aacgatgacc 
aaatatgtta 
gttacaaatc 
cacgcaaata 
tactatgaaa 
attgagcggg 
gtagactcta 
ccttcattat 
ttattcacaa 
gatcttggaa 
tgtctagtgg 
ctagaaaagg 
gaattgaacg 
ccagaacctg 
gccatggaaa 
ttttgcaaga 
tttattgacc 
aaaaaataca 
aataaatggg 
cctgttgatt 
ccttcacgga 
accgagagca 
gtgttggaag 
ttgggaggag 
gaagacgttc 
atagaacaac 
tttgcttcat 
aattctgagg 
caattattgg 
aacagtgtca 

<210> 213 
<211> 1245 
<212> PRT 



tggtcatact 
tcgaacgcat 
tgaggagact 
gcatgattcg 
gaaggcaact 
aagaatcaac 
tcgttgaagc 
atgccatccc 
aacgcgacac 
aagaacatga 
gtctaacaga 
ccccggccca 
tatgggttat 
ggtacgaagc 
ttatatctca 
tgaaagaaag 
acaatactaa 
atggtcaaat 
ataaatcgag 
gtttggcaaa 
gaaaaatgca 
gtgaagaaga 
aagaagaagc 
atggtgagga 
atatgaatgg 
aacaacctcc 
ggttaaagga 
gccaagtgaa 
aaggtggaaa 
tactagattc 
ggcgatcaga 
gaaacgtatc 
caagagaaag 
acaaccctga 
caggggtaga 
gtcttgtcac 
gatttatcgt 
acggagatat 
gtacagtgaa 
tggccactgc 
gaatcaagca 
agggtcgagg 
tgctgaagtg 
ctgcgtcacg 
agaacattga 
tgggaactgg 
tttacacaat 
aaccaaatga 
gggacgcttc 
tcaacggaga 
aagaagaaga 
aagtacagga 
atcaacaggg 
taccagctga 
agggggaaaa 
atttaataga 
ccattcaaga 



tagaaatgct 
tcctgatcta 
tcgtgtatgg 
ctacgaaccc 
aaaaggaagt 
tcttctggct 
tgtcatccgg 
tgaagactct 
aaatcaagga 
aagcatagga 
ggagcaacga 
tctaacaaat 
tatgcagagt 
tcaacgactt 
gaggggaatt 
agctgcaaat 
cgagtctaat 
gtctgattgt 
gatgaacatg 
gatacgtgta 
agaaaaggca 
agaagaagaa 
tggaacatca 
ggaggaggag 
ggaaaataac 
ccaaaagcgt 
aagagataat 
tgtggagagc 
atgtggagat 
agtgttccat 
aactggcgta 
aaaagacagt 
ggatgcttcc 
agatgtggaa 
agattctctc 
caatccaatt 
attaaataac 
gcctatgctt 
aaacacgaga 
tcaatctgca 
aatgttactc 
agggagatcg 
tttattcttc 
tgccacctta 
ctgggtaaag 
agaatacact 
tttgaaatac 
cgtgtattac 
atgtgaaagc 
cggacgtgtc 
agaggaaata 
cagtaatcta 
tggagaagaa 
taacgattcc 
aagtccagaa 
tagcgatgga 
atga 



gctcgttttg 
agagaaggtg 
cgagccttta 
ttttcggttg 
tatgagcgaa 
gatatttcaa 
tacaagaata 
tctataaact 
gatgattcta 
gaaaaaagga 
gaaatgattt 
ctggccgatg 
cttttaaagc 
ttaatgttca 
ggtgacgtgt 
gcaaaaataa 
aatttgggca 
aatgcaattc 
caagacattg 
gcggaagaaa 
gccagggagc 
gaggaggagg 
ggtgttaatg 
gaagaggaag 
tcaagaaaac 
cagcgaggta 
attggaggat 
atacaaaaac 
aatggactac 
ttcagaaagg 
tacaccacaa 
agtaataata 
tgtgccgaat 
caacgtatgc 
gctatcgaaa 
ttcaatgccg 
attgtaaaat 
ctagactcaa 
aaatttttca 
ggacaccaat 
cctgattgta 
gcccttcaca 
aatattgacc 
tttaggcttg 
gatttactag 
aacattggac 
acaatgattg 
agtactattg 
taccgcccga 
ccttaccctt 
acaggcgtta 
ttcgatgtgg 
gaggaacttc 
tcctcacccg 
ccgtacaata 
agaaacaata 



gtacagtggc 
gtacaagtaa 
actggatagc 
gctgtgctct 
atgtactatt 
gagggggaag 
gacatgctcg 
tggagtggga 
cgttagaaaa 
aaaaacatat 
ttaagggagt 
cgatactggc 
aaatcaattt 
agttgtatat 
ttctgaatgg 
gagacatggt 
attattccca 
ctctatctat 
aaaagaccat 
atagagtagc 
gagcagctgc 
aggaagaggg 
gttctggtta 
aagaggatga 
gtaaaaacac 
aaaatgcgcc 
ttttattagc 
ttttgacggc 
cagaattgtt 
ggtctattct 
aggcaaactg 
caccccattc 
ctaacaaacg 
gggaattgat 
gagtacttca 
tgttaggtgc 
ttatgaatat 
gaggcaagac 
aaccaaatat 
acatgaacgc 
taatgaaact 
gacagaaatg 
cttcaaatgc 
atgatctttg 
accctgtaat 
gtgactcaaa 
atgatggtgt 
aaagagcgga 
ccttgtttga 
ccagtgaaac 
ttgacgacag 
aattattcga 
cttcagcaat 
cgcatattcc 
tttttgattc 
accctaaaag 



agcatcggcc 
acatgtggca 
ggaagcctcc 
atacgatcac 
tcttggtgac 
aagttctgat 
aacaatcagt 
ggatgttcta 
gaccttagaa 
cctagagttt 
tggaggaaaa 
taataatgcc 
ctctatactc 
gcctgcatta 
tgtgtttaac 
ttctcgagac 
attcgatatc 
taatattgga 
acaacacata 
tagacgtctt 
gagacagaga 
agaagaaatg 
tgatcaagaa 
agaagatagt 
tggtaatact 
catttcaacc 
cacaattcaa 
caggcaaaga 
aattgaaaag 
taacagcata 
tatttgtgat 
atctgaatgt 
cccttgtcct 
catggaccct 
aaatgagata 
tgaaaaagga 
gactattgct 
aaaaaacctt 
gactgcagca 
tggtcattgt 
gaaatccatc 
tgaccatgcc 
tgcagataca 
tagggatagg 
gaagggaaca 
cgtggccgcc 
aataagtgtt 
tgacttactc 
cgccagagcg 
cgtagaagac 
tactgaaata 
tatccctgaa 
atctgaagtg 
ctcatttggg 
tgccctcgac 
agtcgactgg 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3744 
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<213> SHRIMP 



<400> 213 



net 


Gly 


Ala 


Pro 


Thr 


Asn 


nJ.d 


Asp 


J. 








5 








Aid 


Ser 


Ser 


Leu 


Tyr 


Leu 


XT 1 

vai 


Asn 








on 










Lys 


Leu 


Val 


Leu 


Ala 


Ser 


Ser 


Tyr 






35 










a n 


Lys 


Asp Ala 


Vdi 


Val 


Thr 


Ala 


uXU 




50 










c c 
3D 




oxU 


Pro 


Ala 


Leu 


His 


Glu 


fllS 


Ti- 
ne 


bo 










70 






Leu 


He 


Asn 


Arg 


Ser 


Arg 




vai 










85 








IjlU 


Val 


Cys 


Gly 


Phe 


Arg 


Tyr 










100 










lie 


Phe 


Pro 


He 


Gly Ala 


ber 


GlU 






115 










TOO 

12U 


Asn 


Ala 


Ala 


Arg 


Phe 


Gly 


mr 


vai 




130 










IOC 




Glu 


Arg 


He 


Pro 


Asp 


Leu 


Arg 


GlU 


1 HO 










150 






Lys 


Asn 


Ala 


Met 


Arg 


Arg 


Leu 


Arg 










165 








Aia 


Glu 


Ala 


Ser Arg 


Ser 


a J. a 


Gly 








180 










vai 


Gly Cys 


Ala 


Leu 


Tyr 


Asp 


HIS 






195 










200 


Gly 


Ser 


Tyr 


Glu 


Arg 


Asn 


val 


Leu 




210 










215 




Glu 


Ser 


Thr 


Leu 


Leu 


Ala 


Asp 


He 


225 










230 






Phe 


Trp 


Thr 


He 


Val 


Glu 


Ala 


Val 










245 








Arg 


Thr 


He 


Ser 


Asn 


Glu 


Thr 


Asn 








260 










Asn 


Leu 


Glu 


Trp 


Glu 


Asp 


val 


Leu 






275 










i o r\ 


Gin 


Gly Asp 


Asp 


Ser 


Thr 


Leu 


Glu 




290 










one 




Glu 


His 


Glu 


Ser 


He 


Gly 


Glu 


Lys 












310 






inr 


Cys 


Leu 


Thr 


Glu 


Glu 


bin 


Arg 










325 








Gly 


Lys 


Gly 


Asn 


Leu 


Ser 


Pro 


Ala 








340 










He 


Leu 


Ala 


Asn 


Asn 


Ala 


Lys 


Ala 






355 










360 


Leu 


Leu 


Lys 


Gin 


He 


Asn 


Phe 


Ser 




370 










375 




Ala 


Gin 


Arg 


Leu 


Leu 


Met 


Phe 


Lys 


385 










390 






Leu 


Phe 


He 


Ser 


Gin 


Arg 


Gly 


He 










405 








Phe 


Asn 


Leu 


Glu 


Val 


Met 


Lys 


Glu 








420 










Asp 


Met 


Val 


Ser 


Arg 


Asp 


Ala 


Tyr 






435 










440 


Asn 


Leu 


Gly 


Asn 


Tyr 


Ser 


Gin 


Phe 




450 










455 





Phe 


Thr 


Arg 


Thr 


Val 


Ser 


Gly 


Va 1 
vox 




X \J 














Pro 


j. y 


Ala 
nxa 


Pro 


Co y 


Asp 


Arg 


(Z1 n 
uiu 


£. J 
















Cor 
JCi 


Asp 






Va 1 
v a. j. 


lyr 


Asn 


Tyr 


















Ala 


Pro 


Lys 


Trp 


Cys 


Pro 




Asn 








ou 










nee 


Asn 


Arg 


Leu 


blli 


Lys 


Aia 


ijiy 






/ D 










oU 


Cys 


Asn 


P-r-r* 
ir X U 


Vdl 


Lys 




Aia 


uiy 




on 














Gly 


ox jr 


Oca. 


Thr 

X UX 


It ro 




Asn 


Leu 












11U 






Tip 


Asn 


* r P 


Leu 


Val 


Tip 


Leu 


Arg 










125 








M.J- a 


nld 


Ser 


Ala 


Lys 


Asp 


Aia 


ne 








140 










/"* 1 * * 

Giy 


Gly 


Tnr 


Ser 


Lys 


His 


val 


Ala 
















1 oU 


Vdl 


Trp 


Arg 


Ala 


Phe 


Asn 


Trp 


lie 




l /u 










1 /b 




Met 


T 1 A 

lie 


Arg 


Tyr 


Glu 


Pro 


Pne 


Ser 


loo 










1 yu 






/IT,, 


Val 


Arg 


Arg 


Arg 


ri n 

tsin 


Leu 


Lys 










205 








Ok jcw 

rne 


Leu 


uiy 


Asp 


His 


Phe 


Cys 


Lys 








220 










Ser 


.Arg 


Gly 


Gly Arg 


Ser 


Ser 


Asp 






235 










240 


He 


Arg 


Tyr 


Lys 


Asn 


Arg 


His 


Ala 




250 










255 




Ala 


He 


Pro 


Glu 


Asp 


Ser 


Ser 


He 


o c c 










270 






vai 


Lys 


Asn 


Gin 


Arg 


Asp 


Thr 


Asn 










285 








Lys 


i nr 


Leu 


Glu 


Ala 


Ala 


He 


Lys 








300 










Arg 


Lys 


Lys 


His 


He 


Phe 


He 


Lys 






"J1 c 












Glu 


Met 


T 1 a 

lie 


Phe 


Lys 


Gly Val 


Gly 




0 ^ 










335 




U-i e 


Leu 


Thr 

1 IIX 


Asn 


Leu 


Ala 


Asp 


rtXa 


345 










350 






Gly 


He 


Trp 


Val 


He 


Met 


Gin 


Ser 










365 








He 


Leu 


His 


Leu 


lie 


Gly 


Tyr 


Glu 








380 










Leu 


Tyr 


Met 


Pro 


Ala 


Leu 


Leu 


Ala 






395 










400 


Gly 


Asp 


Val 


Phe 


Leu 


Asn 


Gly 


Val 




410 










415 




Arg 


Ala 


Ala 


Asn 


Ala 


Lys 


He 


Arg 


425 










430 






Lys 


Asn 


Asn 


Thr 


Asn 


Glu 


Ser 


Asn 










445 








Asp 


He 


Ala 


Thr 


He 


Tyr 


Gly 


Gin 



460 
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Met 


Ser 


Asp 


Cys 


Asn 


Ala 


He Pro 


Leu 


Ser 


He 


Asn 


He 


Gly 


Val 


Pro 


465 










470 








475 










480 


Leu 


His 


Lys 


Ser 


Arg 


Met 


Asn Met 


Gin 


Asp 


He 


Glu 


Lvs 


Thr 


He 


Gin 










485 








490 










495 




His 


lie 


Val 


Asp 


Met 


Glv 
oxy 


Leu Ala 


Lys 


He 


Arg 


Val 


Ala 


Glu 


Glu 


Asn 








500 








505 










510 






Arg 


Val 


Ala 


Arg 


«x y 


Leu 


Leu His 


Arg 


Arg 


Lvs 


Met 


Gin 


Glu 


Lvs 


Ala 






515 








520 










525 








Ala 


Arg 


Glu 


Arg 


Ala 


Ala 


Ala Am 


Gin 


Arg 


Arg 


Leu 


Arg 


Glv 

oxy 


Glu 


Glu 




530 










535 








540 










Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu Glu 


Glu 


Glu 


Glv 

oxy 


Glu 


Glu 


Met 


Glu 


Glu 


545 










550 








555 










560 


Glu 


ox u 


ox u 


Glu 


Ala 


Gly 


Thr Ser 


Gly 


Val 


Asn 


Glv 


Ser 


Glv 

oxy 


Twr 
x yx 


Asp 










565 








570 










575 




Gin 


Glu 


Glu 


Glu 


Asp 


Asp 


Gly Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 








580 








585 










590 






Glu 


Asp 


Glu 


Glu 


Asp 


Ser 


Glu Gly 


Glu 


Asp 


Met 


Asn 


Glv 


Glu 


Asn 


Asn 






595 








600 










605 








Ser 


Arg 


Lys 


Arg 


Lys 


Asn 


Thr Gly 


Asn 


Thr 


Ser 


Ser 


Thr 


Gin 


Gin 


Pro 




610 










615 








620 










Pro 


Gin 


Lys 


Arg 


Gin 


Arg 


Gly Lys 


Asn 


Ala 


Pro 


He 


Ser 


Thr 


Lys 


Glv 


625 










630 








635 










640 


Lys 


Ar 5 


Leu 


Lys 


Glu 


Arg 




He 


Gly 


Glv 
oxy 


Phe 


Leu 


Leu 


Ala 


Thr 










645 








650 










655 




lie 


Gin 


Asn 


Asp 


Asp 


Arg 


Gin Val 


Asn 


Val 


Glu 


Ser 


He 


Gin 


Lys 


Leu 








660 








665 










670 






Le 
eu 


Thr 


Ala 


Arg Gin Arg 


Lys Tyr 


Val 


Lys 


Gly 


Glv 
oxy 


Lys 


Cys 


Gly 


Asp 






675 








680 










685 








Asn 


ox y 


Leu 


Pro 


Glu 


Leu 


Leu He 


Glu 


Lys 


Val 


Thr 


Asn 


Leu 


Leu 


Asp 




con 










695 








700 










Car 
OCX 


Ua 1 
val 


rue 


His 


Phe 


Arg 


xiy o O-L y 


OCI 


lie 


Leu 


Asn 


Ser 


I le 


His 


Ala 


70S 










710 








715 










720 


Asn 


Ar g 


Arg 


Ser 


Glu 


Thr 


uiy vai 


lyr 


X 11X 


Thr 


x*y a 


Ala 


As n 




He 










725 








7 n 
















Asp 


Tyr 


Tyr 


Glu 


Arg 


noil vai 


OCX 


Lys 


Asp 


Ser 


Ser 


Asn 


Asn 


Thr 








740 








745 










750 






r x \j 


Hie 
n x o 




Ser 


Glu 


Cys 


Tip r;iii 

x x c uxu 




Al A 
nxa 


Arg 


Glu 


Arg 


Asp 


Ala 


Ser 






755 








760 










7 65 








y o 


A 1 a 
nia 


Glu 


Ser 


Asn 


Lys 


Arg Pro 


Cys 


Pro 


Val 


Asp 


Ser 


Asn 


Asn 


Pro 




770 










775 








780 










Glu 


21 cn 


Val 


Glu 


Gin 


Arg 


Met Arg 


Glu 


Leu 


lie 


Met 


Asp 


Pro 


Pro 


Ser 


785 










790 








795 










800 


Leu 


v- 

OC L 


w X y 


Val 


Glu 


Asp 


Ser Leu 


Ala 


Tip 
X xc 


Glu 


Arg 


Val 


Leu 


Gin 


Asn 










805 








810 










815 




Glu 


lie 


Leu 


Phe 


Thr 


Ser 


Leu Val 


Thr 


Asn 


Pro 


He 


Phe 


Asn 


Ala 


Val 








820 








825 










830 






Leu 


Gly 


Ala 


Glu 


Lys 


Gly 


Z\ cn T ,<=m i 


Gly 


Arg 


Phe 


He 


Val 


Leu 


Asn 


Asn 






835 








840 










845 








T 1 ^ 


V CI X 


Lys 


Phe 


Met 


Asn 


Met Thr 


Tip 
x x c: 


Ala 


Cys 


Leu 


Val 


Asp 


Gly 


Asp 




850 










855 








860 










Met 


Pro 


Met 


Leu 


Leu 


Asp 


Ser Arg 


Gly 


Lys 


Thr 


Lys 


Asn 


Leu 


Leu 


Glu 


865 










870 








875 










880 


Lys 


Gly 


Thr 


Val 


Lys 


Asn 


Thr Arg 


Lys 


Phe 


Phe 


Lys 


Pro 


Asn 


Met 


Thr 










885 








890 










895 




Ala 


Ala 


Glu 


Leu 


Asn 


Val 


Ala Thr 


Ala 


Gin 


Ser 


Ala 


Gly 


His 


Gin 


Tyr 








900 








905 










910 






Met 


Asn 


Ala 


Gly 


His 


Cys 


Pro Glu 


Pro 


Gly 


He 


Lys 


Gin 


Met 


Leu 


Leu 






915 








920 










925 








Pro 


Asp 


Cys 


He 


Met 


Lys 


Leu Lys 


Ser 


He 


Ala 


Met 


Glu 


Lys 


Gly 


Arg 




930 










935 








940 










Gly 


Gly 


Arg 


Ser 


Ala 


Leu 


His Arg 


Gin 


Lys 


Cys 


Asp 


His 


Ala 


Phe 


Cys 



WO 01/38351 



301 



PCT/US00/28888 



945 










950 










955 










960 


Lys 


Met 


Leu 


Lys 


Cys 
965 


Leu 


Phe 


Phe 


Asn 


He 
970 


Asp 


Pro 


Ser 


Asn 


Ala 
975 


Ala 


Asp 


Thr 


Phe 


He 
980 


Asp 


Pro 


Ala 


Ser 


Arg 
985 


Ala 


Thr 


Leu 


Phe 


Arg 
990 


Leu 


Asp 


Asp 


Leu 


Cys 


Arg 


Asp 


Arg 


Lys 


Lys 


Tyr 


Lys 


Asn 


He 


Asp 


Trp 


Val 


Lys 






995 










1000 








1005 






Asp 


Leu 


Leu 


Asp 


Pro 


Val 


Met 


Lys 


Gly 


Thr 


Asn 


Lys 


Trp 


Val 


Gly 


Thr 




1010 








1015 








1020 








Gly 


Glu 


Tyr 


Thr 


Asn 


He 


Gly Arg 


Asp 


Ser 


Asn 


Val 


Ala 


Ala 


Pro 


Val 


1025 








1030 








1035 








1040 


Asp 


Phe 


Tyr 


Thr 


He 


Leu 


Lys 


Tyr 


Thr 


Met 


He 


Asp 


Asp 


Gly 


Val 


He 










1045 








1050 








1055 


Ser 


Val 


Pro 


Ser 


Arg 


Lys 


Pro 


Asn 


Asp 


Val 


Tyr 


Tyr 


Ser 


Thr 


He 


Glu 








1060 








1065 








1070 




Arg 


Ala 


Asp 


Asp 


Leu 


Leu 


Thr 


Glu 


Ser 


Arg 


Asp 


Ala 


Ser 


Cys 


Glu 


Ser 






1075 








1080 








1085 






Tyr 


Arg 


Pro 


Thr 


Leu 


Phe 


Asp 


Ala 


Arg 


Ala 


Val 


Leu 


Glu 


Val 


Asn 


Gly 




1090 








1095 








1100 








Asp 


Gly Arg 


Val 


Pro 


Tyr 


Pro 


Ser 


Ser 


Glu 


Thr 


Val 


Glu 


Asp 


Leu 


Gly 


1105 








1110 








1115 








1120 


Gly 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


He 


Thr 


Gly 


Val 


He 


Asp 


Asp 


Ser 


Thr 










1125 








1130 








1135 


Glu 


He 


Glu 


Asp 


Val 


Gin 


Val 


Gin 


Asp 


Ser 


Asn 


Leu 


Phe 


Asp 


Val 


Glu 








1140 








1145 








1150 




Leu 


Phe 


Asp 


lie 


Pro 


Glu 


He 


Glu 


Gin 


His 


Gin 


Gin 


Gly 


Gly 


Glu 


Glu 






1155 








1160 








1165 






Glu 


Glu 


Leu 


Pro 


Ser 


Ala 


He 


Ser 


Glu 


Val 


Phe 


Ala 


Ser 


Leu 


Pro 


Ala 




1170 








1175 








1180 








Asp 


Asn 


Asp 


Ser 


Ser 


Ser 


Pro 


Ala 


His 


He 


Pro 


Ser 


Phe 


Gly 


Asn 


Ser 


1185 








1190 








1195 








1200 


Glu 


Glu 


Gly 


Glu 


Lys 


Ser 


Pro 


Glu 


Pro 


Tyr 


Asn 


He 


Phe 


Asp 


Ser 


Ala 










1205 








1210 








1215 


Leu 


Asp 


Gin 


Leu 


Leu 


Asp 


Leu 


He 


Asp 


Ser 


Asp 


Gly 


Arg 


Asn 


Asn 


Asn 








1220 








1225 








1230 




Pro 


Lys 


Arg 


Val 


Asp 


Trp 


Asn 


Ser 


Val 


Thr 


He 


Gin 


Glu 









1235 1240 1245 



<210> 214 
<211> 2946 
<212> DNA 
<213> SHRIMP 



<400> 214 

atggaagaaa 

aatattacca 

gtcaacattt 

gaactgaata 

gaccacataa 

gatgataaaa 

tcgcttgcat 

tgcgtaaaaa 

gctggattaa 

ggtaaaagcg 

aataaaagag 

cttgacgatt 

gatataaatg 

ataataggac 

aaagtcaagg 

gaagatgaat 



caataaccct 
gaatagcgtc 
ctggagctcc 
ga tactggac 
cgagggatga 
atggactata 
ctttttccca 
ttccccacga 
gtgggccttt 
gta tagatgg 
caactga tat 
ctgaaatgtc 
tgtccagttc 
aacatctgat 
aagaggaaaa 
atgaagaagg 



aaaagagtcg 
taattacgtc 
tctaactgct 
caaaactaac 
agatactggt 
tcagtctgcc 
tggagctgat 
cactcgcgct 
taagagatgg 
cctttcaggt 
tttacatttg 
tcgtactttt 
aaggcaagtg 
cgaatttaga 
tggtgagcat 
aggttgccta 



actggaacag 
cgagccttta 
gagaagaatc 
gtgttatgca 
acaataacac 
gttttaatgc 
ttggtttcca 
gaatctttac 
agcattaact 
tccatgttga 
gtgaataatg 
aaccaccaaa 
aaccagcgta 
accaagcaac 
gaagaaatga 
tcagatattg 



tgtcaccatt 
ctgacacgtg 
caagcgctat 
acccactctt 
taaaattcaa 
tggctctcga 
ataaaagtga 
tgaataatgt 
acaaggctgc 
cagtactaaa 
tttctgcctc 
agaaagttgg 
atttactcca 
ttgaacgcgc 
caagtgagga 
acgaggaaga 



caagaatgaa 
gtctcatttg 
cccagctaac 
taaattggag 
aatgtatata 
ttcgttcgtt 
aaacaaattc 
tggattcccc 
aaacttgagc 
aaataacacc 
tgcacaacaa 
agtttgttat 
ccatcagaat 
acaaaataaa 
ggaagaagag 
tttctatgag 
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780 
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gatggttacg 
gaagatgaag 
acttgtggtg 
caaaattcaa 
aagggggagt 
agcctaggta 
atgaagggga 
cgtataaggt 
tcgagttgga 
aaggaagaag 
gtatctgaac 
catttggctc 
ctcaatttga 
ataccagaag 
gggctcctct 
ggcgcctttg 
agagatggga 
agaagtattc 
tctagttctt 
agcaagcgtg 
tcatcctcgg 
acgacaatca 
aaaaacatgt 
ttcaaggcgc 
actttattgg 
gacgtcaatg 
tctagggtgt 
aggaacaaac 
ggcgactttt 
caacaagata 
agagtaagcg 
actgtatctc 
cactctcttt 
ctataa 



atgaagaaga 
aggatgaaga 
ctaatggtgt 
aaaggaagaa 
gctctgctaa 
taaagccagt 
gtgaagctga 
cactactcaa 
aatgggctaa 
ttgaggttaa 
tccctagtaa 
tgattgaaga 
ttaaaactat 
aagtgctcag 
ctgtacaatt 
cgacgtcatg 
gtaacacgtg 
gcaacagagg 
cttgtggagg 
gtataaccct 
ccatgatctc 
tccaagattt 
ctgacaagag 
tcctaacatg 
catctggaga 
atgcccacca 
ataaaagagc 
cccagtttac 
cttccgagtt 
gctgcacaga 
gttcttccac 
cgaacgaaaa 
tcaacgctct 



gggcgatgac 
agaagaatat 
tattgattgt 
aaatggtaaa 
cactttatcc 
agggtgtccg 
cagctgttat 
taaatactct 
ccctcccgac 
gtttgaaatt 
tattaaagta 
ctttcttcca 
gacgagcatt 
ttttattcct 
tgatcgtata 
cttgaacaac 
gctccacacg 
agtgaatatt 
gaataagggc 
gaaaccacca 
attgcctcag 
ctcagaagtt 
caaagacgtg 
cacagtcaac 
gggtgtggta 
gttccaggaa 
ccagcgccga 
tatagcagtc 
aaatccaacg 
agacgacgac 
cacagccgga 
gaggaactct 
atctgcaaaa 



aatagaacta 
gacgatgaag 
gaagacgatg 
aaaacaaaca 
tttgtggaaa 
cccccatcca 
aatacttgtc 
gttaaagatt 
cgacggttcg 
gaatgtgaaa 
tggttaaaag 
gctatgggtg 
ttctctgtta 
atagaatgga 
atagaagtga 
gcattcttct 
gaccttgtgc 
ggcggtaaca 
gattatggag 
cctgcagcga 
cccacgcgcc 
tctgggaaat 
tttaatgatg 
gataaaagta 
cgaagaaacc 
gaatgcggaa 
ggttcggcag 
tctgacgaag 
cactctcaac 
gttttagtgt 
gacagagttc 
gccgccctcg 
acaaagttgg 



gaaagaagaa 
aagatgaaga 
caatcatttt 
ttaaaaagcg 
aatacgttgg 
ctgaatttac 
agtccaccag 
tgatgcaggt 
tgctgtttga 
aatccgagta 
agacggcaaa 
ctgctacccc 
gagatattgt 
agacatctat 
tagatttaat 
tggaaagagg 
aactctccac 
acaacactgg 
tacgttgtgg 
tgactaattc 
agagcataga 
tgaggcttaa 
caatatacga 
gacgtaaaag 
tgatggtgag 
taaaaattgg 
tcagttccag 
acgacgactg 
tacttctatt 
ctgtagaaga 
ttgcaaagga 
ccgcactcac 
gagaaaatgg 



gaaaatggaa 
agaggcagaa 
ccccaatgga 
gtcacggagg 
aaattgtaag 
ttccctattt 
aggggctagc 
aaacagccct 
taagaaaact 
ttttgatgtc 
aataataaaa 
aaaaattccc 
tggatttaaa 
ttctgcaatg 
gataactaat 
agtggtgccc 
ctccatattt 
tagcaattct 
attgagtata 
ttcttccccg 
tctttcgata 
tggattacag 
ctctggcgca 
gaaaagaagg 
tcagggcaat 
gggcggggct 
aagaagagta 
cgaagaagaa 
ccaacaacgg 
atataacaac 
tcttctctct 
catatcccgg 
acgtttcttc 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2946 



<210> 215 
<211> 975 
<212> PRT 
<213> SHRIMP 



<400> 215 



Met 


Glu 


Glu 


Thr 


He 


Thr 


Leu 


Lys 


Glu 


Ser 


Thr 


Gly 


Thr 


Val 


Ser 


Pro 


1 








5 










10 










15 




Phe 


Lys 


Asn 


Glu 


Asn 


He 


Thr 


Arg 


He 


Asn 


Tyr 


Val 


Arg 


Ala 


Phe 


Thr 






20 










25 










30 






Asp 


Thr 


Trp 
35 


Ser 


His 


Leu 


val 


Asn 
40 


He 


Ser 


Gly 


Ala 


Pro 
45 


Leu 


Thr 


Ala 


Glu 


Lys 
50 


Asn 


Pro 


Ser 


Ala 


He 
55 


Pro 


Ala 


Asn 


Glu 


Leu 
60 


Asn 


Arg 


Tyr 


Trp 


Thr 


Lys 


Thr 


Asn 


val 


Leu 


Cys 


Asn 


Pro 


Leu 


Phe 


Lys 


Leu 


Glu 


Asp 


His 


65 








70 










75 










80 


He 


Thr 


Arg 


Asp 


Glu 
85 


Asp 


Thr 


Gly 


Thr 


He 
90 


Thr 


Leu 


Lys 


Phe 


Lys 
95 


Met 


Tyr 


He 


Asp 


Asp 
100 


Lys 


Asn 


Gly 


Gin 


Ser 
105 


Ala 


Val 


Leu 


Met 


Leu 
110 


Ala 


Leu 


Asp 


Ser 


Phe 


Val 


Ser 


Leu 


Ala 


Ser 


Phe 


Ser 


His 


Gly 


Ala 


Asp 


Leu 


Val 




115 










120 










125 








Ser 


Asn 
130 


Lys 


Ser 


Glu 


Asn 


Lys 
135 


Phe 


Cys 


Val 


Lys 


He 
140 


Pro 


His 


Asp 


Thr 


Arg 


Ala 


Glu 


Ser 


Leu 


Leu 


Asn 


Asn 


Val 


Gly 


Phe 


Pro 


Ala 


Gly 


Leu 


Ser 


145 










150 










155 










160 
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Gly 


Pro 


Phe 


Lys 


Gly 


Lys 


Ser 


Gly 








180 


Lys 


Asn 


Asn 


Thr 






195 




Asn 


Val 


Ser 


Ala 




210 






Thr 


Phe 


Asn 


His 


225 








Ser 


Ser 


Ser 


Arg 


He 


He 


Gly 


Gin 








260 


Ala 


Gin 


Asn 


Lys 






275 




Met 


Thr 


Ser 


Glu 




290 






Cys 


Leu 


Ser 


Asp 


305 








Glu 


Glu 


Glu 


Gly 


Glu 


Asp 


Glu 


Glu 








340 


Glu 


Glu 


Ala 


Glu 






355 




Asp 


Ala 


He 


He 




370 






Gly 


Lys 


Lys 


Thr 


385 








Ser 


Ala 


Asn 


Thr 


Ser 


Leu 


Gly 


He 








420 


Thr 


Ser 


Leu 


Phe 






435 




Cys 


Gin 


Ser 


Thr 




450 






Tyr 


Ser 


Val 


Lys 


465 








Trp 


Ala 


Asn 


Pro 


Lys 


Glu 


Glu 


Val 






500 


Tyr 


Phe 


Asp 


Val 






515 




Lys 


Glu 


Thr 


Ala 




530 






Leu 


Pro 


Ala 


Met 


545 








Lys 


Thr 


Met 


Thr 


He 


Pro 


Glu 


Glu 








580 


He 


Ser 


Ala 


Met 






595 




Val 


He 


Asp 


Leu 




610 






Asn 


Asn 


Ala 


Phe 


625 








Asn 


Thr 


Trp 


Leu 



Arg 


Trp 


Ser 


lie 


165 








He 


Asp 


Gly 


Leu 


Asn 


Lys 


Arg 


Ala 








200 


Ser 


Ala 


Gin 


Gin 






215 




Gin 


Lys 


Lys 


Val 




230 






Gin 


Val 


Asn 


Gin 


245 








His 


Leu 


He 


Glu 


Lys 


Val 


Lys 


Glu 








280 


Glu 


Glu 


Glu 


Glu 






295 




He 


Asp 


Glu 


Glu 




310 






Asp 


Asp 


Asn 


Arg 


325 








Asp 


Glu 


Glu 


Glu 


Thr 


Cys 


Gly 


Ala 








360 


Phe 


Pro 


Asn 


Gly 






375 




Asn 


He 


Lys 


Lys 




390 






Leu 


Ser 


Phe 


Val 


405 








Lys 


Pro 


Val 


Gly 


Met 


Lys 


Gly 


Ser 








440 


Arg 


Gly 


Ala 


Ser 






455 




Asp 


Leu 


Met 


Gin 




470 






Pro 


Asp 


Arg 


Arg 


485 








Glu 


Val 


Lys 


Phe 


Val 


Ser 


Glu 


Leu 








520 


Lys 


He 


He 


Lys 






535 




Gly 


Ala 


Ala 


Thr 




550 






Ser 


He 


Phe 


Ser 


565 








Val 


Leu 


Ser 


Phe 


Gly 


Leu 


Leu 


Ser 








600 


Met 


He 


Thr 


Asn 






615 




Phe 


Leu 


Glu 


Arg 




630 






His 


Thr 


Asp 


Leu 



Asn Tyr Lys Ala 
170 

Ser Gly Ser Met 
185 

Thr Asp He Leu 

Leu Asp Asp Ser 
220 

Gly Val Cys Tyr 
235 

Arg Asn Leu Leu 
250 

Phe Arg Thr Lys 
265 

Glu Glu Asn Gly 

Glu Asp Glu Tyr 
300 

Asp Phe Tyr Glu 
315 

Thr Arg Lys Lys 
330 

Glu Tyr Asp Asp 
345 

Asn Gly Val He 

Gin Asn Ser Lys 
380 

Arg Ser Arg Arg 
395 

Glu Lys Tyr Val 
410 

Cys Pro Pro Pro 
425 

Glu Ala Asp Ser 

Arg He Arg Ser 
4 60 

Val Asn Ser Pro 
475 

Phe Val Leu Phe 
4 90 

Glu He Glu Cys 
505 

Pro Ser Asn He 

His Leu Ala Leu 
540 

Pro Lys He Pro 
555 

Val Arg Asp He 
570 

He Pro He Glu 
585 

Val Gin Phe Asp 

Gly Ala Phe Ala 
620 

Gly Val Val Pro 

635 

Val Gin Leu Ser 



Ala Asn Leu Ser 
175 

Leu Thr Val Leu 
190 

His Leu Val Asn 
205 

Glu Met Ser Arg 

Asp He Asn Val 
240 

His His Gin Asn 
255 

Gin Leu Glu Arg 
270 

Glu His Glu Glu 
285 

Glu Glu Gly Gly 

Asp Gly Tyr Asp 
320 

Lys Lys Met Glu 
335 

Glu Glu Asp Glu 
350 

Asp Cys Glu Asp 
365 

Arg Lys Lys Asn 

Lys Gly Glu Cys 
400 

Gly Asn Cys Lys 
415 

Ser Thr Glu Phe 
430 

Cys Tyr Asn Thr 
445 

Leu Leu Asn Lys 

Ser Ser Trp Lys 
480 

Asp Lys Lys Thr 
495 

Glu Lys Ser Glu. 
510 

Lys Val Trp Leu 
525 

He Glu Asp Phe 

Leu Asn Leu He 
560 

Val Gly Phe Lys 

575 

Trp Lys Thr Ser 
590 

Arg He He Glu 
605 

Thr Ser Cys Leu 

Arg Asp Gly Ser 
640 

Thr Ser He Phe 
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Tic* 
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Arg 
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Gly 
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Asp 
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Tin 






Val 


Asn 


Asp 


Lys 
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Ser 


Gly 


GlU 


Giy 


Asp 


Val 


Asn 


Asp 










Gly 


Gly 


.Gly 


TV 1 

Ala 






o "3 c 




Ala 


Val 


Ser 


Ser 




850 






His 


vax 


oer 


Asp 


865 








Ser 


Glu 


Leu 


Asn 


Gin 


Gin 


Asp 


Ser 








900 


Glu 


Tyr 


Asn 


Asn 






915 




Val 


Leu 


Ala 


Lys 




930 






Asn 


Ser 


Ala 


Ala 


945 








Leu 


Ser 


Ala 


Lys 



645 



Asn 


Arg 


Ksxy 


v ax 


OCX 


Cor- 

OCX 


O tii 


C a tt 
OCX 








OOu 




T All 


OCX 


Tip 










nld 


net 


i. ni 


Asn 




/ X U 






Pro 


Gin 


Pro 


Thr 


725 










Asp 


erne 


oer 


Lys 


Asn 


1 Jtr L. 


Ser 








/ oU 


Asp 


oer 


oxy 


nld 






TIC 




ber 


Arg 


Arg 


Lys 




/ y u 






Val 


Val 


Arg 


Arg 


805 








ax a 


nxS 


bin 


trne 


ser 


Arg 


vai 


Tyr 








o4U 


Arg 


Arg 


Arg 


Val 






o c c 
ODD 




OXLl 


ASp 


Asp 


Asp 




870 






Pro 


Thr 


His 


Ser 


885 








Cys 


Thr 


Glu 


Asp 


Arg 


Val 


Ser 


Gly 








920 


Asp 


Leu 


Leu 


Ser 






935 




Ala 


Leu 


Thr 


He 




950 






Thr 


Lys 


Leu 


Gly 



965 





OJU 






Asn 


Tip 
X X ts 


bxy 


y 


665 








Cys 


Gly 


Gly 


Asn 


oer 


Lys 


Arg 


uiy 








700 


ocl 




oer 


Pro 






*7 1 ^ 




Arg 


bin 


C A V- 

oer 


l le 










G1U 


vai 


C A V 

oer 


Giy 


745 








Asp 


Lys 


Ser 


Lys 




Lys 


Aid 


Leu 








■7 o i*i 


Arg 


Lys 


Arg 


Arg 






/ yo 




Asn 


Leu 


net 


vax 




fti n 






LaXn 


blU 




Cys 


825 








Lys 


Arg 


Ala 


Gin 


Arg 


Asn 


Lys 


Pro 








O C f\ 

0 oU 


Cys 


blU 


GlU 


GlU 






one 
O / O 




Gin 


Leu 


Leu 


Leu 




890 






Asp 


Asp 


Val 


Leu 


905 








Ser 


Ser 


Thr 


Thr 


Thr 


Val 


Ser 


Pro 








940 


Ser 


Arg 


His 


Ser 






955 




Glu 


Asn 


Gly 


Arg 




970 







655 

Asn Asn Asn Thr 
670 

Lys Gly Asp Tyr 
685 

He Thr Leu Lys 

Ser Ser Ser Ala 
720 

Asp Leu Ser He 
735 

Lys Leu Arg Leu 
750 

Asp Val Phe Asn 
765 

Leu Thr Cys Thr 

Thr Leu Leu Ala 
800 

Ser Gin Gly Asn 
815 

Gly lie Lys He 
830 

Arg Arg Gly Ser 
845 

Gin Phe Thr He 

Gly Asp Phe Ser 
880 

Phe Gin Gin Arg 
895 

Val Ser Val Glu 
910 

Ala Gly Asp Arg 
925 

Asn Glu Lys Arg 

Leu Phe Asn Ala 
960 

Phe Phe Leu 
975 



<210> 216 
<211> 2280 
<212> DNA 
<213> SHRIMP 



<400> 216 

atgccaccta 

cgaaaaaagg 

aaattggacg 

attctgtcaa 

ggaaaaagag 

gaagggaaag 

gaagagaaag 

gttacaccta 

ccctctaaaa 

aaaatgatga 

gtaaggagtg 

gtcaaggaga 



aacataaacc 
aggatgaege 
cgccaacttc 
ggagtggcga 
acactgatca 
aggaagggga 
aagaaaaaga 
ctacatctgt 
aaagaaagag 
ccgtgtctcg 
aaaatgegee 
aggctataca 



taatactget 
cgaatccaga 
tagtaagaat 
ttgtgttgct 
agaaggtggt 
aggggaggaa 
tggagaggag 
ategaagegg 
aagtgacaca 
accattgaga 
acaatacgat 
gtacagaata 



ttaaaaaagc 
ttccaacgaa 
aggcaaaggc 
ggcgattgtt 
ggaagaggag 
cagcageggg 
gaagaagagg 
gcaaaacaaa 
gaatcaaagg 
ggtgctataa 
tacgtgtcat 
aggtctcttt 



atataattag 
acatgggtca 
gtaagattcg 
cagatttaga 
gaggtaatga 
aagaaaagga 
agaatgttga 
tgaaaaagca 
cattagcagt 
cttcgggctc 
acttggcaga 
tagctaacct 



gaaccaacaa 
ggaggtctct 
tactagtaaa 
aaatgatgag 
agaagaagag 
agaacagagt 
agacgaacat 
catatttcct 
gcctgctgga 
tattttaggt 
cgaggcagtg 
tttaaaggee 
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aataaaacca 
aagttcggag 
aaagctcttc 
gagattcagt 
acagagataa 
gaacgagtat 
ttaccgaaca 
actgtacgtg 
aaaatgtcat 
tctatctcca 
gttgaaaaaa 
ggagtgtttg 
cctttccaac 
cctacttacc 
gctcctcaac 
ccgcctcagc 
cctcctcccc 
cagcagcagc 
aatactgctc 
caacctccac 
agagcagaag 
agtattgatt 
gcgcaacgtc 
gcatttattg 
ttagatgtca 
tttgactggg 



aggcttttcc 
gaaaaagaac 
tggctaatag 
ggcaagaatt 
acaatttaat 
actttgaaca 
agaactttga 
gcctatcaat 
cgactatgga 
tgctggagaa 
tgataaagag 
tgcccaaaca 
ctcctccctt 
aaggatactt 
aattacaaca 
tgcagcagca 
ctactcctca 
cctttccggt 
cttctccacc 
ctacaccagt 
aagaaaacgc 
ttgatcttcc 
aagctgtcaa 
caccaattat 
atgaatccat 
atatggggtt 



tacttcatca 
aaacactttt 
ttgttgggca 
aagttcaaaa 
gagcagtatc 
acttgctact 
taagaatttt 
acctcgttat 
tcttttggtt 
atgtgcagct 
tgcagtggag 
acaacaacag 
tcctcttcct 
aaatccttat 
acagtacccg 
gcagcagcag 
acaacaatca 
tcaactcatt 
tatttcccgt 
tctacccaag 
tactgatatg 
tcccgctact 
ggaaacggga 
tcgtcaaccg 
ctatggctca 
agacgattta 



tctttattgt 
gttgtcacaa 
atttcccata 
atattgaaat 
gtggaggatc 
gtgtgtaata 
gtatctgtag 
tttagagcta 
ggaggaggga 
ggtgttttag 
gaaacaagtc 
caacgtcagc 
ccaccacagg 
tatcaatata 
ctgtactttt 
cagcaaccac 
cccagtaata 
tctagtcccc 
gtaagatttg 
cctactcctc 
tcctttactg 
ccagggagga 
gtcagaggag 
cgtacaccag 
gacattgaac 
aatggggatg 



catctgaaca 
atgtgggtgc 
ggaaagatat 
ctcttaacga 
gtatacagcg 
acctgtttgg 
cttctgataa 
taaataataa 
tgaggagtaa 
caagagcttc 
aagcatttaa 
aacaacaaca 
cacctttcca 
accagtatta 
taggcaacca 
cacagccgcc 
ttcctcctcc 
ctcctcctcc 
actctcgttc 
ttcctcctcc 
atatagactc 
acgttgaaga 
aagaagaaga 
gaaattttag 
cagcagcagc 
aaccatatga 



agggaagaaa 
tgaattggtg 
acggtcagga 
tggaaatgca 
aactgtgaaa 
tacacgtata 
ctctaacgcc 
cgtgtgggta 
atcagagcat 
agcaagaccg 
tttatcgact 
acaatttccc 
ggttcaacaa 
taacccatat 
atcacagccg 
caataatatt 
ccagcagcag 
tcctatacct 
tactacccct 
gtctacagca 
tgagcttggc 
gataataaaa 
agaagaggag 
agatgaactt 
agcagcagcg 
atttgaataa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 



<210> 217 
<211> 757 
<212> PRT 
<213> SHRIMP 



<400> 217 



Met 


Pro 


Pro 


Lys 


His 


Lys 


Pro 


Asn 


1 








5 








Arg 


Asn 


Gin 


Gin 


Arg 


Lys 


Lys 


Glu 








20 










Arg 


Asn 


Met 


Gly 


Gin 


Glu 


Val 


Ser 






35 










40 


Lys 


Asn 


Arg 


Gin 


Arg 


Arg 


Lys 


He 




50 










55 




Ser 


Gly 


Asp 


Cys 


Val 


Ala 


Gly 


Asp 


65 










70 






Gly 


Lys 


Arg 


Asp 


Thr 


Asp 


Gin 


Glu 










85 








Glu 


Glu 


Glu 


Glu 


Glu 


Gly 


Lys 


Glu 








100 










Arg 


Glu 


Glu 


Lys 


Glu 


Glu 


Gin 


Ser 






115 










120 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asn 


Val 




130 










135 




Thr 


Ser 


val 


Ser 


Lys 


Arg 


Ala 


Lys 


145 










150 






Pro 


Ser 


Lys 


Lys 


Arg 


Lys 


Arg 


Ser 










165 








Ala 


Gly 


Lys 


Met 


Met 


Thr 


Val 


Ser 








180 










Ser 


Gly 


Ser 


He 


Leu 


Gly 


Val 


Arg 






195 










200 


Tyr 


Val 


Ser 


Tyr 


Leu 


Ala 


Asp 


Glu 



Thr 


Ala 


Leu 


Lys 


Lys 


His 


He 


He 




10 










15 




Asp 


Asp 


Ala 


Glu 


Ser 


Arg 


Phe 


Gin 


25 










30 






Lys 


Leu 


Asp 


Ala 


Pro 


Thr 


Ser 


Ser 










45 








Arg 


Thr 


Ser 


Lys 


He 


Leu 


Ser 


Arg 








60 










Cys 


Ser 


Asp 


Leu 


Glu 


Asn 


Asp 


Glu 






75 










80 


Gly 


Gly 


Gly 


Arg 


Gly 


Gly 


Gly 


Asn 




90 










95 




Glu 


Gly 


Glu 


Gly 


Glu 


Glu 


Gin 


Gin 


105 










110 






Glu 


Glu 


Lys 


Glu 


Glu 


Lys 


Asp 


Gly 










125 








Glu 


Asp 


Glu 


His 


Val 


Thr 


Pro 


Thr 








140 










Gin 


Met 


Lys 


Lys 


His 


He 


Phe 


Pro 






155 










160 


Asp 


Thr 


Glu 


Ser 


Lys 


Ala 


Val 


Pro 




170 










175 




Arg 


Pro 


Leu 


Arg 


Gly 


Ala 


He 


Thr 


185 










190 






Ser 


Glu 


Asn 


Ala 


Pro 


Gin 


Tyr 


Asp 










205 








Ala 


Val 


Val 


Lys 


Glu 


Lys 


Ala 


He 
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210 

Gin Tyr Arg lie 
225 

Thr Lys Ala Phe 

Lys Lys Lys Phe 
260 

Val Gly Ala Glu 
275 

lie Ser His Arg 
290 

Leu Ser Ser Lys 
305 

lie Asn Asn Leu 

Val Lys Glu Arg 
340 

Leu Phe Gly Thr 
355 

Val Ser Val Ala 
370 

lie Pro Arg Tyr 
385 

Ser Ser Thr Met 

Glu His Ser lie 
420 

Arg Ala Ser Ala 
435 

Glu Thr Ser Gin 
450 

Gin Gin Gin Gin 
4 65 

Gin Pro Pro Pro 

Gin Gin Pro Thr 
500 

Gin Tyr Tyr Asn 
515 

Leu Tyr Phe Leu 
530 

Gin Gin Gin Gin 
545 

Pro Pro Thr Pro 

Gin Gin Gin Gin 
580 

Pro Pro Pro Pro 
595 

Val Arg Phe Asp 
610 

Val Leu Pro Lys 
625 

Glu Glu Glu Asn 

Leu Gly Ser He 
660 

Val Glu Glu He 
675 

Val Arg Gly Glu 
690 



215 

Arg Ser Leu Leu 
230 

Pro Thr Ser Ser 
245 

Gly Gly Lys Arg 

Leu Val Lys Ala 
280 

Lys Asp He Arg 
295 

He Leu Lys Ser 
310 

Met Ser Ser He 
325 

Val Tyr Phe Glu 

Arg He Leu Pro 
360 

Ser Asp Asn Ser 
375 

Phe Arg Ala He 
390 

Asp Leu Leu Val 
4 05 

Ser Met Leu Glu 

Arg Pro Val Glu 
440 

Ala Phe Asn Leu 
455 

Gin Arg Gin Gin 
470 

Phe Pro Leu Pro 
485 

Tyr Gin Gly Tyr 

Pro Tyr Ala Pro 
520 

Gly Asn Gin Ser 
535 

Gin Gin Pro Pro 
550 

Gin Gin Gin Ser 
565 

Gin Pro Phe Pro 

He Pro Asn Thr 
600 

Ser Arg Ser Thr 
615 

Pro Thr Pro Leu 
630 

Ala Thr Asp Met 
645 

Asp Phe Asp Leu 

He Lys Ala Gin 
680 

Glu Glu Glu Glu 
695 



220 

Ala Asn Leu Leu 
235 

Ser Leu Leu Ser 
250 

Thr Asn Thr Phe 
265 

Leu Leu Ala Asn 

Ser Gly Glu He 
300 

Leu Asn Asp Gly 
315 

Val Glu Asp Arg 
330 

Gin Leu Ala Thr 
345 

Asn Lys Asn Phe 

Asn Ala Thr Val 
380 

Asn Asn Asn Val 
395 

Gly Gly Gly Met 
410 

Lys Cys Ala Ala 
425 

Lys Met He Lys 

Ser Thr Gly Val 
460 

Gin Gin Gin Gin 
475 

Pro Pro Gin Ala 
490 

Leu Asn Pro Tyr 
505 

Gin Gin Leu Gin 

Gin Pro Pro Pro 
540 

Gin Pro Pro Asn 
555 

Pro Ser Asn He 
570 

Val Gin Leu He 
585 

Ala Pro Ser Pro. 

Thr Pro Gin Pro 
620 

Pro Pro Pro Ser 
635 

Ser Phe Thr Asp 
650 

Pro Pro Ala Thr 
665 

Arg Gin Ala Val 

Glu Glu Ala Phe 
700 



Lys 


Ala 


Asn 


Lys 








240 


Ser 


Glu 


Gin 


Gly 






255 




Val 


Val 


Thr 


Asn 




270 






Ser 


Cys 


Trp 


Ala 


285 








Gin 


Trp 


Gin 


Glu 


Asn 


Ala 


Thr 


Glu 








320 


He 


Gin 


Arg 


Thr 






335 




Val 


Cys 


Asn 


Asn 




350 






Asp 


Lys 


Asn 


Phe 


365 








Arg 


Gly 


Leu 


Ser 


Trp 


Val 


Lys 


Met 








400 


Arg 


Ser 


Lys 


Ser 






415 




Gly 


Val 


Leu 


Ala 




430 






Ser 


Ala 


Val 


m ii 

V7JLU 


445 








Phe 


Val 


Pro 


Lys 


Phe 


Pro 


Pro 


Phe 








480 


Pro 


Phe 


Gin 


Val 






4 95 




Tyr 


Gin 


Tyr 


Asn 




510 






Gin 


Gin 




Pro 


525 








Gin 


Leu 


Gin 


Gin 


Asn 


He 


Pro 


Pro 








560 


Pro 


Pro 


Pro 


Gin 






575 




Ser 


Ser 


Pro 


Pro 




590 






Pro 


He 


Ser 


Arg 


605 








Pro 


Pro 


Thr 


Pro 


Thr 


Ala 


Arg 


Ala 








640 


He 


Asp 


Ser 


Glu 






655 




Pro 


Gly 


Arg 


Asn 




670 






Lys 


Glu 


Thr 


Gly 


685 








He 


Ala 


Pro 


He 
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lie Arg Gin Pro Arg Thr Pro Gly 
705 710 
Val Asn Glu Ser lie Tyr Gly Ser 
725 

Ala Ala Phe Asp Trp Asp Met Gly 
740 

Pro Tyr Glu Phe Glu 
755 



Asn Phe Arg Asp Glu Leu Leu Asp 
715 720 
Asp lie Glu Pro Ala Ala Ala Ala 

730 735 
Leu Asp Asp Leu Asn Gly Asp Glu 
745 750 



<210> 218 
<211> 3177 
<212> DNA 
<213> SHRIMP 



<400> 218 

atggaaggtg 

gacctaatgg 

ggagaaagtg 

gatgaacttt 

tctctattaa 

gttccattta 

aaaaaacaag 

gtcatgaaaa 

gtggaggaga 

atacgcattt 

atgaaaagtg 

cataaggagt 

gctagaacgt 

atttgggaaa 

aaccactaca 

gaatatgtgc 

gatatagtgt 

agagtagagg 

gcttcgctct 

agaaaactag 

gcaatcgcca 

gcggcgatgg 

atattgagtg 

aaaccaaaat 

gtacatgctt 

cctgattgta 

aaactaaagg 

agatgtcacg 

ccagaataca 

tgcttctata 

gactttgcac 

gttaaactct 

acccagtgtt 

gtggatgtac 

ttccaggatg 

gatactggtc 

aattttgtac 

agaactcccg 

ctagcgctag 

aacgacatat 

tacttaaaca 

agtatacctc 

ttaggattat 

ccagaatgtg 

ttatcgggta 



taacatccat 
gaggcagaaa 
gtgatcttct 
tggagggaga 
tgaagaaagc 
caactcttga 
atgggccagc 
atacgcgctt 
agaatgcgtt 
tcttactcct 
tcatttatga 
ggtacatgtg 
ttagtgtttt 
atatggcctt 
ataagggtca 
cccaagaatt 
acagatactc 
agccaaagaa 
tagaaaatac 
gatctttcat 
aatgcatggt 
tgtaccaacc 
ggaataccac 
tcttcaactg 
tcatggacac 
ctataaaaca 
acgtttttag 
aattttcatc 
ttgcagccaa 
ctgtggaact 
tagacaagga 
ttaagatggt 
tactacccaa 
ctcctccagt 
tagaagatgc 
ccggactcca 
ctccaaatga 
aaaagaagat 
ccaactggca 
ctgtaatcca 
gtgtttttgg 
taattgcaac 
gggcgaaaat 
tgttccatag 
tcagacctaa 



agttgcagct 
taataagagg 
ggaggccata 
aacagttgtg 
cctcgacatg 
ccaaaacgaa 
atacaaggat 
gatgttggat 
gtgtgaggcg 
tgttgccatc 
agatgtggta 
gcccttctct 
atttggttct 
ctccgtaaca 
tcttcgcatg 
acaccatatt 
tatctacaat 
atccgtaatg 
accactgtcc 
cctcgaacat 
tagccaigaca 
atctgacgaa 
caataaaaca 
ttctcatgaa 
ttttgaagca 
cctaatgtat 
gtgcccatgt 
tttattcgag 
caaacttctc 
tcttcaggcc 
ttcaaaattt 
ggagggagaa 
tgtatgtgac 
attgccgtac 
tgaatttgat 
taaatggccc 
agaggtgaca 
tcgaggatgg 
ctcgactgat 
caacaccagg 
gggttttgat 
ctatttttat 
gtttgtcaag 
agcccattct 
ccaaggtgca 



gctgtcccag 
tccacctatg 
ctggacatct 
attaatccaa 
aacatcaaga 
caagaattca 
ctaattcatc 
gaaattatac 
tacgccgaga 
aaacgagatc 
gtttctctca 
gctttgcaag 
gatatgcatg 
gaagcatttt 
tacgctgcca 
ctttttggaa 
gttccctatc 
tcgccttctg 
ctggtatcta 
gaaaacgctg 
ctgcaagaag 
gtggaggtaa 
tgtgggttgt 
aacatgaaag 
cgacaagaaa 
gtgtacgaaa 
tgtggagaat 
agggcgattt 
atcaccgaac 
gaattcatgg 
actgtggtgg 
actggtgata 
cagcctaatg 
cctcctcccg 
gaccctccca 
atgagattgt 
aattgtaggc 
aacccagagt 
cgtatgccag 
gagaggtttc 
gatagagatt 
gtctatgaaa 
aatcttatag 
tttgtccttc 
aaaatggaaa 



aagttgcaat 
aacgaattgt 
gcaaccgcaa 
ccggtctttt 
tgaaaagtaa 
ttggacattt 
gtatttatag 
gtggaaatgc 
tgatatcaga 
agaacaagaa 
acactttgaa 
tggggacaaa 
aaggaagaaa 
taagtggtcc 
ggcccgtgta 
caaaaatagc 
ttttggcagc 
gtctaataat 
gacacggcat 
aaaatatgca 
agtcttgggg 
tacaagcaca 
gctacgcgga 
cctcttacga 
cgtgttctgc 
aggtctcggc 
atatggtaca 
tagcaggaga 
tcatcaagcg 
agatgtgtaa 
ataatagatt 
gcaaatgttc 
agatggaaga 
aacaactgga 
ccgatgagct 
cctgcggttt 
aggctgtttc 
ctccggaggg 
agaatatgaa 
agaatagggt 
ttgaacaagt 
aattgaacca 
gggaaatggt 
attgtgtaga 
ttgttaaaca 



cttgatcacc 
gggtatagtg 
ttcttatagg 
gaaagagata 
cgacgaccca 
aaagagttgc 
tggcatgttt 
aggcgatgca 
tatggatttg 
gcaccgccac 
ggatgtgttt 
aataagagac 
caatgacagg 
atccaccaac 
tgacgctatg 
caagatgatt 
agatactgaa 
ctccccgaac 
accttcagcc 
tctagaagaa 
agaaagtcaa 
cgttacaaag 
tttggacatg 
ttatttcccc 
aaaactatgc 
tggtagtgaa 
gtttataggt 
aaatgtcgac 
agcagagaaa 
gatggacaag 
cagaccgccc 
cctgatctgt 
cattgtaaca 
agactattac 
agtaagatat 
cttggcgtca 
tatactaaaa 
caaggttctt 
ggggcttctc 
taaagtccat 
tgtaggtgtg 
tgagagtgca 
gttggagcgg 
taggcgtgcc 
ggtaaatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
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gtacgtcaaa 
aggacattag 
aaatgtaaga 
agcggtagaa 
aaccctgacc 
gctgctttag 
gatagttatt 
atggaagatg 



acatgacatc 
aatggaaggt 
cacctaacat 
gagatgggta 
atatactaat 
aaaaggtgta 
tgtccaaaca 
gttttgaatt 



tgaatcaata 
tgagaaagaa 
aaaattaggt 
ccctacagtt 
tgaccataaa 
taattgtact 
aagaggagga 
tgatgtacat 



aaggatccag 
ggacaggaaa 
ggatgtatta 
ttttgttgga 
ctattgtatt 
ctatgttgtc 
ggagaagaag 
actaaaacag 



tgttcactgt 
ttaagactgt 
ccatgacttg 
tatgtgaaga 
ctgattgtaa 
tagctttaag 
aagagattga 
cggtgcctac 



ggatgaaaag 
aaaatgccct 
ttatgattgt 
tgaaataact 
gtctacaaag 
aaaatgtagt 
gatatatgta 
aaaataa 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3177 



<210> 219 
<211> 1050 
<212> PRT 
<213> SHRIMP 



<400> 219 



Met 


Glu 


Gly 


Val 


1 








He 


Leu 


He 


Thr 








20 


i yi 


Glu 


Arg 


lie 






35 




Ala 


He 


Leu 


Asp 




50 






Glu 


Gly 


Glu 


Thr 


65 








Ser 


Leu 


Leu 


Met 


Asn 


Asp 


Asp 


Pro 








100 


Phe 


He 




His 






115 




Lys 


Asp 


Leu 


He 




130 






Thr 


Arg 


Leu 


Met 


145 








Val 


Glu 


Glu 


Lys 


Asp 


Met 


Asp 


Leu 








180 


Asp 


Gin 


Asn 


Lys 






195 




Val 


Val 


Val 


Ser 




210 






Tyr 


Met 


Trp 


Pro 


225 








Ala 


Arg 


Thr 


Phe 


Asn 


Asn 


Asp 


Arg 








260 


Phe 


Leu 


Ser 


Gly 






275 




Arg 


Met 


Tyr 


Ala 




290 






Gin 


Glu 


Leu 


His 


305 








Asp 


He 


Val 


Tyr 


Ala 


Asp 


Thr 


Glu 








340 


Ser 


Gly 


Leu 


He 






355 





Thr 


Ser 


He 


Val 


5 








Asp 


Leu 


Met 


Gly 


Val 


Gly 


He 


Val 








40 


He 


Cys 


Asn 


Arg 






55 




Val 


Val 


He 


Asn 




70 






Lys 


Lys 


Ala 


Leu 


85 








Val 


Pro 


Phe 


Thr 


Leu 


Lys 


Ser 


Cys 








120 


n j. d 


Arg 


He 


-lyxr 






135 




Leu 


Asp 


Glu 


He 




150 






Asn 


Ala 


Leu 


Cys 


165 








He 


Arg 


He 


Phe 


Lys 


His 


Arg 


His 








200 


Leu 


Asn 


Thr 


Leu 






215 




Phe 


Ser 


Ala 


Leu 




230 






Ser 


Val 


Leu 


Phe 


245 








He 


Trp 


Glu 


Asn 


Pro 


Ser 


Thr 


Asn 








280 


Ala 


Arg 


Pro 


Val 






295 




His 


He 


Leu 


Phe 




310 






Arg 


Tyr 


Ser 


He 


325 








Arg 


Val 


Glu 


Glu 


He 


Ser 


Pro 


Asn 



360 



Ala 


Ala 


Ala 


Val 




10 






Gly 


Arg 


Asn 


Asn 


25 








Gly 


Glu 


Ser 


Gly 


Asn 


Ser 


Tyr 


Arg 








60 


Pro 


Thr 


Glv 


Leu 






75 




Asp 


Met 


Asn 


He 




90 






Thr 


Leu 


Asp 


Gin 


105 








Lys 


Lys 


Gin 


Asp 


Ser 


Gly 


Met 


Phe 








140 


lie 


Arg 


Gly 


Asn 






155 




Glu 


Ala 


Tyr 


Ala 




170 






Leu 


Leu 


Leu 


Val 


185 








Met 


Lys 


Ser 


Val 


Lys 


Asp 


val 


Phe 








220 


Gin 


Val 


Gly 


Thr 






235 




Gly 


Ser 


Asp 


Met 




250 






Met 


Ala 


Phe 


Ser 


265 








Asn 


His 


Tyr 


Asn 


Tyr 


Asp 


Ala 


Met 








300 


Gly 


Thr 


Lys 


He 






315 




Tyr 


Asn 


Val 


Pro 




330 






Pro 


Lys 


Lys 


Ser 


345 








Ala 


Ser 


Leu 


Leu 



Pro 


Glu 


Val 


Ala 






15 




Lys 


Arg 


Ser 


Thr 




30 






Asp 


Leu 


Leu 


Glu 


45 








Asp 


Glu 


Leu 


Leu 


Leu 


Lys 


Glu 


He 








80 


Lys 


Met 


Lvs 


Ser 






95 




Asn 


Glu 


Gin 


Glu 




110 






Gly 


Pro 


Ala 


Tyr 


125 








Val 


Met 


Lys 


Asn 


Ala 


Gly 


Asp 


Ala 








160 


Glu 


Met 


He 


Ser 






175 




Ala 


He 


Lys 


Arg 




190 






He 


Tyr 


Glu 


Asp 


205 








His 


Lys 


Glu 


Trp 


Lys 


He 


Arg 


Asp 








240 


His 


Glu 


Gly 


Arg 






255 




Val 


Thr 


Glu 


Ala 




270 






Lys 


Gly 


His 


Leu 


285 








Glu 


Tyr 


Val 


Pro 


Ala 


Lys 


Met 


He 








320 


Tyr 


Leu 


Leu 


Ala 






335 




Val 


Met 


Ser 


Pro 




350 






Glu 


Asn 


Thr 


Pro 



365 
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Leu 


Ser 


Leu 


Val 




370 






Ser 


Phe 


lie 


His 


385 








Ala 


Lys 


Cys 


Met 


Ser 


Gin 


Ala 


Ala 








420 


Gin 


Ala 


His 


Val 






435 




Cys 


Gly 


Leu 


Cys 




450 






Cys 


Ser 


His 


Glu 


465 








Ala 


Phe 


Met 


Asp 


Leu 


Cys 


Pro 


Asp 








500 


Val 


Ser 


Ala 


Gly 






515 




Cys 


Gly 


Glu 


Tyr 




530 






Ser 


Leu 


Phe 


Glu 


545 








Tyr 


lie 


Ala 


Ala 


Glu 


Lys 


Cys 


Phe 








580 


Met 


Cys 


Lys 


Met 






595 




Thr 


Val 


Val 


Asp 




610 






Val 


Glu 


Gly 


Glu 


625 








Cys 


Leu 


Leu 


Pro 


Val 


Thr 


Val 


Asp 








660 


Gin 


Leu 


Glu 


Asp 






675 




Asp 


Pro 


Pro 


Thr 




690 






His 


Lys 


Trp 


Pro 


705 








Pro 


Asn 


Glu 


Glu 


Arg 


Thr 


Pro 


Glu 








740 


Gly 


Lys 


Val 


Leu 






755 




Asn 


Met 


Lys 


Gly 




770 






Glu 


Arg 


Phe 


Gin 


785 








Gly 


Gly 


Phe 


Asp 


Pro 


Leu 


He 


Ala 








820 


Ser 


Ala 


Leu 


Gly 






835 




Glu 


Met 


Val 


Leu 



Ser Arg His Gly 
375 

Glu Asn Ala Glu 
390 

Val Ser Gin Thr 
405 

Met Val Tyr Gin 

Thr Lys He Leu 
440 

Tyr Ala Asp Leu 
455 

Asn Met Lys Ala 
470 

Thr Phe Glu Ala 
485 

Cys Thr He Lys 

Ser Glu Lys Leu 
520 

Met Val Gin Phe 
535 

Arg Ala He Leu 
550 

Asn Lys Leu Leu 
565 

Tyr Thr Val Glu 

Asp Lys Asp Phe 
600 

Asn Arg Phe Arg 
615 

Thr Gly Asp Ser 
630 

Asn Val Cys Asp 
645 

Val Pro Pro Pro 

Tyr Tyr Phe Gin 
680 

Asp Glu Leu Val 
695 

Met Arg Leu Ser 
710 

Val Thr Asn Cys 
725 

Lys Lys He Arg 

Leu Ala Asn Trp 
760 

Leu Leu Asn Asp 
775 

Asn Arg Val Lys 
790 

Asp Arg Asp Phe 
805 

Thr Tyr Phe Tyr 

Leu Trp Ala Lys 
840 

Glu Arg Pro Glu 



He 


Pro 


Ser 


Ala 








ion 


Asn 


Met 


His 


Leu 






T> ft C 

395 




Leu 


Gin 


Glu 


Glu 




410 






Pro 


Ser 


Asp 


Glu 


425 








Ser 


Gly 


Asn 


Thr 


Asp 


Met 


Lys 


Pro 








4 60 


Ser 


Tyr 


Asp 


Tyr 






475 




Arg 


Gin 


Glu 


Thr 




490 






His 


Leu 


Met 


Tyr 


505 








Lys 


Asp 


Val 


Phe 


He 


Gly Arg 


Cys 








540 


Ala 


Gly 


Glu 


Asn 






555 




He 


Thr 


Glu 


Leu 




570 






Leu 


Leu 


Gin 


Ala 


585 








Ala 


Leu 


Asp 


Lys 


Pro 


Pro 


Val 


Lys 








620 


Lys 


Cys 


Ser 


Leu 






635 




Gin 


Pro 


Asn 


Glu 




650 






Val 


Leu 


Pro 


Tyr 


665 








Asp 


Val 


Glu 


Asp 


Arg 


Tyr 


Asp 


Thr 








700 


Cys 


Gly 


Phe 


Leu 






715 




Arg 


Gin 


Ala 


Val 




730 






Gly 


Trp 


Asn 


Pro 


745 








His 


Ser 


Thr 


Asp 


He 


Ser 


Val 


He 








780 


Val 


His 


Tyr 


Leu 






795 




Glu 


Gin 


Val 


Val 




810 






Val 


Tyr 


Glu 


Lys 


825 








Met 


Phe 


Val 


Lys 


Cys 


Val 


Phe 


His 



Arg 


Lys 


Leu 


Gly 


Glu 


GlU 


Ala 


lie 










Ser 


Trp 


Gly 


Glu 










it- 1 

Val 


Glu 


Val 


1 le 




a i n 






Thr 


Asn 


Lys 


Thr 


445 








Lys 


Phe 


Phe 


Asn 


Phe 


Pro 


Val 


His 








A Q ft 
H O U 


Cys 


Ser 


Ala 


Lys 






495 




Val 


Tyr 


Glu 


Lys 




OlU 






Arg 


Cys 


Pro 


Cys 


525 








His 


Glu 


Phe 


Ser 


Val 


Asp 


Pro 


Glu 








C £ ft 

560 


lie 


Lys 


Arg 


Ala 






575 




Glu 


Phe 


Met 


Glu 




590 






Asp 


Ser 


Lys 


Phe 


605 








Leu 


Phe 


Lys 


Met 


He 


Cys 


Thr 


Gin 








640 


Met 


Glu 


Asp 


He 






655 




Pro 


Pro 


Pro 


Glu 




670 






Ala 


Glu 


Phe 


Asp 


685 








Gly 


Pro 


Gly 


Leu 


Asn 


Phe 


Val 


Pro 








720 


Ser 


He 


Leu 


Lys 






735 




Glu 


Ser 


Pro 


Glu 




750 






Arg 


Met 


Pro 


Glu 










His 


Asn 


Thr 


Arg 


Asn 


Ser 


Val 


Phe 








800 


Gly 


Val 


Ser 


He 






815 




Leu 


Asn 


His 


Glu 




830 






Asn 


Leu 


He 


Gly 


845 








Arg 


Ala 


His 


Ser 
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850 










O +J <J 










860 










rne 


veil 


Leu 


His Cys 




Asp 


Arg 


Arg 


Ala 
Ala 


Leu 






He 


Arg 


XT X \J 


ft 










ft 7 n 




















880 


Asn 




«xy 


nld 


Lys 


net 




lie 


Val 


Lys 


uin 


val 


Asn 


He 


Val 


Arg 




















pQn 

O 7 U 










895 




uin 


Asn 


net 


i nr 


oer 




oer 


lie 


Lys 


Asp 


fro 


val 


rne 


Thr 


Val 


ASp 




























910 






(jJLU 


x<ys 


/\rg 


i nr 


Leu 


bJ.U 


Trp 


Lys 


val 


VjxU 


Lys 


bill 


Gly 


Gin 


Glu 


ne 
















920 










925 








Lys 


Thr 


Val 


Lys 


Cys 


pro 


Lys 


Cys 


Lys 


mr 


Pro 


Asn 


He 


Lys 


Leu 






930 










935 










940 










Gly 


Cys 


He 


Thr 


Met 


Thr 


Cys 


Tyr 


Asp 


Cys 


Ser 


Gly 


Arg 


Arg Asp 


oiy 


945 










950 










955 










960 


Tyr 


Pro 


Thr 


Val 


Phe 


Cys 


Trp 


He 


Cys 


Glu 


Asp 


Glu 


He 


Thr 


Asn 


Pro 










965 










970 










975 




Asp 


His 


He 


Leu 


He 


Asp 


His 


Lys 


Leu 


Lys 


Asp 


Cys 


Lys 


Ser 


Thr 


Lys 








980 










985 










990 






Ala 


Ala 


Leu 


Glu 


Lys 


Val 


Tyr 


Asn 


Cys 


Thr 


Leu 


Cys 


Cys 


Leu 


Ala 


Leu 






995 










1000 








1005 






Arg 


Lys 


Cys 


Ser 


Asp 


Ser 


Tyr 


Leu 


Ser 


Lys 


Gin 


Arg 


Gly 


Gly 


Gly 


Glu 




1010 








1015 








1020 








Glu 


Glu 


Glu 


He 


Glu 


He 


Tyr 


Val 


Met 


Glu 


Asp 


Gly 


Phe 


Glu 


Phe 


Asp 


1025 








1030 








1035 








1040 


Val 


His 


Thr 


Lys 


Thr 


Ala 


Val 


Pro 


Thr 


Lys 















1045 1050 



<210> 220 
<211> 2547 
<212> DNA 
<213> SHRIMP 



<400> 220 

atgggttcta 

agtcttgaaa 

cccaagcttg 

cttcccgcta 

aaaattgttg 

cacaaaaaca 

atccaccctg 

atggccaatg 

tgtttcggac 

agaaagaaga 

attcacggat 

tttacccatg 

tgcttccttc 

gcggcaatca 

aaaggaagcc 

atctttaacg 

gccatctata 

gccggtaatg 

ttcatggaga 

ggcctttccg 

ggcaaaggta 

atcgaaactg 

gaaaatgtcg 

tcggaggaaa 

tctcccatcc 

atcatgacca 

cgcatttcca 

ttcttcaaac 



accagcaaca 
agataatcaa 
acaagaacgc 
ccatctcctt 
accaccctga 
ccaaagagtg 
gaactggtaa 
ctgaaattct 
tgaggacgct 
gaa tcttggt 
cagacatcaa 
cttcccccac 
tgggccttca 
tctccaagac 
ccatttcgtc 
tctctgtgaa 
tttctgattg 
aagatttgag 
gagtgaaggc 
acgtccatgg 
aagaggtggt 
gaacacctta 
gcatcatcaa 
ctgcagtgtg 
cttccctaag 
gaaatctcga 
atatgaaaac 
tcagaatccc 



atcattcatc 
gaggattgaa 
aattaaccct 
ccaagaaatg 
ttttggaaaa 
gaatagtttt 
accagcatca 
cgatgctgtt 
agaatactct 
tgagcgccct 
atctgtcatt 
actgtttaat 
agatgatagc 
tgctggagga 
atggagtggt 
aaaggttagc 
gcatctggac 
gacgagggat 
tgggaagaat 
cgaagagttt 
gaaggcacgt 
tgtgtgcttt 
gtcttcaaat 
caatttggct 
gccctatgtt 
caaggtgatt 
taggccaatg 
cttcgaatct 



tcaaagagga 
aatgcctgtc 
caagaacttg 
gacgattttc 
ctggcaggaa 
agtgcaacaa 
gtagttaatg 
atcgattaca 
tatttgatca 
caggacatga 
gaaacatatg 
tgtggaacag 
attgagggta 
ctcggcatcc 
acccaccctg 
cagggaggag 
gtgaaggact 
cttttcccag 
tggtccctga 
aaggcgttgt 
gcattattcg 
aaggatacca 
ttgtgcactg 
tctatcgcag 
gattaccggg 
gatgtcaatt 
ggattgggtg 
gaagaagcgg 



atggcactaa 
ttccagtcaa 
catctcacat 
tggccgatta 
gattcatctg 
ctcagaaatt 
atacctacta 
aaatggatta 
agattggttc 
ttatgcgtgt 
atctcatgtc 
tcacacccca 
tttatgatac 
actttcatga 
gtctcatggc 
acaagaggag 
ttattgactg 
ctatctgggt 
tgtgccccca 
acgagaaata 
accaaattaa 
tcaatagaaa 
aaattgtcca 
tcaacaagtt 
agatgaagag 
tctatgcagt 
tgcagggact 
cactaattaa 



gcaagaaatt 
ccagtatgtg 
catggaccgt 
tgcaaagaca 
ttcgaacatc 
gaggcacgca 
tgaaaatgtt 
tctcttcacc 
ccccactgat 
ggctgtcggc 
gaggcactat 
actttcctcg 
tcttaaggag 
tttgagggca 
gttcctccaa 
aggagctgca 
cagaaagaat 
atctgatctc 
cgagtgccct 
tgaggctgaa 
ttctgcacgt 
gtctaaccaa 
gtacagtgat 
tgtgaagtat 
ggttgtaaaa 
tgacaagacc 
agcagatttg 
caagaggatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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tttgaaacta 
gaaacatatg 
gggaaggaaa 
gagcaattga 
atgcctactg 
tccaacatgt 
attagagagc 
agtggtggat 
actgtatggg 
gttgaccaag 
tcaatgacta 
ggcgcagcta 
gaagctccct 
aagtcctcta 
ggtgcttgtg 



tatactatgg 
agctgtttga 
atattaagaa 
gaagggacat 
catccactgc 
ataatcgtaa 
tcataaaact 
ctattcagac 
aaattaatcc 
ctcaatccct 
tgtatgcatg 
gagctgtcca 
ctcctgttgc 
ttgttgttcc 
aaatgtgttc 



tgccttggaa 
gggtagtcct 
tagggacata 
tatgaagtat 
acagatcctg 
tgtactttca 
gggagaatgg 
gcttcctaat 
tcgtactact 
caacttgttt 
ggagaagggg 
gttcacagtc 
agctttttct 
agatcctgct 
ttcctag 



gcttcatgtg 
ttgagcaaag 
tatttcaact 
ggtgttcaca 
ggcaactctg 
ggatcattcc 
aattcagtaa 
atccctaaat 
ttggacatgg 
gtggaagaac 
atcaagactc 
gacaagaatg 
gctcctgtcc 
gctgctcttt 



aaattgccaa 
ggattttcca 
ctttgccaat 
attcaatgtt 
aatcctttga 
aagtggtgaa 
ctaaacagag 
cgaccaagga 
ctattcagag 
ccgaactcag 
tctattatct 
tactccaaga 
gagaagaaga 
tatgttctat 



agaaaaggga 
atttgacatg 
tcacgattgg 
tgttgctccc 
gcctttaacg 
cgaatatgtg 
gattatggca 
actattcaaa 
aggtatgttt 
caaggtgcgg 
acgcacaaag 
agtcaagaag 
agaggagaag 
caataaccct 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2547 



<210> 221 
<211> 842 
<212> PRT 
<213> SHRIMP 



<400> 221 



Met 


Gly 


Ser 


Asn 


Gin 


Gin 


Gin 


Ser 


Phe 


He 


Ser 


Lys 


Arg 


Asn 


Gly 


Thr 


1 








5 










10 










15 




Lys 


Gin 


Glu 


He 


Ser 


Leu 


Glu 


Lys 


He 


He 


Lys 


Arg 


He 


Glu 


Asn 


Ala 








20 










25 










30 






Cys 


Leu 


Pro 


Val 


Asn 


Gin 


Tyr 


Val 


Pro 


Lys 


Leu 


Asp 


Lys 


Asn 


Ala 


He 






Jo 










A A 
H U 


















Asn 


Pro 


Gin 


Glu 


Leu 


Ala 


Ser 


His 


He 


Met 


Asp 


Arg 


Leu 


Pro 


Ala 


Thr 




50 










55 










60 










He 


Ser 


Phe 


Gin 


Glu 


Met 


Asp 


Asp 


Phe 


Leu 


Ala 


Asp 


Tyr 


Ala 


Lys 


Thr 


65 










70 










75 










80 


Lys 


He 


Val 


Asp 


His 


Pro 


Asp 


Phe 


Gly 


Lys 


Leu 


Ala 


Gly 


Arg 


Phe 


He 










85 










90 










95 




Cys 


Ser 


Asn 


He 


His 


Lys 


Asn 


Thr 


Lys 


Glu 


Trp 


Asn 


Ser 


Phe 


Ser 


Ala 








100 










105 










110 






Thr 


Thr 


Gin 


Lys 


Leu 


Arg 


His 


Ala 


He 


His 


Pro 


Gly 


Thr 


Gly 


Lys 


Pro 






115 










120 










125 








Ala 


Ser 


Val 


Val 


Asn 


Asp 


Thr 


Tyr 


Tyr 


Glu 


Asn 


Val 


Met 


Ala 


Asn 


Ala 




130 










135 










140 










Glu 


lie 


Leu 


Asp 


Ala 


Val 


He 


Asp 


Tyr 


Lys 


Met 


Asp 


Tyr 


Leu 


Phe 


Thr 


145 










150 










155 










160 


Cys 


Phe 


Gly 


Leu 


Arg 


Thr 


Leu 


Glu 


Tyr 


Ser 


Tyr 


Leu 


He 


Lys 


He 


Gly 










165 










170 










175 




Ser 


Pro 


Thr 


Asp 


Arg 


Lys 


Lys 


Arg 


He 


Leu 


Val 


Glu 


Arg 


Pro 


Gin 


Asp 








180 










185 










190 






Met 


He 


Met 


Arg 


Val 


Ala 


Val 


Gly 


He 


His 


Gly 


Ser 


Asp 


He 


Lys 


Ser 






195 










200 










205 








Val 


He 


Glu 


Thr 


Tyr 


Asp 


Leu 


Met 


Ser 


Arg 


His 


Tyr 


Phe 


Thr 


His 


Asp 




210 










215 










220 










Thr 


Leu 


Phe 


Asn 


Cys 


Gly 


Thr 


Val 


Thr 


Pro 


Gin 


Leu 


Ser 


Ser 


Cys 


Phe 


225 










230 










235 










240 


Leu 


Leu 


Gly 


Leu 


Gin 


Asp 


Asp 


Ser 


He 


Glu 


Gly 


He 


Tyr 


Asp 


Thr 


Leu 










245 










250 










255 




Lys 


Glu 


Ala 


Ala 


He 


He 


Ser 


Lys 


Thr 


Ala 


Gly 


Gly 


Leu 


Gly 


lie 


His 








260 










265 










270 






Phe 


His 


Asp 


Leu 


Arg 


Ala 


Lys 


Gly 


Ser 


Pro 


He 


Ser 


Ser 


Trp 


Ser 


Gly 






275 










280 










285 








Thr 


His 


Pro 


Gly 


Leu 


Met 


Ala 


Phe 


Leu 


Gin 


He 


Phe 


Asn 


Val 


Ser 


Val 




290 










295 










300 










Lys 


Lys 


Val 


Ser 


Gin 


Gly 


Gly 


Asp 


Lys 


Arg 


Arg 


Gly 


Ala 


Ala 


Ala 


He 
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O AC 
















Tyr 


Ti- 
ne 


5er 


Asp 


Trp 


niS 


Leu 


Asp 










325 








Lys 


Asn 


ax a 


Gly Asn 


f* 1 u 

blu 


Asp 


Leu 








340 










lie 


Trp 


Val 


Ser 


Asp 


Leu 


rfie 


Met 






ICC 












Trp 


Ser 


Leu 


Met 


Cys 


pne 


Cys 


Fro 




370 










J / o 




Glu 


Phe 


Lys 


Ala 


Leu 


Tyr 


Glu 


Lys 


o o c 

385 
















Glu 


Val 


Val 


Lys 


Ala 


Arg 


Ala 


Leu 










405 








lie 


Glu 


Thr 


Gly 


Thr 


Pro 


Tyr 


vax 








420 










Lys 


Ser 


Asn 


Gin 


Glu 


Asn 


val 


Gly 






/IOC 

43d 










A A ft 

*l *± u 


Thr 


Glu 


lie 


Val 


Gin 


Tyr 


Ser 


Asp 




450 














Leu 


Ala 


Ser 


He 


Ala 


vax 


TV 0n 

Asn 


Lys 


465 
















Ser 


Leu 


Arg 


Pro 


Tyr 


Val 


Asp 


Tyr 










485 








lie 


Met 


Thr 


Arg 


Asn 


Leu 


Asp 


Lys 








500 










Val 


Asp 


Lys 


Thr 


Arg 


lie 


Ser 


Asn 






515 










con 


Gin 


Gly 


Leu 


Ala 


Asp 


Leu 


Phe 


Phe 




530 










535 




Glu 


Glu 


Ala 


Ala 


Leu 


He 


Asn 


Lys 


545 










550 






Gly 


Ala 


Leu 


Glu 


Ala 


Ser 


Cys 


Glu 










565 








Tyr 


Glu 


Leu 


Phe 


Glu 


Gly 


Ser 


Pro 






580 










Asp 


Met 


Gly 


Lys 


Glu 


Asn 


He 


Lys 






595 










£. e\ n 


Leu 


Pro 


lie 


His 


Asp 


Trp 


Glu 


Gin 




610 










C 1 c 




Gly 


Val 


His 


Asn 


Ser 


Met 


Phe 


Val 


625 










C O A 






Ala 


Gin 


lie 


Leu 


Gly 


Asn 


Ser 


GlU 










645 








Met 


Tyr 


Asn 


Axg 


Asn 


Val 


Leu 


Ser 








660 










Tyr 


Val 


lie 


Arg 


Glu 


Leu 


He 


Lys 






o to 










boU 


Lys 


Gin 


Arg 


He 


Met 


Ala 


Ser 


Gly 




690 










/"AC 

feyo 




lie 


Pro 


Lys 


Ser 


Thr 


Lys 


Glu 


Leu 


705 










710 






Pro 


Arg 


Thr 


Thr 


Leu 


Asp 


Met 


Ala 










725 








Gin 


Ala 


Gin 


Ser 


Leu 


Asn 


Leu 


Phe 








740 










Val 


Arg 


Ser 


Met 


Thr 


Met 


Tyr 


Ala 






755 










760 


Tyr 


Tyr 


Leu 


Arg 


Thr 


Lys 


Gly 


Ala 




770 










775 




Asp 


Lys 


Asn 


Val 


Leu 


Gin 


Glu 


Val 


785 










790 





















320 


val 


Lys 


Asp 








v-yt> 


Arg 




330 










335 




Arg 


1 III 


Arg 


ASp 


Leu 




Pro 


Ala 


1A ^ 










-J J V 






LjIU 


Arg 


Vol 


Lys 


Mia 


oiy 


_ 

Lys 


Asn 










jOj 








oxy 


Leu 




Asp 


Val 


Ui c 
111 O 


uxy 


Glu 


















Tyr 


GXU 


Aid 


GXU 


Gxy 


Lys 


uiy 


Lys 


















pne 


Asp 


bin 


x xe 


Asn 


oer 


Ala 


Arg 














11J 




Cys 


pne 


Lys 


Asp 


inr 


Tl - 

ne 


Asn 


Arg 












a "*n 

*i JU 






xxe 


x xe 


Lys 


ber 


oei 


Asn 


Leu 


Cys 










4 /I ^ 

*i O 










GXU 


GlU 


i nr 


Ala 


Val 


Cys 


Asn 


















pne 


IT-, 1 

vax 


Lys 


Tyr 


oer 


Pro 


lie 


trro 






I/O 










HOv 


Arg 


GXU 


Met 


Lys 


Arg 


Vdl 


vai 


Lys 




VI Q A 














vax 


T1 „ 

lie 


Asp 


vax 


Asn 


r*ne 


Tyr 


Til a 
Aid 












n 






Met 


Lys 


Trp 


Met 


Giy 


Leu 


oiy 


Val 


















Lys 


Leu 


Arg 


xxe 


Pro 


pne 


GXU 


Ser 








540 










Arg 


x xe 


Pne 


GXU 


Thr 


xxe 


Tyr 


Tyr 






c c c 










C£ A 

jdU 


He 


Ala 


Lys 


Glu 


Lys 


Gly 


GlU 


Tnr 




c "7 n 










c "7 c 




Leu 


Ser 


Lys 


Gly 


He 


Pne 


Gin 


Pne 


C Q C 

boo 










con 






Asn 


Arg 


Asp 


lie 


Tyr 


Phe 


Asn 


ser 










C ft c 








Leu 


Arg 


Arg 


Asp 


xxe 


Met 


Lys 


Tyr 








620 










Ala 


Pro 


Met 


Pro 


Thr 


Ala 


ser 


inr 






OJ3 












Ser 


pne 


GlU 


Pro 


Leu 


inr 


oer 


Asn 




djU 










ODD 




Gly 


O j-i 

ber 


rtie 


Gxn 


vax 


vax 


Asn 


ulu 


ODD 










v / KJ 






Leu 


Gly 


GXU 


Trp 


Asn 


Ser 


vax 


IPU -r- 

inr 










CO c 
DtJj 








GXy 


Ser 


T 1 „ 

x xe 


Gxn 


inr 


Leu 


Pro 


Asn 








—l ftft 










Phe 


Lys 


Thr 


Val 


Trp 


Glu 


xxe 


Asn 






715 










720 


He 


Gin 


Arg 


Gly 


Met 


Phe 


Val 


Asp 




730 










735 




Val 


Glu 


Glu 


Pro 


Glu 


Leu 


Ser 


Lys 


745 










750 






Trp 


Glu 


Lys 


Gly 


He 


Lys 


Thr 


Leu 










765 








Ala 


Arg 


Ala 


Val 


Gin 


Phe 


Thr 


Val 








780 










Lys 


Lys 


Glu 


Ala 


Pro 


Ser 


Pro 


Val 






795 










800 
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Ala Ala Phe Ser Ala Pro Val Arg 
805 

Ser lie Val Val Pro Asp Pro Ala 
820 

Asn Pro Gly Ala Cys Glu Met Cys 
835 840 



Glu Glu Glu Glu Glu Lys Lys Ser 

810 815 
Ala Ala Leu Leu Cys Ser He Asn 
825 830 
Ser Ser 



<210> 222 
<211> 222 
<212> DNA 
<213> SHRIMP 

<400> 222 

atggtcatac gactctgctt cttggaaagt atcacgtgct ttgtatatgg catcatggca 60 

ccactatctt tggacacaaa taccgattat ttatcccaca aaaaggatac taataaaaaa 120 

atacagatgc agataaactt tattccttat tcaaatatgc acgtatatat agcaggtgtg 180 
tacacatttc atgaaaaaaa ggggttaaca tatcaacaat at 222 

<210> 223 
<211> 74 
<212> PRT 
<213> SHRIMP 

<400> 223 

Met Val He Arg Leu Cys Phe Leu Glu Ser He Thr Cys Phe Val Tyr 

15 10 15 

Gly He Met Ala Pro Leu Ser Leu Asp Thr Asn Thr Asp Tyr Leu Ser 

20 25 30 

His Lys Lys Asp Thr Asn Lys Lys He Gin Met Gin He Asn Phe He 

35 40 45 

Pro Tyr Ser Asn Met His Val Tyr He Ala Gly Val Tyr Thr Phe His 

50 55 60 

Glu Lys Lys Gly Leu Thr Tyr Gin Gin Tyr 
65 70 



<210> 224 
<211> 1848 
<212> DNA 
<213> SHRIMP 



<400> 224 

atggaataca 

gagt taaaac 

gtcatgggtg 

tttgttgtac 

caagaa ttgg 

cctggacatt 

caagaagaag 

aaggacttta 

gaaaaaaaac 

acaatgaacg 

aaggatatga 

atagccaatc 

gtggcgggcg 

tcgacactag 

gagttggcta 

acaaaagttg 

ctagaaacac 

cttttcaaag 



ttggagagaa 
taagatcgtc 
tttatgaagc 
atgtgaagaa 
tagaagatga 
tattcaaaga 
acgtgcgtaa 
aattcagtca 
acgattgtat 
tcacgcacgc 
tggtcccatt 
ctattttcaa 
tttttgcaag 
cctacgctgt 
ggaaaaattt 
aaaaggaact 
aagtagtaaa 
tctctgagga 



aaacaataac 
attcctgatg 
tatcgaatct 
agataaacaa 
ctctttaaga 
tgatgctggg 
aatcaataca 
attatacagg 
gataataagt 
tttaaagtca 
ggttgaaatt 
gagccattct 
gagcgcacac 
tatcttatac 
acaaataaaa 
agaaacacaa 
gaaagaggag 
acaaaaagaa 



cctgttagta 
atcgggaaga 
ataagacaaa 
ctcaaattcg 
attgaacgga 
cacgttactg 
atagtcaagg 
tacatgagca 
tccgatttct 
atggagagaa 
tgtcaccgta 
tcacattgct 
ccttctgctg 
agtgatgaaa 
gaggaactag 
gtagtaaaga 
tacaaaaact 
tctctcagaa 



atgaaagtgt 
aaacaagtaa 
gcgaattgtc 
caagagggtt 
taagttgtgc 
acgaggaatg 
aaaaattaaa 
atagtctttc 
taatcggttt 
ctatgcagaa 
cccattacaa 
tgattgttcc 
caagcattga 
agcaacgcca 
aaaaccaggt 
ccacaaaagt 
cgtatatcga 
atg tacacaa 



atcagaaaag 
atatgagcaa 
cgaagacaca 
aaaaagatta 
ccctcctgaa 
gcttgcaacg 
acgaaaagac 
tgaagcagta 
aggtttcagt 
acatggtttc 
aggagaatat 
cttgtttatg 
aatgtacctt 
gatacgcgaa 
cgaaaagacc 
tgaaaaggaa 
aactgaacaa 
gaaatcttcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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aatgcgacct 
ttctacttgt 
ttgaggtaca 
ctcatcatgg 
caaaacaagt 
agtgatgaag 
atgaatgatg 
cttgatgaag 
atggagaaca 
gaagctcgta 
caaaagaagc 
aggacctggg 
aacactcaag 



tcagatacga 
tgtgcaggac 
ttttctcccc 
ccgtgactgg 
tcaaggtatt 
atttgaaggg 
gcaccgtcta 
aaaatatgac 
catcaaaaat 
cgagaaacaa 
gagaaaatgg 
aaaacgacaa 
aacaacagcg 



cagcggctct 
agacaaaagt 
cataaacaag 
ctgtgacgca 
aaaatgtaac 
tattgtacaa 
tcaagatggg 
ccaatttgaa 
tgtcactagg 
gaccgaaaag 
agaacagtct 
tgaatttgat 
tgtaaagaga 



tgtctcgtct 
ggttcctttg 
aaaagggata 
cctatcgctt 
cgatcaagta 
aaagtgacag 
cagaggatag 
attgaacaac 
tacaacaagg 
tggtttgaga 
accagtgaac 
agcgacgtag 
catgccattt 



tctcgatatc 
aaactgcaac 
ctgcaggtat 
gcaacgacag 
tagttttcca 
gttctgatat 
acatctcttc 
aaaggaagct 
aaagacattt 
aggtaaagaa 
aggagcaaag 
aagaagaaga 
ctgtgtaa 



ctctacagaa 
agaaaatggt 
gaggcccaga 
tatcaaacac 
gacccctcca 
ccgaatcttt 
gcctcaagaa 
ccattccatg 
gaccacaaag 
gagggaggaa 
gggagtaaaa 
agatggaaac 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1848 



<210> 225 
<211> 611 
<212> PRT 
<213> SHRIMP 



<400> 225 



Met 


Glu 


Tyr 


He 


1 








Val 


Ser 


Glu 


Lys 








ZU 


Lys 


Lys 


Thr 


Ser 






35 




Glu 


Ser 


lie 


Arg 




50 






Val 


Lys 


Lys 


Asp 


£ R 

UJ 








Leu 


Val 


Glu 


Asp 


Pro 


Glu 


Pro 


Gly 








100 


Glu 


Glu 


Trp 


Leu 






115 




lie 


Val 


Lys 


Glu 




130 






Gin 


Leu 


Tyr 


Arg 


145 








Lys 


His 


Asp 


Cys 


Phe 


Ser 


Thr 


Met 








180 


Met 


Gin 


Lys 


His 






195 




Cys 


His 


Arg 


Thr 




210 






Lys 


Ser 


His 


Ser 


225 








Gly 


Val 


Phe 


Ala 


Tyr 


Leu 


Ser 


Thr 








2 60 


Gin 


lie 


Arg 


Glu 






275 




Leu 


Glu 


Asn 


Gin 




290 






Thr 


Gin 


Val 


Val 


305 








Val 


Val 


Lys 


Lys 



Gly 


GlU 


Lys 


Asn 


5 








Glu 


Leu 


Lys 


Leu 


Lys 


Tyr 


GlU 


Gin 








40 


Gin 


ser 


GlU 


Leu 






XL C 




Lys 


Gin 


Leu 


Lys 




/ u 






Asp 


Ser 


Leu 


Arg 


85 








His 


Leu 


Phe 


Lys 


Ala 


Thr 


Gin 


Glu 








120 


Lys 


Leu 


Lys 


Arg 






135 




Tyr 


Met 


Ser 


Asn 




150 






Met 


He 


He 


Ser 


165 








Asn 


Val 


Thr 


His 


Gly 


Phe 


Lys 


Asp 








200 


His 


Tyr 


Lys 


Gly 






215 




Ser 


His 


Cys 


Leu 




230 






Arg 


Ser 


Ala 


His 


245 








Leu 


Ala 


Tyr 


Ala 


Glu 


Leu 


Ala 


Arg 








280 


Val 


Glu 


Lys 


Thr 






295 




Lys 


Thr 


Thr 


Lys 




310 






Glu 


Glu 


Tyr 


Lys 



Asn 


Asn 


Pro 


Val 




1U 






Arg 


Ser 


Ser 


Pne 


25 








Val 


Met 


Gly 


Val 


Ser 


Glu 


Asp 


Thr 








60 


Phe 


Arg 


Leu 


Lys 






*7 ^ 




He 


Glu 


Arg 


He 




90 






Asp 


Asp 


Ala 


Gly 


105 








Glu 


Asp 


Val 


Arg 


Lys 


Asp 


Lys 


Asp 








140 


Ser 


Leu 


Ser 


Glu 






155 




Ser 


Asp 


Phe 


Leu 




170 






Ala 


Leu 


Lys 


Ser 


185 








Met 


Met 


Val 


Pro 


Glu 


Tyr 


He 


Ala 








220 


He 


Val 


Pro 


Leu 






235 




Pro 


Ser 


Ala 


Ala 




250 






Val 


He 


Lys 


Asp 


265 








Lys 


Asn 


Leu 


Gin 


Thr 


Lys 


Val 


Glu 








300 


Val 


Glu 


Lys 


Glu 






315 




Asn 


Ser 


Tyr 


He 



Ser Asn Glu Ser 
15 

Leu Met He Gly 
30 

Tyr Glu Ala He 
45 

Phe Val Val His 

Arg Leu Gin Glu 
80 

Ser Cys Ala Pro 
95 

His Val Thr Asp 
110 

Lys He Asn Thr 
125 

Phe Lys Phe Ser 

Ala Val Glu Lys 
160 

He Gly Leu Gly 
175 

Met Glu Arg Thr 
190 

Leu Val Glu He 
205 

Asn Pro He Phe 

Phe Met Val Ala 
240 

Ser He Glu Met 
255 

Glu Lys Gin Arg 
270 

He Lys Glu Glu 
285 

Lys Glu Leu Glu 

Leu Glu Thr Gin 
320 

Glu Thr Glu Gin 
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Leu 


Phe 


Lys 


Val 










J_iy s 


ys 


Ser 


Ser 






jj j 




vai 
















Lys 


Ser 


Gly 


Ser 


385 










Se r 




Tip 

1 1<C 


Leu 


Tie 
11c 


pie u 


nla 








4 

*± £. \J 


Ser 


Tip 
1 -LC 


T \T o 


His 






1 J O 






He 


Val 


Phe 




1 JU 






Val 


Gin 


Lys 


Val 


465 








1 111 


» O J- 


Tyr 


Gin 


T .Ol 1 
iicU 




Glu 


Glu 








500 


Leu 


HlS 


Ser 


Met 






515 




Lys 


Glu 


Arg 


His 




530 






Glu 


Lys 


Trp 


Phe 


545 








Glu 


Asn 


Gly Glu 


Arg 


Thr 


Trp 


Glu 








580 


Glu 


Asp 


Gly Asn 






595 




lie 


Ser 


Val 






610 







325 



Ser 


Glu 


Glu 


Gin 


Asn 


nla 


Thr 
i 1 ii 










360 


Ser 


Ser 


Thr 


Glu 










Phe 


Glu 


Thr 


Ala 




390 






Asn 


Lys 


Lys 


Arg 


405 








Vol 


Thr 


Gly 


Cys 


1 r\ 

uin 


Asn 


Lys 


fne 










will 


Thr 


Pro 


r r~\ 

IT 1 (J 






455 




i nr 


Gly 


Ser 


Asp 




470 






Asp 


Gly Gin 


Arg 


485 








Asn 


Met 


Thr 


Qsin 


Met 


Glu 


Asn 


Thr 








520 


Leu 


Thr 


Thr 


Lys 






535 




Glu 


Lys 


Val 


Lys 




550 






Gin 


Ser 


Thr 


Ser 


565 








Asn 


Asp 


Asn 


Glu 


Asn 


Thr 


Gin 


Glu 



600 



330 

Lys Glu Ser Leu 
345 

Arg Tyr Asp Ser 

Phe Tyr Leu Leu 
380 

Thr Glu Asn Gly 
395 

Asp Thr Ala Gly 
410 

Asp Ala Pro He 
425 

Lys Val Leu Lys 

Ser Asp Glu Asp 
460 

He Arg He Phe 
475 

He Asp He Ser 
490 

Phe Glu He Glu 
505 

Ser Lys He Val 

Glu Ala Arg Thr 
540 

Lys Arg Glu Glu 
555 

Glu Gin Glu Gin 
570 

Phe Asp Ser Asp 
585 

Gin Gin Arg Val 







335 




ni y 


Asn 


Val 


His 




350 






Glv 
oiy 


Ser 


Cys 


Leu 


365 








Cys 


Arg 


Thr 


Asp 


Leu 




±yi 


He 








400 




Arg 


Pro 


Arg 






415 




/Ala. 




Asn 


Asp 




430 








Asn 


Arg 


Ser 


445 








Leu 


Lys 


Gly 


He 


1 It? c 


Asn 


no fcs 


Gly 








480 




IT 1 \J 


V311I 


U1U 






*4 ^ J 




Oil i 


O 111 


m y 


lijr o 




510 






llli 


Arg 


Tyr 


Asn 


525 








Arg 


Asn 


Lys 


Thr 


Gin 


Lys 


Lys 


Arg 








560 


Arg 


Gly 


Val 


Lys 






575 




Val 


Glu 


Glu 


Glu 




590 






Lys 


Arg 


His 


Ala 


605 









<210> 226 
<211> 885 
<212> DNA 
<213> SHRIMP 



<400> 226 

atggagtata 

gagttacagc 

gaagtaatga 

gttttcattg 

ttaagagagc 

cctgagaaag 

gcttctcaag 

ttgtttagag 

atacctttcg 

tttagtgctg 

ggtctgaaat 

gatattgttg 

gtgtatctag 

tatctgttgg 

tgeatgegat 



taggggaaca 
tacgctcttc 
gtacatttga 
tacaattaaa 
tcacagggga 
gaeaegttat 
acaaagacgt 
ggttttattt 
gggaaaagga 
atgatgtcat 
atacatgggt 
taaaccccat 
ggtacatgtt 
cgttagcgtt 
tatgtgagga 



aaaactgata 
ttttctaatg 
agcagt tgaa 
ggaaaa taaa 
caattccctc 
acttaaaaac 
acaggaagta 
cagtccaact 
acgttttgtt 
ggaaaaatta 
gcctgtggcc 
tttcaagagt 
ttcccgtaat 
tgctat tgat 
tatttctgag 



aatctcctgg 
attggagaaa 
actatccgaa 
cacattactt 
aagatagaat 
aegtctacaa 
aataagctcg 
tacaggtata 
gttagcacag 
atagegatag 
gaagtgtgcc 
tatcattccc 
gttcaacccc 
ttgtacggca 
gtgaaaaggg 



atgagacacc 
aacaatatga 
agagtgaatt 
ttgaaggggg 
cattattatc 
ctactgatga 
taaaggaaaa 
taacaaaaag 
attttttgat 
agggaaatat 
atctcaaaaa 
attgectagt 
catctctaga 
gggaagaaat 
ggtaa 



tgaagaagac 
aaagtatgaa 
tagagaegge 
actgaaagaa 
ctctattaag 
cgagtggctt 
gaeaeggatg 
tcttcctcaa 
tggtctcggt 
gcggaaaagt 
atacaagggt 
tattccactg 
agtggagacg 
gegcaagtet 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

885 
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<210> 227 
<211> 290 
<212> PRT 
<213> SHRIMP 



<400> 227 



Met 


Glu 


Tyr 


lie 


1 








Pro 


Glu 


Glu 


Asp 








20 


Glu 


Lys 


Gin 


Tyr 






35 




Val 


Glu 


Thr 


lie 




50 






Gin 


Leu 


Lys 


Glu 


65 








Leu 


Arg 


Glu 


Leu 


Ser 


Ser 


lie 


Lys 








100 


Thr 


Thr 


Thr 


Asp 






115 




Glu 


Val 


Asn 


Lys 




130 






Phe 


Tyr 


Phe 


Ser 


145 








Phe 


Gly 


Glu 


Lys 


Leu 


Gly 


Phe 


Ser 








180 


Gly 


Asn 


Met 


Arg 






195 




Glu 


Val 


Cys 


His 




210 






lie 


Phe 


Lys 


Ser 


225 








Leu 


Gly 


Tyr 


Met 


Glu 


Thr 


Tyr 


Leu 








2 60 


Met 


Arg 


Lys 


Ser 






275 




Arg 


Gly 








290 







Gly Glu Gin Lys 
5 

Glu Leu Gin Leu 

Glu Lys Tyr Glu 
40 

Arg Lys Ser Glu 
55 

Asn Lys His lie 
70 

Thr Gly Asp Asn 
85 

Pro Glu Lys Gly 

Asp Glu Trp Leu 
120 

Leu Val Lys Glu 
135 

Pro Tyr Tyr lie 
150 

Glu Arg Phe Val 
165 

Ala Asp Asp Val 

Lys Ser Gly Leu 
200 

Leu Lys Lys Tyr 
215 

Tyr His Ser His 
230 

Phe Ser Arg Asn 
245 

Leu Ala Phe Ala 

Cys Met Arg Leu 
280 



Leu 


He 


Asn 


Leu 




10 






Arg 


Ser 


Ser 


Phe 


25 








Glu 


Val 


Met 


Ser 


Phe 


Arg 


Asp 


Gly 








60 


Thr 


Phe 


Glu 


Gly 










Ser 


Leu 


Lys 


lie 




90 






His 


Val 


He 


Leu 


105 








Ala 


Ser 


Gin 


Asp 


Lys 


Thr 


Arg 


Met 








13U 


Thr 


Lys 


Ser 


Leu 






155 




Val 


Ser 


Thr 


Asp 




170 






Met 


Glu 


Lys 


Leu 


185 








Lys 


Tyr 


Thr 


Trp 


Lys 


Gly 


Asp 


He 








220 


Cys 


Leu 


Val 


He 






235 




Val 


Gin 


Pro 


Pro 




250 






He 


Asp 


Leu 


Tyr 


265 








Cys 


Glu 


Asp 


He 



Leu 


Asp 


Glu 


Thr 










Leu 


Met 


lie 


(jiy 




*a ft 

-30 






Thr 


Pne 


GiU 


a _La 


45 








Val 


Phe 


lie 


vaj. 


Giy 


Leu 


Lys 


blU 








80 


Glu 


Ser 


Leu 


Leu 










Lys 


Asn 


Thr 


Ser 




110 






Lys 


Asp 


Val 


Gin 


125 








Leu 


Phe 


Arg 


Gly 


Pro 


Gin 


lie 


Pro 








160 


Phe 


Leu 


He 


Gly 






175 




He 


Ala 


He 


Glu 




190 






Val 


Pro 


Val 


Ala 


205 








val 


Val 


Asn 


Pro 


Pro 


Leu 


Val 


Tyr 








240 


Ser 


Leu 


Glu 


Val 






255 




Gly Arg 


Glu 


Glu 




270 






Ser 


Glu 


Val 


Lys 


285 









<210> 228 
<211> 2769 
<212> DNA 
<213> SHRIMP 



<400> 228 

atggagtcaa 

gatcaagttt 

aaggccaagt 

agtccacaca 

actcctccag 

cttatgggca 

tttctggaga 

tcggtactgt 

tgtgatgggt 



tcaaggaaca 
atgaatttga 
ttgctgctgg 
cggaaactgt 
aaatcacacc 
aggtgccagc 
atgacgaaga 
tcaatagtgt 
gtgtctcaaa 



gcagcagcag 
ggatactact 
aagaagaatg 
ggaagaagtt 
cgccgagaag 
tcatcttgac 
tgaggacgaa 
gatgaactct 
ggtcaagtcg 



cagccaactg 
actacttctg 
gtctctaagc 
gtaggagagg 
aagcagcaat 
gtgagtgttc 
gaattggaga 
ggacgcacag 
gcatttgaag 



tcactttctc 
ccaagaaacc 
agaggaggaa 
aagaagaaca 
ctcttcaaga 
tggccaaatc 
agaataagaa 
aactgtcccc 
gaaaggatct 



tgaggaacca 
aactccttcc 
cactatcagg 
gcagcagcag 
actggacgct 
tgttgctgaa 
ggcgcagaaa 
ttccaccttt 
ggttagcaac 



60 

120 

180 

240 

300 

360 
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480 

540 
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attgtcaagg ttgagggaga agctgtgaag 
ctggccaatc tattccttgg ttgtatgaat 
acactcaaca agaaggccct tgataaggga 
gctgtatatg tggacgagat ggatttggag 
gataagtcac tgtttaagga acttggagga 
ggattggtca tgtcaactcc cagttcttcg 
accactaacc aaattgtgga agaaagtgtc 
ttcaagaacg acaagtggtt ggccaagagg 
gtatttggag aagaggatga cgaaaagtca 
gacgaagatg aaaacgagga ggaagttgat 
tcagttggaa atgtcatcaa gaatcttatt 
gaggaaaagg aagagggtga gcagtcagag 
gacgacgcct catcagtgtg ttcttcttcc 
gctgctgaag aagaagagga agaggatgag 
gtagaggatg aagatgacaa ggaaagtgtt 
gatgaggatg atgatggtgc aacttcccag 
gagtgcgaat tcgacgagag cgatgggaac 
agaccatctg cttcagtgac caaatatcct 
ctatcaaaga acagatcaac tcctgtctgt 
atcaccaact cattcaacat gtccactatc 
actgaggctg ctgaggattg tgtgcacatc 
gatgaggagg atcagggact tgatctgcaa 
gtggacaatc ttattatctt ttctatgggt 
gtagattatg ataagagttt ctggtcctct 
agcaagaaga gtgctcttat taatgctcta 
gctactgtcc atacactggc tattccattc 
aacactactc cttatggaag ctacaagacc 
gggcaacata acgacatcaa caacaattgg 
aatcttctaa agagggaagt tgttgagaac 
cttattgttg acggaggcgt gggagaaaat 
aatgaggata tttttggaag cattgaagcg 
gcctttaaga atgttgccaa gaaatgtgac 
actgggcctt tcaagcaata cctcattgat 
atcatggagc cttgtgaggg ggatgaaact 
gcatacaaac aagcactcca cagggcaaag 
atctggcacg agatgatcac cagggatatg 
attcctgatt tttataaata tgttcaggta 
ttggattag 



aaaactgcaa tcgccactga tactacaaaa 600 
ctccagttcc atgagcacgt gactattgaa 660 
ggccctctct tcacattgaa gctctctgat 720 
aagaagaggc agatatttgg aagtaatgga 780 
aactatattg atagtgcaat caagtcaact 840 
tctaccaaga aggcaggaac acatttcaag 900 
actgaatcta tgagaaatgg atgttgttgc 960 
gagagtaacc tcaagtcact gaacaatact 1020 
gcatatgcat acagcgacag tgaggatgaa 1080 
tacgattaca acaatgaaac tattgaaagt 1140 
agaaaaacta tcggtctcag tgatgtagag 1200 
gaagaagaag aagactccga tgatgatgat 1260 
tcttcgtctt catctgtcac tgttgtagct 1320 
gaggataagg ataaggatac agcaacagta 1380 
atcagcagta gcagtgaaga ttcagaagaa 1440 
tgttcagaag ttgtgtttgg ggatgtaaca 1500 
cctctctacc tggcttcaga caacagcttc 1560 
caatctgagg aagagatgga tgtttctctc 1620 
ctatctctat gcagacactc ctctggatgc 1680 
ctcaagtctc ttaagttgtt ccctgccgga 1740 
gagtctacaa agaagaagga tgaggatgaa 1800 
aacagccagt actactcagt acttgtcgat 1860 
tccactacat acgaatcttc aatggtggag 1920 
ttcgacaagt ctgtgaagcc ttattgtgag 1980 
tgtgaagata atgtgactgc caaggtatac 2040 
tgtgaatcta tgcccattaa ccacattaat 2100 
ttcagaatta gtttgccagg gaacttttct 2160 
agatctgaca tgtacactaa gatggttgaa 2220 
aagacacaca gccgcagata cgttcgcaac 2280 
agtggcaact acttgaaggt gcacgagaat 2340 
aattcaatgt ctgccaagac tgctgctgcc 2400 
ctcatccaga ctacaaccaa cgacatcttg 24 60 
tacaagtata attcggccag aaagaatatt 2520 
acagcacatg agatgaagag ggctcaagac 2580 
attacagcaa gctccatctc gctgaggggc 2640 
aatacaactt acaatagcat gtttatgtat 2700 
tcacctgtaa atgtatcacc attatatatg 2760 

2769 



<210> 229 
<211> 922 
<212> PRT 
<213> SHRIMP 



<400> 229 



Met 


Glu 


Ser 


lie 


Lys 


Glu 


Gin 


Gin 


Gin 


Gin 


Gin 


Pro 


Thr 


Val 


Thr 


Phe 


1 








5 










10 










15 




Ser 


Glu 


Glu 


Pro 
20 


Asp 


Gin 


Val 


Tyr 


Glu 
25 


Phe 


Glu 


Asp 


Thr 


Thr 
30 


Thr 


Thr 


Ser 


Ala 


Lys 
35 


Lys 


Pro 


Thr 


Pro 


Ser 
40 


Lys 


Ala 


Lys 


Phe 


Ala 
45 


Ala 


Gly 


Arg 


Arg 


Met 
50 


Val 


Ser 


Lys 


Gin 


Arg 
55 


Arg 


Asn 


Thr 


He 


Arg 
60 


Ser 


Pro 


His 


Thr 


Glu 


Thr 


Val 


Glu 


Glu 


Val 


Val 


Gly 


Glu 


Glu 


Glu 


Glu 


Gin 


Gin 


Gin 


Gin 


65 










70 










75 










80 


Thr 


Pro 


Pro 


Glu 


lie 
85 


Thr 


Pro 


Ala 


Glu 


Lys 
90 


Lys 


Gin 


Gin 


Ser 


Leu 
95 


Gin 


Glu 


Leu 


Asp 


Ala 
100 


Leu 


Met 


Gly 


Lys 


Val 
105 


Pro 


Ala 


His 


Leu 


Asp 
110 


Val 


Ser 


Val 


Leu 


Ala 
115 


Lys 


Ser 


Val 


Ala 


Glu 
120 


Phe 


Leu 


Glu 


Asn 


Asp 
125 


Glu 


Asp 


Glu 
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Asp 


Ul LL 


ox u 


Leu 


(Z 1 ii 
OX LI 


Lys 


Asn 


Lys 


Lys 


nlo 


bin 


T 

Lys 


O A W 

oci 


v aX 


Leu 


rue 




X O \J 










X -J J 










i a n 










Asn 


Cav 
OCX 


vol 




Asn 


ofcr i 


r i ., 

oiy 


Arg 


inr 


blU 


Leu 


Ser 


D W A 


c a r 


inr 


DW A 


J. *i o 










i JU 










X — ) -3 










i fin 

X 


Cys 


Asp 


uiy 


Cys 


vai 


Ser 


Lys 


vai 


Lys 


Caw 

oer 


Aia 


rne 


flu 
blu 


biy 


Lys 


Asp 










lQJ 










X / \J 










x r j 




Leu 


v a i 


Ser 


Asn 


lie 


vai 


Lys 


vai 


blu 


biy 


GlU 


TV i _ 

Aia 


vai 


Lys 


Lys 


i nr 








i an 

iou 










loo 










i on 








Tift 

lie 


Aid 


i nr 


Asp 


i nr 


inr 


Lys 


Leu 


Aia 


Asn 


Leu 


pne 


Leu 


Gly 


Cys 






i y b 










o r* n 










One 

i:Ub 








Ma w 


Asn 


T mi 

Leu 


laln 


rne 


MIS 


^i ii 

blU 


HIS 


vai 


inr 


T 1 a 

lie 


r* l ii 
CalU 


Thr 


T Al« 

Leu 


Asn 


Lys 




Z X u 






























Lys 


Aia 


Leu 


Asp 


Lys 


Qaiy 


biy 


*D WA 

Fro 


T All 

Leu 


rne 


Tnr 


Leu 


Lys 


Leu 


Caw 

ber 


Asp 






















ZOO 










H V 




vai 


Tyr 


vai 


Asp 


C 1 »i 

blU 


KXa4- 

Met 


Asp 


Leu 


G1U 


Lys 


Lys 


71 w*v 

Arg 


Gin 


T 1 A 

lie 


Pne 










9 4 










250 










*> ^ r 




Giy 


C A W 

oer 


Asn 


oiy 


Asp 


Lys 


Ser 


T All 

Leu 


Phe 


Lys 


Glu 


T All 

Leu 


Gly 


Gly 


Asn 


Tyr 








■? fin 










265 










Old 






T 1 a. 

lie 


Asp 


Caw 

ber 


Aia 


lie 


Lys 


Caw 


inr 


Gly 


Leu 


Val 


Met 


Ser 


Thr 


Pro 


Ser 






*? *7 *^ 










<:ou 


















C A W 




Caw 

ber 


w 

i nr 


Lys 


Lys 


Ala 


bly 


Thr 


His 


Phe 


Lys 


TW w 

Tnr 


i nr 


Asn 


Gin 




O Q f\ 

zyu 






























T 1 a 

lie 


vai 


blU 


blU 


C a w 


vai 


■pu w 

inr 


blU 


Ser 


Met 


Arg 


Asn 


Gly 


Cys 


Cys 


Cys 


jUD 










JlU 










315 










-ion 
o^U 


pne 


Lys 


Asn 


Asp 


Lys 


Trp 


Leu 


71 1 -» 

Ala 


Lys 


Arg 


Glu 


Ser 


Asn 


T A.-l 

Leu 


Lys 


Ser 










O^ b 










330 










*5 "a 
job 




Leu 


Asn 


Asn 


Tnr 


val 


pne 


Gly 


G1U 


Glu Asp Asp 


GlU 


Lys 


Ser 


Ala 


Tyr 








A C\ 

o4 U 










345 










350 






Ala 


Tyr 


Ser 


Asp 


Ser 


Glu 


Asp 


Glu 


Asp 


Glu 


Asp 


Glu 


Asn 


Glu 


Glu 


Glu 


































Val 


Asp 


Tyr 


Asp 


Tyr 


Asn 


Asn 


Glu 


Thr 


He 


Glu 


Ser 


Ser 


Val 


Gly 


Asn 




J /u 










*3 *7 C 

375 










3oO 










val 


lie 


Lys 


Asn 


Leu 


lie 


Arg 


Lys 


Thr 


He 


Gly 


Leu 


Ser 


Asp 


Val 


Glu 


^ o c 

job 










o o r\ 










395 










4 00 


Glu 


Glu 


Lys 


Glu 


Glu 


Gly 


Glu 


Gin 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Ser 










4 lib 










410 










415 




Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Ala 


Ser 


Ser 


Val 


Cys 


Ser 


Ser 


Ser 


Ser 


Ser 








A O A 










425 










430 






Ser 


Ser 


Ser 


val 


Thr 


vai 


Val 


Ala 


Ala 


Ala 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 






4 Jb 










440 










445 








Asp 


Glu 


GlU 


Asp 


Lys 


Asp 


T <*a 

Lys 


7A AWV 

Asp 


Thr 


Ala 


Thr 


vai 


Val 


Glu 


Asp 


GlU 




450 










J CL C, 










A CC\ 

4 oU 










Asp 


Asp 


Lys 


r* i ii 

bill 


Caw 


vai 


T 1 A 

lie 


C A W 


Ser 


Ser 


Ser 


Al 

GlU 


Asp 


Ser 


GlU 


GlU 


465 










AH f\ 










475 












Asp 


Glu 


ASp 


Asp 


TV r»w* 

ASp 


oiy 


Aia 


Thr 


Ser 


Gin 


Cys 


Ser 


Glu 


Val 


val 


Phe 




















490 










4 y b 




Gly Asp 


vai 


inr 


blU 


Cys 


blU 


rne 


Asp 


Glu 


Ser 


Asp 


Gly Asn 


Pro 


Leu 


















505 










510 






Tyr 


Leu 


Ala 


Ser 


71 *-w. 

ASp 


Asn 


Caw 

ser 


Pne 


Arg 


Pro 


Ser 


Ala 


Ser 


Val 


Thr 


Lys 






515 










520 










525 








Tyr 


Pro 


Gin 


Ser 


Glu 


Glu 


Glu 


Met 


Asp 


Val 


Ser 


Leu 


Leu 


Ser 


Lys 


Asn 




530 










535 










540 










Arg 


Ser 


Thr 


Pro 


Val 


Cys 


Leu 


Ser 


Leu 


Cys 


Arg 


His 


Ser 


Ser 


Gly 


Cys 


545 










550 










555 










560 


lie 


Thr 


Asn 


Ser 


Phe 


Asn 


Met 


Ser 


Thr 


He 


Leu 


Lys 


Ser 


Leu 


Lys 


Leu 










565 










570 










575 




Phe 


Pro 


Ala 


Gly 


Thr 


Glu 


Ala 


Ala 


Glu 


Asp 


Cys 


Val 


His 


He 


Glu 


Ser 








580 










585 










590 






Thr 


Lys 


Lys 


Lys 


Asp 


Glu 


Asp 


Glu 


Asp 


Glu 


Glu 


Asp 


Gin 


Gly 


Leu 


Asp 






595 










600 










605 








Leu 


Gin 


Asn 


Ser 


Gin 


Tyr 


Tyr 


Ser 


Val 


Leu 


Val 


Asp 


Val 


Asp 


Asn 


Leu 
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lie 


Tl 0 

lie 


IT IlC 




625 








Val 


Asp 


Tyr 


Asp 


Pro 


Tyr 


Cys 


bill 










ASp 


Asn 


val 


Tnr 






D tO 




Pro 


Phe 


Cys 


Glu 




690 






Tyr 


Gly 


Ser 


Tyr 


705 








Gly 


Gin 


His 


Asn 


Lys 


Met 


Val 


Glu 








/ 4 U 


His 


Ser 


Arg 


Arg 






"7 C C 

755 




Glu 


Asn 


Ser 


Gly 




770 






Tit- _ 

Pne 


Gly 


Ser 


lie 


785 








Ala 


Phe 


Lys 


Asn 


Asn 


Asp 


He 


Leu 








820 


Tyr 


Asn 


Ser 


Ala 






835 




Glu 


Thr 


Thr 


Ala 




850 






Ala 


Leu 


His 


Arg 


865 








He 


Trp 


His 


Glu 


Met 


Phe 


Met 


Tyr 








900 


Val 


Asn 


Val 


Ser 






915 









615 




Mat- 


oiy 


Ocl 


•phr 
1 i IX 




oou 






Lys 




Pha 
tilt! 


Trp 


645 








Ser 


Lys 


Lys 


Ser 


nla 


Lys 


Vdl 


Tyr 








oou 


ber 




Ova 

fro 


lie 










Lys 


ini 


fne 


Arg 




Tin 






Asp 


lie 


Asn 


Asn 


725 








Asn 


Leu 


Leu 


Lys 


Tyr 


vai 


Arg 


Asn 








*7 d(\ 


Asn 


Tyr 


Leu 


Lys 






"7 "7 

/to 




blU 


Ala 


Asn 


ber 




i on 
/ yu 






vai 


Aia 


Lys 


Lys 


805 








Thr 


Gly 


Pro 


Phe 


Arg 


Lys 


Asn 


He 








840 


His 


Glu 


Met 


Lys 






855 




Ala 


Lys 


He 


Thr 




870 






Met 


He 


Thr 


Arg 


885 








He 


Pro 


Asp 


Phe 


Pro 


Leu 


Tyr 


Met 








920 



620 



Thr 


Tvr 
j 


Glu 


Ser 






V? J -J 




Ser 


Ser 


Phe 


Asp 




uJU 






Ala 


Leu 


He 


Asn 


665 








nla 


THt- 
ifll 


Vdl 




Asn 


fllo 


1 1*5 


Asn 








*?nn 
/ uu 


Tip 


C ^ y- 


Leu 


Prrt 






r 1 0 




Asn 


irp 


Arg 


Cor 
OCX. 




/ Ju 






Arg 


Glu 


Val 


Val 


745 








Leu 


Tip 

lie 


v ax 


7\ en 
nop 


\7=» 1 

vai 


nlS 


olli 


Asn 










Met. 




Mia 


Lys 










Cys 


Asp 


Leu 


Tip 




ft 1 A 
O 1U 








Gin 

will 


Tut 
iyi 


Leu 


825 








He 


Met 


Glu 


Pro 


Arg 


Ala 


Gin 


Asp 








860 


Ala 


Ser 


Ser 


He 






875 




Asp 


Met 


Asn 


Thr 




890 






Tyr 


Lys 


Tyr 


Val 


905 








Leu 


Asp 







Ser 


Met 


Val 


Glu 








640 


Lys 


Ser 


Val 


Lys 






655 




Ala 


Leu 


Cys 


Glu 




670 






Thr 


Leu 


Ala 


He 


685 








Asn 


i ni 


1 1 IX 


Pro 


oiy 


Asn 


tilt? 










190 


Asp 






Thr 










Glu 


Asn 


Lys 


Thr 










Gly 


Gly 


Val 


Gly 


765 








Asn 


Glu 


Asp 


He 


i nr 


nla 


nld 


nld 










bin 


i nr 


1 £1X7 


TK t- 

i n.xr 






Ol J 




He 


Asp 


Tyr 


Lys 




830 






Cys 


Glu 


Gly 


Asp 


845 








Ala 


Tyr 


Lys 


Gin 


Ser 


Leu 


Arg 


Gly 








880 


Thr 


Tyr 


Asn 


Ser 






895 




Gin 


Val 


Ser 


Pro 




910 







<210> 230 
<211> 846 
<212> DNA 
<213> SHRIMP 



<400> 230 

atgtctcctg 

ataattagta 

gaagaagaac 

ccattttctc 

tcaaagggtg 

caggacgtcc 

accgacaagt 

tacaaategg 

agatatgttc 

acttttatac 

cctcgtccaa 

agctgttctt 

ttccaagtgt 

tgggttgaac 



ttatctccca 
ctgccaacct 
aacaggaagt 
cagcactcac 
tgtgcttaaa 
ctccagaact 
tcctttcatc 
ctctaagcac 
ccttaaacga 
cagaaaagac 
ttacagctaa 
cttcttcttc 
cttcatccat 
aaaatgagct 



acaatcttcc 
tagagttcta 
tgaaccagaa 
aatctgeata 
taggaaaaag 
ageaaatgeg 
ccactgttct 
aacaagaaat 
agataatgtg 
gggaagaaga 
attatgttgt 
tgcgtcctcc 
atttttcaag 
ggatgcagaa 



ccttctgcta 
ggtgtcaaaa 
attattgaac 
tacatcaatg 
ataaaaccta 
tccagttatc 
atatgeaact 
ggagatcagc 
gtagtccaga 
attttgtggt 
cttctggaaa 
tccagcaacg 
aaagaagagt 
tcacctgttc 



ccagcacggc 
ataaggaaga 
cagctaccga 
egaategcat 
cttcaacaat 
ttgtacaaac 
ataacgtgaa 
ccttgatgag 
aaggaacata 
tctctcatta 
ctataaactc 
ccccagggcc 
gttgtcccct 
ttgtacttct 



tgeagecaga 60 
gaaagatgag 120 
ttttgagata 180 
acacattaat 240 
aaacaaaaac 300 
tgaacacgta 360 
egaeggggaa 420 
aaagtcggtc 480 
ttatgggact 540 
caagaagtct 600 
gttcaaegge 660 
tattgaagaa 720 
gcaaatgaag 780 
aatgttagcg 840 
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ctatga 

<210> 231 
<211> 281 
<212> PRT 
<213> SHRIMP 



<400> 231 



Met 


Ser 


Pro 


vai 


ne 


oci 


Gin 


Gin 


J. 
















Ala 


T\ 1 _ 

Ala 


T\ 1 -s 

Aia 


Arg 


ixe 


lie 


Ser 


Thr 








*> n 










Lys 


Asn 


Lys 


UrlU 


oXU 


Lys 


Asp 


Glu 
















40 


Pro 


Glu 


T 1 _ 

ixe 


lie 


bill 


fro 


Ala 


Thr 




cn 

ou 










55 




Ala 


Leu 


Thr 


ne 


Cys 


lie 


Tyr 


He 


OO 










/ u 






Ser 


Lys 


Gly 


val 


Cys 


Leu 


Asn 


Arg 










o c 








lie 


Asn 


Lys 


Asn 


Gin 


Asp 


Val 


Pro 








100 










Tyr 


Leu 


Val 


Gin 


Thr 


G1U 


His 


Val 






115 










120 


Cys 


Ser 


lie 


Cys 


Asn 


Tyr 


Asn 


Val 




130 










135 




Leu 


Ser 


Thr 


Thr 


Arg 


Asn 


Gly Asp 


145 










150 






Arg 


Tyr 


Val 


Pro 


Leu 


Asn 


Glu 


Asp 










165 








Tyr 


Tyr 


Gly 


Thr 


Thr 


Phe 


He 


Pro 








180 










Trp 


Phe 


Ser 


His 


Tyr 


Lys 


Lys 


Ser 






195 










200 


Cys 


Cys 


Leu 


Leu 


Glu 


Thr 


He 


Asn 




210 










215 




Ser 


Ser 


Ser 


Ala 


Ser 


Ser 


Ser 


Ser 


225 










230 






Phe 


Gin 


Val 


Ser 


Ser 


Ser 


He 


Phe 










245 








Leu 


Gin 


Met 


Lys 


Trp 


Val 


Glu 


Gin 








260 










Val 


Leu 


Val 


Leu 


Leu 


Met 


Leu 


Ala 






275 










280 



846 



OCT I. 


OCI 


Pro 




Ala 


Thr 


Car 


X 11 X 




1 0 
x \j 














ri-Lo 


Asn 


Leu 


Arg 


V al 


Leu 




Val 


£- —J 










30 






OX LI 


VJ J. VI 




UJ.II 


OKI 


VJ1U 


v ox 


Glu 


















Asp 


rne 




Tip 

x ic 




pho 
JTUfcr 


OCX 


r x \j 


















Asn 


a i a 

i\±a 


Asn 


Arg 


X 1c 


Hi «=: 


Tip 

X X65 


Asn 
















fin 


Lys 


Lys 


lie 


i#ys 


trro 


lm 


oexT 


Th t- 

1 11 X 




y \J 














Pro 


LtlU 


Leu 


nla 


Asn 


nla 


Ser 




1UD 
















i nr 


Asp 


Lys 


JTIltS 


Leu 


Ser 


OCX 


His 










it j 








Asn 


Asp 


Gly 


Glu 


Tyr 


Lys 


Ser 


Ala 








140 










Gin 


Pro 


Leu 


Met 


Arg 


Lys 


Ser 


Val 






155 










160 


Asn 


Val 


Val 


Val 


Gin 


Lys 


Gly 


Thr 




170 










175 




Glu 


Lys 


Thr 


Gly 


Arg 


Arg 


He 


Leu 


185 










190 






Pro 


Arg 


Pro 


He 


Thr 


Ala 


Lys 


Leu 










205 








Ser 


Phe 


Asn 


Gly 


Ser 


Cys 


Ser 


Ser 








220 










Asn 


Ala 


Pro 


Gly 


Pro 


He 


Glu 


Glu 






235 










240 


Phe 


Lys 


Lys 


Glu 


Glu 


Cys 


Cys 


Pro 




250 










255 




Asn 


Glu 


Leu 


Asp 


Ala 


Glu 


Ser 


Pro 


265 










270 






Leu 

















<210> 232 
<211> 1470 
<212> DNA 
<213> SHRIMP 



<400> 232 

atggaatctg 

aaaaagatac 

ggatatggag 

ttggcaagat 

agtgcgggaa 

tgtgggttta 

tataaaataa 

gccatctgtg 

actaaagaag 



taagagacgt 
agcgcctaaa 
gagcaagact 
gtctctataa 
atattatagt 
atgtcaataa 
caggattaaa 
atttgttcct 
tagggaaaaa 



taaattttac 
caaggataaa 
catggatgtg 
ctgcgatgga 
atattcttta 
tcgcctacga 
tttccctgaa 
ggaggtggct 
acaacaacaa 



acttttatga 
ggttggagga 
cgcttcacgg 
gaatatacaa 
gcctttataa 
atgggaaaaa 
actgtaaaat 
gctcttcaag 
catttcaacg 



atgtattggc 
cttcaataaa 
gcaggaaatc 
ctcttagact 
tgggaatcag 
taatagacag 
gcacatgtga 
aacaccccgc 
aattcgggtc 



agagaaggcg 
cgctgaaatt 
aatggacgaa 
cgtgggttca 
aggcgagtgt 
agagttgttt 
tggagtacga 
ctggcatgaa 
ccagtaccct 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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ggcacaaaat 
gaagagaaga 
tcagatttgt 
aatatggaag 
tctaggtggg 
ataggctcca 
gatttagttc 
aggttcaatg 
cctgtacttt 
aaatccctct 
tttgattcga 
gatagtctat 
aaggaaaata 
gggaaacact 
ataacaccat 
ggtcctagaa 



tcaataagcg 
ctatggaaca 
acgtagaagc 
gaagagtagg 
aaaagaaacc 
aaaaaattgt 
ctctagttcg 
attctactga 
ctaaaataat 
ccaaactagc 
atgattatgt 
caaaaaatca 
aaacaacacc 
tgttcccttt 
ggcaattaaa 
aacgacctcg 



gcacaaacta 
agttctagcc 
acctatacaa 
tgccatgtat 
agggtataaa 
tgatgacgag 
attggcccca 
tatattaatg 
atggcaaaac 
caaatggaaa 
cattgaacat 
ccttccaaat 
cttaatatgt 
agaggaagtt 
ggttgtccag 
cacacagtaa 



tccactaaaa 
tttagtgaag 
tacgtggtta 
aatttgtcga 
aatgactttt 
agctttattt 
tcaaacgaag 
gacagtattg 
gtttctgcac 
tggaatggta 
aaacgtcaac 
tttagtaaaa 
tggaattata 
tgcggattct 
aaaaagagag 



taattcagca 
gaactgccgc 
atatgtatag 
gagttttaat 
atagtaagtg 
tcaccgatct 
atatacagag 
acgtgaggga 
gtctaaaatt 
tggtgtcaac 
ttgcagcgga 
caataacgga 
tttttgaatt 
atgaggcgag 
gaaggcagat 



aatgttttca 
tagtggattt 
agctatcagt 
actcttatgt 
tgagatgtac 
cataactggg 
agatgttata 
tgttgttttg 
gcgcaacaat 
acacgacaac 
tataatgtcc 
gtacgatgaa 
gtcccctatg 
tttacctctc 
ggtcatctac 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1470 



<210> 233 
<211> 487 
<212> PRT 
<213> SHRIMP 



<400> 233 







Ser 


vai 


1 








Ala 


Glu 


Lys 


Ala 








20 


Arg 


Thr 


Ser 


He 






35 




Asp 


Val 


Arg 


Phe 




50 






Leu 


Tyr 


Asn 


Cys 


65 








Ser 


Ala 


Gly 


Asn 


Arg 


Gly 


Glu 


Cys 








100 


Lys 


lie 


lie 


Asp 






115 




Thr 


Val 


Lys 


Cys 




130 






Leu 


Glu 


Val 


Ala 


145 








Glu 


val 


Gly 


Lys 


Tyr 


Pro 


Gly 


Thr 








180 


lie 


Gin 


Gin 


Met 






195 




Phe 


Ser 


Glu 


Gly 




210 






Ala 


Pro 


He 


Gin 


225 








Glu 


Gly 


Arg 


Val 


Leu 


Cys 


Ser 


Arg 








260 


Ser 


Lys 


Cys 


Glu 






275 




Ser 


Phe 


He 


Phe 




290 







Arg 


Asp 


Val 


Lys 


5 








Lys 


Lys 


lie 


Gin 


Asn 


Ala 


Glu 


He 








40 


Thr 


Gly 


Arg 


Lys 






55 




Asp 


Gly 


Glu 


Tyr 




70 






He 


He 


Val 


Tyr 


85 








Cys 


Gly 


Phe 


Asn 


Arg 


Glu 


Leu 


Phe 








120 


Thr 


Cys 


Asp 


Gly 






135 




Ala 


Leu 


Gin 


Glu 




150 






Lys 


Gin 


Gin 


Gin 


165 








Lys 


Phe 


Asn 


Lys 


Phe 


Ser 


Glu 


Glu 








200 


Thr 


Ala 


Ala 


Ser 






215 




Tyr 


Val 


Val 


Asn 




230 






Gly 


Ala 


Met 


Tyr 


245 








Trp 


Glu 


Lys 


Lys 


Met 


Tyr 


He 


Gly 








280 


Thr 


Asp 


Leu 


He 






2 95 





Phe 


Tyr 


Thr 


Phe 




10 






Arg 


Leu 


Asn 


Lys 


25 








Gly 


Tyr 


Gly 


Gly 


Ser 


Met 


Asp 


Glu 








60 


Thr 


Thr 


Leu 


Arg 






75 




Ser 


Leu 


Ala 


Phe 




90 






Val 


Asn 


Asn 


Arg 


105 








Tyr 


Lys 


He 


Thr 


Val 


Arg 


Ala 


He 








140 


His 


Pro 


Ala 


Trp 






155 




His 


Phe 


Asn 


Glu 




170 






Arg 


His 


Lys 


Leu 


185 








Lys 


Thr 


Met 


Glu 


Gly 


Phe 


Ser 


Asp 








220 


Met 


Tyr 


Arg 


Ala 






235 




Asn 


Leu 


Ser 


Arg 




250 






Pro 


Gly 


Tyr 


Lys 


265 








Ser 


Lys 


Lys 


He 


Thr 


Gly 


Asp 


Leu 



300 



Met Asn Val Leu 
15 

Asp Lys Gly Trp 
30 

Ala Arg Leu Met 
45 

Leu Ala Arg Cys 

Leu Val Gly Ser 
80 

He Met Gly He 
95 

Leu Arg Met Gly 
110 

Gin Phe Pro Glu 
125 

Cys Asp Leu Phe 

His Glu Thr Lys 
160 

Phe Gly Ser Gin 
175 

Ser Thr Lys He 
190 

Gin Val Leu Ala 
205 

Leu Tyr Val Glu 

He Ser Asn Met 
240 

Val Leu He Leu 
255 

Asn Asp Phe Tyr 
270 

Val Asp Asp Glu 
285 

Val Pro Leu Val 
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Arg 


Leu 


Ala 


Pro 


Ser 


Asn 


Glu 


Asp 


He 


Gin 


Arg 


Asp 


Val 


He 


Arg 


Phe 


305 










310 










315 










320 


Asn 


Asp 


Ser 


Thr 


Asp 
325 


He 


Leu 


Met 


Asp 


Ser 
330 


He 


Asp 


Val 


Arg 


Asp 
335 


Val 


Val 


Leu 


Pro 


Val 
340 


Leu 


Ser 


Lys 


He 


He 
345 


Trp 


Gin 


Asn 


Val 


Ser 
350 


Ala 


Arg 


Leu 


Lys 


Leu 

355 


Arg 


Asn 


Asn 


Lys 


Ser 
360 


Leu 


Ser 


Lys 


Leu 


Ala 
365 


Lys 


Trp 


Lys 


Trp 


Asn 


Gly Met 


Val 


Ser 


Thr 


His 


Asp 


Asn 


Phe 


Asp 


Ser 


Asn 


Asp 


Tyr 




370 










375 








- 


380 








Val 


He 


Glu 


His 


Lys 


Arg 


Gin 


Leu 


Ala 


Ala 


Asp 


He 


Met 


Ser 


Asp 


Ser 


385 










390 










395 










400 


Leu 


Ser 


Lys 


Asn 


His 


Leu 


Pro 


Asn 


Phe 


Ser 


Lys 


Thr 


He 


Thr 


Glu 


Tyr 










405 










410 










415 


Asp 


Glu 


Lys 


Glu 
420 


Asn 


Lys 


Thr 


Thr 


Pro 
425 


Leu 


He 


Cys 


Trp 


Asn 
430 


Tyr 


He 


Phe 


Glu 


Leu 


Ser 


Pro 


Met 


Gly Lys 


His 


Leu 


Phe 


Pro 


Leu 


Glu 


Glu 


Val 






435 










440 










445 








Cys 


Gly 
450 


Phe 


Tyr 


Glu 


Ala 


Ser 
455 


Leu 


Pro 


Leu 


He 


Thr 
460 


Pro 


Trp 


Gin 


Leu 


Lys 


Val 


Val 


Gin 


Lys 


Lys 


Arg Gly Arg 


Gin 


Met 


Val 


He 


Tyr 


Gly 


Pro 


465 










470 










475 










480 


Arg 


Lys 


Arg 


Pro 


Arg 
485 


Thr 


Gin 





















<210> 234 

<211> 4698 

<212> DNA 

<213> SHRIMP 

<400> 234 

atgtcaacaa cgcaaacgca aacaatagaa cggcccctgc ctggtaagaa caacgaagac 60 
aatagccgtt tagcttgcct attggctgaa gggctacagc aacaacaaca acaacaagat 120 
ggcgatagcg aaatttctct accgcttgtt aatgcgggaa cttttgcgtg ttatgattct 180 
acccttgcaa acctcaccga ggggcgttta ggaagtgaaa cagaaaatgc aaaaataagg 240 
gtaaaaatac acccgtctgt gtttattata gagacgaata aagagatgac tattgaagaa 300 
atatctacaa aatctctaaa tgcattagtg gaaaaaagag cacgagaagc acgacgattt 360 
tcatctctga cagaacaaaa atttcctcga ggaggaggag gatgttattc tcgtaaaaat 420 
gagaggttta tcgaaggcga aattaacaac atcaagctga acatggaaga aactgcttcg 4 80 
tctttagaga gattggcagg gcttttgcct gttgtcatta atattaagga ctggacaatg 540 
catgatgaga aagaaatacg gctagatctt aaaggaaacg acggcatgga agaacttgtc 600 
aatatatccc atctgaatca agaagaatgg gaaatggaaa gactttcttc ctcaatagta 660 
ttgaaagacg cctatggtgt gttttatgcc catcacggca tacttgatat tgttcttacg 720 
acctctagat tcacagggaa attattacaa caccctgtga tatttcgtct aatggatgtg 780 
aaggtgtgga taaacacccc cctccagata gcatttcctg acaccagtaa gaacccaaat 840 
gcaaagaaaa ttctctatca acatccctct cttacaaggc tacgtgattt aaatgacatg 900 
gcttcaaact ccaagtcagt ttcttccatt atcataccag agttgtcaaa atttaactcg 960 
actgaatttg gtatgcacta ctttacggcc cagtgctttt ttggaaaaaa taccaattct 1020 
ttaaaggatt tggttacacg gtattatcaa ctatcattta aaaataaacc tcaacccaaa 1080 
ctttatgaac caacagcaac agcaacagca gcctcctcct cctcctcaac cgcctcattg 1140 
acgaccgaac aaaaggaaaa aatcgcacaa tcgattttat cgtctaaagg aaaatcttta 1200 
ggagatgtat cttccaccct ttctaaggaa tacgatgaaa atagaaaacg tacaaagagg 1260 
caaaaaactt ctaccgatac caacattgtc ccttcagggg cacctacttc catttccatg 1320 
aaaaacccag ttacttgctt ctttgggcct caatacactt caatcatgga ctgtatatcg 1380 
gagaaaacag actggataga aatgcacctt tttttaacct ctcttaacga cgctgaacat 1440 
aataaaacac tcgttgtcga cagaaaaagt aacgtgtcgg aaatacacga ttctggaagg 1500 
tttcttacat tcggtcaaaa taatactaca gcctttatac cagatgttga tattcctacg 1560 
ttaaaattaa ttctgagaga cgattctggg gaatcttctg ctgctataat agcatctctg 1620 
atatattata ataatgtaaa tctagaaggg agggaattct caaatgtctc tgatgctgtt 1680 
gtgggacttt tctcgggcgg atctgctatt acggtaggag atattgctcg agagattgca 1740 
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tccatataca 
atactggccg 
gtggtgggtc 
ccagtgaaag 
cttcttccat 
caactaacag 
cacaaatttt 
tatgcagtta 
cctcccttta 
cagagtattt 
tcttctccat 
cctgtaccaa 
cctctaatat 
ttttcttctg 
agaaatgtga 
ggtgaattgg 
gatatggaaa 
tcttcttcat 
gttagcgcat 
aaacaagtca 
ctctcgtctg 
atgaggatct 
agtaaagtgg 
gatactggat 
tcaggcgaag 
attgctattg 
agagcgatcc 
aatgtgcctg 
tttaggaatg 
atgaaaatta 
gttgcactag 
aatgggatta 
gggagacttt 
ggtgctcttg 
aataccattg 
aataataaca 
agagactctt 
gaccgtataa 
tttattgctt 
ctcaacaaga 
aatccgcgcc 
gcaacaagag 
actcaattgg 
aaatccccag 
attgctgcta 
attgttaaca 
ttttatataa 
gctgttaata 
tcacagacat 
agaaatataa 



atattggaag 
gaagaagaag 
tttatcggtc 
tggatgagac 
tttttgattg 
attatatagt 
tattattcaa 
ttcattcggc 
ttgtagaaaa 
tgaatggaga 
catcctcctc 
caggacggtt 
cctctaagga 
aaaccatttc 
catatggtca 
tgtcgtactt 
ttggtgatat 
tcgttgatgg 
tcctccctgg 
aatattcggt 
ctgtgcttta 
ataaggaatg 
gcacagtggc 
tacccacaac 
actcaaaaat 
cagccgacca 
aacaacaaca 
cccgtttcga 
ttattgctac 
ttaaagaagt 
atcctagaaa 
ctattgatac 
ttataaatga 
gaactggtag 
gaagtactac 
atactgttcc 
ctctattcac 
agacaaacat 
caaaggcaca 
ttccagaaga 
atatccttac 
attcggctat 
aggaactttt 
cttatgt ttc 
tacgagagaa 
ttgatgagct 
aaatgcccaa 
tagcagcaac 
ctgttttata 
ccatgtga 



agaatctaac 
ttatggacgg 
gtgtttagaa 
cgcttggatg 
cgttttgaaa 
acgtgaagtg 
gaagcccgtg 
agccgataat 
tgatacgtct 
taatgtagcc 
cagatattca 
tttaaagatg 
acaaaaacaa 
aaaatcgttc 
gaatatcgca 
taaagttgta 
taataaccat 
tgtacggaca 
tacgacttct 
aaaggaattg 
tgaagcctca 
tgttagtcct 
aggaactgga 
tctagaaagg 
ggaagtaatt 
actcagaggc 
acagcagcaa 
gcctacgttc 
tattatatct 
gtgcgagcac 
ggccattgaa 
tggaaatgaa 
tgaagctagt 
aagggtgcct 
tcaacaacaa 
cgaggatata 
atctaaactt 
ttcagaaaca 
gaaacctatc 
cttgaacttt 
tgacatgcaa 
cgtcatgcaa 
atctcaatgg 
aagtgtaaaa 
ttcagaaaaa 
aggtatagtg 
tgtgtgggct 
tccccttatt 
tgactcttct 



tgcgattcca 
cagtacagat 
acaatgacaa 
tacatgcacc 
tcgggcgtat 
ctactaaaat 
atggatctaa 
ggaggcgtgt 
ctcagatact 
gaaaacttaa 
tcagaaaccc 
aataaggatc 
caacagcagc 
ttaaattatg 
gaagatggat 
aagaatactg 
caagacgaca 
agtttttcgg 
cagcccacaa 
ggcatggaaa 
aaaactaaat 
ctatctacag 
ttcgaattta 
ttgtcaccca 
gcagaatctg 
agtaataata 
caacaagaaa 
acagagatcg 
agaatgatga 
gtgtctcata 
gaaatactgg 
gggtatggat 
gaagaagctg 
gtagaattac 
caacagcagc 
aaggttcaca 
ttgaattaca 
ctaaagaaat 
ccatggaaac 
tctcctgcac 
tggctcaact 
agtttccaag 
aataatattg 
ttggaatggt 
tcaaaaattg 
gctgttgcca 
tctagggatt 
aataatatat 
cttgctctta 



tactattccc 
ggtacgatcc 
gaaatattat 
agcaggtact 
gggctgtaaa 
atacagccga 
ttgcaaagat 
gccttgcttt 
atacactgac 
agtctgccac 
ctattagggt 
tggaactttt 
aaacaactgc 
taccacctaa 
tccttggatt 
aacgcgatga 
ctggatccct 
ttgatgggaa 
atcttcctgt 
ctgtattctt 
cgacacaaca 
ggaggatcga 
tttggaaagt 
agattccttc 
gtaaaggtgt 
tagtaggtgg 
cacaagcggt 
aactatttct 
tgcttgtcag 
ttatggtcga 
aaaggataac 
ctttacgcta 
cagcggccat 
gttcaatatt 
agcgtcaaca 
atgaacaaat 
ttagggatga 
acagtagaat 
acaccaagga 
agaatctgtt 
gtatttccat 
agcaagcgga 
tatctcaagt 
taaataatga 
taatgggtgt 
ggtcaatagt 
ggaagaacct 
cgaggggtat 
ttgctgccga 



aggggagcct 
cattaactgc 
gagaggacag 
tcaagttgtt 
ggaggctaga 
tcctgatcaa 
tgttacacat 
ccctagagat 
cgatacacct 
ttcagtcgct 
agttaatctt 
cattaatgta 
cactgctcca 
atctctaaca 
gaaaaataaa 
tggtataaag 
ttcttcttct 
aattgaacat 
tcatgcatct 
tgaacccctt 
tttaagccca 
tattttcccc 
tcttcagtat 
tgtaccaatt 
acaaaacatc 
tggaacaaga 
tgtjtccagtt 
tcagaacaaa 
caacgaagag 
tggattatat 
agctgaacaa 
tgcttcttcg 
cggcggtgga 
ggacaaatta 
gaggcaggct 
gcaaaaaatt 
tggaagaaag 
cccatcatac 
caatatcaac 
tgttccagtt 
catcgaaact 
caagactaca 
aactgatgaa 
agctagccgt 
tcaggggaag 
agacgttgat 
gatttattat 
catggctgct 
gcaagctaca 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4 500 
4560 
4620 
4680 
4698 



<210> 235 
<211> 1551 
<212> PRT 
<213> SHRIMP 



<400> 235 

Met Ser Thr Thr Gin Thr Gin Thr lie Glu Arg Pro Leu Pro Gly Lys 

15 10 15 

Asn Asn Glu Asp Asn Ser Arg Leu Ala Cys Leu Leu Ala Glu Gly Leu 
20 25 30 
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Gin 


Gin 


Gin 


Gin 






JD 




Leu 


Val 


Asn 


Ala 




50 






Leu 


Thr 


Glu 


Gly 


65 








Val 


Lys 


He 


His 


Thr 


He 


Glu 


Glu 








100 


Arg 


Ala 


Arg 


Glu 






115 




Pro 


Arg 


Gly 


Gly 




130 






Glu 


He 


Asn 


Asn 


145 








Glu 


Arg 


Leu 


Ala 


Thr 


Met 


His 


Asp 








180 


Gly Met 


Glu 


Glu 






195 




Glu 


Met 


Glu 


Arg 




210 






Val 


Phe 


Tyr 


Ala 


225 








Arg 


Phe 


Thr 


Gly 


Asp 


Val 


Lys 


Val 








260 


Thr 


Ser 


Lys 


Asn 






275 




Leu 


Thr 


Arg 


Leu 




290 






Ser 


He 


He 


He 


305 








Met 


His 


Tyr 


Phe 


Leu 


Lys 


Asp 


Leu 








340 


Pro 


Gin 


Pro 


Lys 






355 




Ser 


Ser 


Ser 


Ser 




370 






Ala 


Gin 


Ser 


He 


385 








Ser 


Thr 


Leu 


Ser 


Gin 


Lys 


Thr 


Ser 










Ser 


He 


Ser 


Met 






435 




Thr 


Ser 


He 


Met 




450 






His 


Leu 


Phe 


Leu 


465 








Val 


Val 


Asp 


Arg 


Phe 


Leu 


Thr 


Phe 








500 


Asp 


He 


Pro 


Thr 



Gin Gin Gin Asp 
40 

Gly Thr Phe Ala 
55 

Arg Leu Gly Ser 
70 

Pro Ser Val Phe 
85 

He Ser Thr Lys 

Ala Arg Arg Phe 
120 

Gly Gly Cys Arg 
135 

He Lys Leu Asn 
150 

Gly Leu Leu Pro 
165 

Glu Lys Glu lie 

Leu Val Asn He 
200 

Leu Ser Ser Ser 
215 

His His Gly He 
230 

Lys Leu Leu Gin 
245 

Trp He Asn Thr 

Pro Asn Ala Lys 
280 

Arg Asp Leu Asn 
295 

Pro Glu Leu Ser 
310 

Thr Ala Gin Cys 
325 

Val Thr Arg Tyr 

Leu Tyr Glu Pro 
360 

Thr Ala Ser Leu 
375 

Leu Ser Ser Lys 
390 

Lys Glu Tyr Asp 
405 

Thr Asp Thr Asn 

Lys Asn Pro Val 
440 

Asp Cys He Ser 
455 

Thr Ser Leu Asn 
470 

Lys Ser Asn Val 
485 

Gly Gin Asn Asn 
Leu Lys Leu He 



Gly Asp Ser Glu 

Cys Tyr Asp Ser 
60 

Glu Thr Glu Asn 
75 

He He Glu Thr 
90 

Ser Leu Asn Ala 
105 

Ser Ser Leu Thr 

Lys Asn Glu Arg 
140 

Met Glu Glu Thr 
155 

Val Val He Asn 
170 

Arg Leu Asp Leu 
185 

Ser His Leu Asn 

He Val Leu Lys 
220 

Leu Asp He Val 
235 

His Pro Val He 
250 

Pro Leu Gin He 
265 

Lys lie Leu Tyr 

Asp Met Asn Ser 
300 

Lys Phe Asn Ser 
315 

Phe Phe Gly Lys 
330 

Tyr Gin Leu Ser 
345 

Thr Ala Thr Ala 

Thr Thr Glu Gin 
380 

Gly Lys Ser Leu 
395 

Glu Asn Arg Lys 
410 

He Val Pro Ser 
425 

Thr Cys Phe Phe 

Glu Lys Thr Asp 
460 

Asp Ala Glu His 
475 

Ser Glu He His 
4 90 

Thr Thr Ala Phe 
505 

Leu Arg Asp Asp 



He Ser Leu Pro 
45 

Thr Leu Ala Asn 

Ala Lys He Arg 
80 

Asn Lys Glu Met 
95 

Leu Val Glu Lys 
110 

Glu Gin Lys Phe 
125 

Phe He Glu Gly 

Ala Ser Ser Leu 
160 

He Lys Asp Trp 
175 

Lys Gly Asn Asp 
190 

Gin Glu Glu Trp 
205 

Asp Ala Tyr Gly 

Leu Thr Thr Ser 
240 

Phe Arg Leu Met 
255 

Ala Phe Pro Asp 
270 

Gin His Pro Ser 
285 

Lys Ser Val Ser 

Thr Glu Phe Gly 
320 

Asn Thr Asn Ser 
335 

Phe Lys Asn Lys 
350 

Thr Ala Ala Ser 
365 

Lys Glu Lys He 

Gly Asp Val Ser 
400 

Arg Thr Lys Arg 
415 

Gly Ala Pro Thr 
430 

Gly Pro Gin Tyr 
445 

Trp He Glu Met 

Asn Lys Thr Leu 
480 

Asp Ser Gly Arg 
495 

He Pro Asp Val 
510 

Ser Gly Glu Ser 
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515 










520 










525 








Ser 


Ala 


Ala 


He 


He 


Ala 


Ser 


Leu 


He 


Tvr 


Tvr 


Asn 


Asn 


Val 


Asn 


Leu 




530 










535 










540 










Glu 


Gly 


Arg 


Glu 


Phe 


Ser 


Asn 


Val 


Ser 


Asp 


Ala 


Val 


Val 


Glv 


Leu 


Phe 


54 5 










550 










555 










560 


Cat* 


oiy 


oiy 




Ala 


He 


Thr 


Val 


Glv 
oxy 


Asp 


He 


Ala 


Arg 


Glu 


He 


Ala 










565 










570 










575 




OCX 


Tic, 

1 XC 


iyr 


Asn 


He 


Gly 


Arg 


Glu 


Ser 


Asn 


Cys 


Asp 


Ser 


He 


Leu 


Phe 






580 










585 










590 






riO 


m \r 

oiy 


bill 


Pro 


11c 


Leu 


nla 


viy 


f\X y 


Arg 


Ser 


i yx 


Gly 




Gin 


xyx 
















600 










605 








Arg 


Trp 


Tyr 


Asp 


Pro 


Tip 

1JLC 


Asn 




vol 


Val 


Gly 


Arg 


Ser 


Cys 


Leu 


Glu 




til n 

OX \J 










61 5 

O X J 










620 










inr 




TK -r- 

x nr 


Arg 


Asn 


Tift 

lie 




Arg 


oiy 


Gin 


Pro 


Val 


Lys 


Val 


Asp 


Glu 


— ) 










630 










635 










640 


I lit 


/Ud 


Trp 




Tyr 


Mai- 
ne L. 


Hie; 


Gin 


Gin 


Val 


Leu 


Gin 


Val 


Val 


Leu 


Leu 








645 










650 










655 




rro 


rnc 


rut; 


ASp 




V a 1 


T 


i*y s 


C /=, r 
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Gly 


Val 


i xp 
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Glu 








DOW 










665 










670 
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Arg 


ri ~\ t» 


Leu 
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iyr 


Tip 
lie 
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Arg 


Glu 


Val 


Leu 


Leu 


Lys 


Tvr 
x yx 






D ' J 










680 










685 








i nr 


nla 


Asp 


Pro 


Asp 


Gin 


Hi <3 

ii 1 o 




Phe 


Leu 


Leu 


Phe 


Lys 


Lys 


Pro 


Val 




D .7 W 










D 7 »J 










700 










wee 


Asp 


Leu 


Tl D 
lie 


AT a 

nld 


iiys 


Tip 
1 1c 
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Thr 


His 


a yi 
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Val 


He 


His 


Ser 


/Uj 










f 1U 










715 










720 


n 1 a 
nla 


Ala 


Asp 


Asn 


oiy 


ox y 


V d 1 




It6U 


Ala 


Phe 


Pro 


Arg 


Asp 


Pro 


Pro 




















7^n 










735 




Jrfie 


x le 


Val 


olu 


Asn 


Asp 


TK r- 


DGX 


L6U 


Arg 


i yr 


iyx 


Thr 

X ill 


Leu 


Thr 


Asp 








/ ^ U 










TilC 










750 






TK %• 
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bin 
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Ti- 
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Leu 


Asn 


f21 \/ 
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Asn 
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xyx 
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* f ~J 










780 










C A V- 


blU 


inr 


Pro 


lie 


Arg 


val 


\/a 1 

Val 


Asn 


Leu 


Pro 


vai 


Pro 


X I1X 


ox y 


Arg 


7ft ^ 










7Q0 










795 










800 


OK «. 

Pne 


Lieu 


Lys 


Met 


Asn 


Lys 


Asp 


Leu 


blu 


Leu 


Dhp 

rne 


Tip 

lie 


Asn 


VoX 




Leu 




















pin 

O 1 u 










815 




lie 


Ser 


C A V 


Lys 


blU 


oin 


Lys 


bin 




VjJl 1 1 


oin 


Olll 


TVi -r 

X ill 


Thr 


Ala 


Thr 


















O -J 










830 






Ala 


Pro 


rne 


Dei 
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o6I 


UlU 


i nr 


Tin 

1 1c 




Jjya 
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iieu 


Asn 


Twr 

i yx 


Val 






ft 










84 0 










845 
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Pro 


Lys 


Ser 


Xjeu 


Thr 


Arg 


Asn 


Val 


Thr 


Twr 

xyx 


oiy 


Gin 


Asn 


He 
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ft <^n 
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f i n 
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Asp 


oiy 
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A n 


ys 
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x yx 


Q £ C. 










o / u 
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Asp 
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Tic* 


oiy 
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lie 


Asn 


Asn 


U-i e 

nx 9 


OX 11 


7\ e« 


ASp 


TK r 

A 1 1 X 


oxy 


O w i 


Leu 


Ser 








900 










905 










910 






Ser 


Ser 


Ser 


Ser 


Ser 


Phe 


Val 


Asp 


Gly 


Val 


Arg 


Thr 


Ser 


Phe 


Ser 


Val 






915 










920 










925 








Asp 


Gly 


Lys 


He 


Glu 


His 


Val 


Ser 


Ala 


Phe 


Leu 


Pro 


Gly 


Thr 


Thr 


Ser 




930 










935 










940 










Gin 


Pro 


Thr 


Asn 


Leu 


Pro 


Val 


His 


Ala 


Ser 


Lys 


Gin 


Val 


Lys 


Tyr 


Ser 


945 










950 










955 










960 


Val 


Lys 


Glu 


Leu 


Gly 


Met 


Val 


Phe 


Phe 


Glu 


Pro 


Leu 


Leu 


Ser 


Ser 


Ala 








965 










970 










975 




Val 


Leu 


Tyr 


Glu 


Ala 


Ser 


Lys 


Thr 


Lys 


Ser 


Thr 


Gin 


His 


Leu 


Ser 


Pro 






980 










985 










990 






Met 


Arg 


He 


Tyr 


Lys 


Glu 


Cys 


Val 


Ser 


Pro 


Leu 


Ser 


Thr 


Gly 


Arg 


He 



995 1000 1005 
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Asp He Phe Pro Ser Lys Val Gly Thr Val Ala Gly Thr Gly Phe Glu 

1010 1015 1020 

Phe He Trp Lys Val Leu Gin Tyr Asp Thr Gly Leu Pro Thr Thr Leu 
1025 1030 1035 1040 

Glu Arg Leu Ser Pro Lys lie Pro Ser Val Pro He Ser Gly Glu Asp 

1045 1050 1055 

Ser Lys Met Glu Val He Ala Glu Ser Gly Lys Gly Val Gin Asn He 

1060 1065 1070 

He Ala He Ala Ala Asp Gin Leu Arg Gly Ser Asn Asn He Val Gly 

1075 1080 1085 

Gly Gly Thr Arg Arg Ala He Gin Gin Gin Gin Gin Gin Gin Gin Gin 

1090 1095 1100 

Glu Thr Gin Ala Val Val Pro Val Asn Val Pro Ala Arg Phe Glu Pro 
1105 1110 1115 1120 

Thr Phe Thr Glu He Glu Leu Phe Leu Gin Asn Lys Phe Arg Asn Val 

1125 1130 1135 

He Ala Thr He He Ser Arg Met Met Met Leu Val Ser Asn Glu Glu 

1140 1145 1150 

Met Lys He He Lys Glu Val Cys Glu His Val Ser His He Met Val 

1155 1160 1165 

Asp Gly Val Asp Pro Arg Lys Ala He Glu Glu He Arg He Thr Ala 

1170 1175 1180 

Glu Gin Asn Gly He Thr He Asp Thr Gly Asn Glu Gly Tyr Gly Ser 
1185 " 1190 1195 1200 

Leu Arg Tyr Ala Ser Ser Gly Arg Leu Phe He Asn Asp Glu Ala Ser 

1205 1210 1215 

Glu Glu Ala Ala Ala Ala He Gly Gly Gly Gly Ala Leu Gly Thr Gly 

1220 1225 1230 

Arg Arg Val Pro Val Glu Leu Arg Ser He Leu Asp Lys Leu Asn Thr 

1235 1240 1245 

He Gly Ser Thr Thr Gin Gin Gin Gin Gin Gin Gin Arg Gin Gin Arg 

1250 1255 1260 

Gin Ala Asn Asn Asn Asn Thr Val Pro Glu Asp He Lys Val His Asn 
1265 1270 1275 1280 

Glu Gin Met Gin Lys He Arg Asp Ser Ser Leu Phe Thr Ser Lys Leu 

1285 1290 1295 

Leu Asn Tyr He Arg Asp Asp Gly Arg Lys Asp Arg He Lys Thr Asn 

1300 1305 1310 

He Ser Glu Thr Leu Lys Lys Tyr Ser Arg He Pro Ser Tyr Phe He 

1315 1320 1325 

Ala Ser Lys Ala Gin Lys Pro lie Pro Trp Lys His Thr Lys Asp Asn 

1330 1335 1340 

He Asn Leu Asn Lys He Pro Glu Asp Leu Asn Phe Ser Pro Ala Gin 
1345 1350 1355 1360 

Asn Leu Phe Val Pro Val Asn Pro Arg His He Leu Thr Asp Met Gin 

1365 1370 1375 

Trp Leu Asn Cys He Ser He He Glu Thr Ala Thr Arg Asp Ser Ala 

1380 1385 1390 

He Val Met Gin Ser Phe Gin Glu Gin Ala Asp Lys Thr Thr Thr Gin 

1395 1400 1405 

Leu Glu Glu Leu Leu Ser Gin Trp Asn Asn He Val Ser Gin Val Thr 

1410 1415 1420 

Asp Glu Lys Ser Pro Ala Tyr Val Ser Ser Val Lys Leu Glu Trp Leu 
1425 1430 1435 1440 

Asn Asn Glu Ala Ser Arg He Ala Ala He Arg Glu Asn Ser Glu Lys 

1445 1450 1455 

Ser Lys He Val Met Gly Val Gin Gly Lys He Val Asn He Asp Glu 

1460 1465 1470 

Leu Gly He Val Ala Val Ala Arg Ser He Val Asp Val Asp Phe Tyr 

1475 1480 1485 

He Lys Met Pro Asn Val Trp Ala Ser Arg Asp Trp Lys Asn Leu He 
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1490 1495 
Tyr Tyr Ala Val Asn lie Ala Ala 
1505 1510 
Arg Gly lie Met Ala Ala Ser Gin 
1525 

Leu Ala Leu He Ala Ala Glu Gin 
1540 



1500 

Thr Pro Leu He Asn Asn lie Ser 

1515 1520 
Thr Ser Val Leu Tyr Asp Ser Ser 

1530 1535 
Ala Thr Arg Asn He Thr Met 
1545 1550 



<210> 236 
<211> 669 
<212> DNA 
<213> SHRIMP 



<400> 236 

atggaatgga taaaccaacg gacaagtaga gaggatcttt tcaacacgta tacagggaat 60 

gcagtaatac gttcggcagc taagcaagca ctggctattg aaaaacacgc agcagaaaga 120 

agaggagaaa aggcatggac gacttcagca gcagcagcag cttcttctaa ttttaataat 180 

gtacaacaag attatactga tgatgatatt acacaagtgt ctattgcaaa cagtgttttg 240 

aataacccct ttttaaagag atatgcaaaa cttatagata atttagcaat atcttcttta 300 

cctcctgata tagaggatga tgtcattata cacactagag atgcctccaa ctctacagtc 360 

agagtagatg gagccaatat ctatttcgcc ataattgacg gtgatttatg tgtataccct 420 

aaacaatata tatctgataa agtgctgtgt ggttctctca accgggaaaa ggcactgttc 4 80 

tataatagct ccaagaataa gtggacgtat ggatgtaacc taaactttga tatcgttgac 54 0 

gctgccatca tgaaacaccc cgactacaag gaagagacta catctacaaa acatatacgt 600 

aaaatattgg gtatcggagc atcggaaaaa ctgaacatta cccactattt aaactacttt 660 
atccaataa 669 



<210> 237 
<211> 218 
<212> PRT 
<213> SHRIMP 



<400> 237 



Met 


Glu 


Trp 


He 


Asn 


Gin 


Arg 


Thr 


1 








5 








Tyr 


Thr 


Gly 


Asn 


Ala 


Val 


He 


Arg 








20 










Lys 


His 


Ala 


Ala 


Glu 


Arg 


Arg 


Gly 






35 










40 


Ala 


Ala 


Ala 


Ala 


Ser 


Ser 


Asn 


Phe 




50 










55 




Asp 


Asp 


Asp 


He 


Thr 


Gin 


Val 


Ser 


65 










70 






Pro 


Phe 


Leu 


Lys 


Arg 


Tyr 


Ala 


Lys 










85 








Ser 


Leu 


Pro 


Pro 


Asp 


He 


Glu 


Asp 








100 










Asn 


Ser 


Thr 


Val 


Arg 


Val 


Asp 


Gly 






115 










120 


Asp 


Gly 


Asp 


Leu 


Cys 


Val 


Tyr 


Pro 




130 










135 




Leu 


Cys 


Gly 


Ser 


Leu 


Asn 


Arg 


Glu 


145 










150 






Lys 


Asn 


Lys 


Trp 


Thr 


Tyr 


Gly 


Cys 










165 








Ala 


Ala 


He 


Met 


Lys 


His 


Pro 


Asp 








180 










Lys 


His 


He 


Arg 


Lys 


He 


Leu 


Gly 




195 










200 


He 


Thr 


His 


Tyr 


Leu 


Asn 


Tyr 


Phe 



Ser 


Arg 


Glu 


Asp 


Leu 


Phe 


Asn 


Thr 




10 










15 




Ser 


Ala 


Ala 


Lys 


Gin 


Ala 


He 


Glu 


25 










30 






Glu 


Lys 


Ala 


Trp 


Thr 


Thr 


Ser 


Ala 










45 








Asn 


Asn 


Val 


Gin 


Gin 


Asp 


Tyr 


Thr 








60 










He 


Ala 


Asn 


Ser 


Val 


Leu 


Asn 


Asn 






75 










80 


Leu 


He 


Asp 


Asn 


Leu 


Ala 


He 


Ser 




90 










95 




Asp 


val 


He 


He 


His 


Thr 


Arg 


Asp 


105 










110 






Ala 


Asn 


He 


Tyr 


Phe 


Ala 


He 


He 










125 








Lys 


Gin 


Tyr 


He 


Ser 


Asp 


Lys 


Val 








140 










Lys 


Ala 


Leu 


Phe 


Tyr 


Asn 


Ser 


Ser 




155 










160 


Asn 


Leu 


Asn 


Phe 


Asp 


He 


Val 


Asp 




170 










175 




Tyr 


Lys 


Glu 


Glu 


Thr 


Thr 


Ser 


Thr 


185 








190 






He 


Gly 


Ala 


Ser 


Glu 


Lys 


Leu 


Asn 










205 








He 


Gin 
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210 215 



<210> 238 
<211> 6948 
<212> DNA 
<213> SHRIMP 

<400> 238 

atgagcgata cagggcagat ggaagaaaat 
gatgaagaag aggaagaaac tggtagtagt 
gacgccacgt actccatgta cactggagaa 
cctaaagaaa gaagtgtaca aagggtgcaa 
caacgttcta atgttcggac acggagacct 
gatcgatcgc gagaacggtc agaaaaactt 
gaacggcaaa agaaatatac tccaaaggta 
tttcgtgaag gaggaagaaa attcaaggcg 
gcaaccaatg ttcttgaaag ggaagaaatt 
gaaattacag gagatactat attaggtggg 
gaggatgaat taactataca acattcatct 
gttgtcagtt ctcctatacc gccaaagccc 
gaagatatag tggggaaaaa tattagccag 
gacgaagaag acgaacattt gtacgaagaa 
gcagcagcag cagcttccac aagacctccc 
gttgttgctg ttgcagacga aaccttgaag 
gaggccgagg aacaaagagc ggccgcagtt 
gcggccgctg ctgctgctgc tgctgctgct 
gaaagagaag cagaggaaca aagagcggcc 
cgggaagcag acgaaaaaaa ggaaagagaa 
gctgctgccg ctgctgccca acgggaagca 
caaagagcgg ccgctgctgc tgaaagggaa 
gaacaaatgc gcataaagga agaggagagg 
aaacgtcgag aactagcagc caaggaggaa 
gagcaacttg aaaaattgaa tttccagttg 
gaacaaatgt tagaagaaga gaaggcctca 
gcgatccaag caatagaata tgaggatgaa 
ttagttccta tggatacgga tttgtacgga 
gtacagaata atacattaac ttctgcattt 
caattggttg ctcaatccct tgaaaaatcc 
aaacatcttc ctgtggatga tcctgctttt 
tctttaaaga atctgggaaa tgtttacaaa 
agggaccttt ttaaatcgaa ggcattaatt 
cgtctaacag atgtatcgac agatctcagg 
ttggatgtat ccgttaacat acgctctggt 
gacacggcat tagcagatca agttccttcg 
gccttattgc cattttcatc tgcaggtttg 
tacaatgaaa tagtgaacca actaagcagt 
aatattgtcc caacattgaa ccaaatcaaa 
agctctcgtc aatatgctat tgaagaaagg 
gaaataagaa aattcctcgc tataatgaac 
tggacggatg aaagacaacg ctctattgct 
gataaaatta aagagagtgt tgctacacta 
agtaatcccc tcctgcacaa atcctcagtt 
aacgacttta gaaattttct cgatatccaa 
tcaggccaaa atattgatga cctttcactg 
ttgtgcctcg aattatccat aattttagac 
ttgcctgcaa ttacttaccc cgcaggggaa 
gttgacatta gacaagaaat tggtaagaac 
acactgtcag aggcgttaca aatttttcag 
caacagcagc tgttacaaca acaacaggac 
caaatagaag aacaacaacg ggttcaggaa 



aggcctgcta cccagaaaag gagacctgga 60 
aatgttccat attatgccaa ctttggcgat 120 
ggaaaaaggg gtaaatttgt attagagcca 180 
aaaccaccta aagaaaagga ggaaagggaa 24 0 
ggtcaagagt ttgaacagaa agtgctacaa 300 
gggcaaaatt tggcagagaa aggattgcaa 360 
gcacaaacaa tgacaaaaaa aataatcagg 420 
ccgcaacagc agacatctga caaaggtgca 4 80 
gagatggctg cagaaagaga acaaccagta 54 0 
ctaggagaag aagatgacga agatatggga 600 
atggctgtat cacaacccgt tcaacaaatc 660 
actaggcccg ctcctgatat tcccatacaa 720 
ttaccaccat taccacttga tgattatgag 780 
gtgaatgatt tcttagtggc accaccaaca 84 0 
aggcctaata ttcctcctcc acctcctcct 900 
aacttggctt caattgcagc cttggaaaag 960 
gaaagggaaa gagaagtaga ggaacaaaga 1020 
gccgcccaac gggaagcaga cgaaaaaagg 1080 
gctgctgctg ctgctgccgc tgctgcccaa 1140 
gtagaggaac aaagagcggc cgctgctgct 1200 
gacgaaaaaa gggaaagaga agtagaggaa 1260 
atacttgccc aacaacttca agaaatgaaa 1320 
cggaaagaac tagcagataa ggaggaagaa 1380 
gaaaagcgtc aagaaatatt agctaaagaa 14 4 0 
ggtacagaaa tcacgtccaa aagagcactc 1500 
cgctcacggt ccgcagccag tgcacagtta 1560 
cttcctcagg cagtcgaacc tcaaggacag 1620 
aaaatgtacg atctcaataa gaaattagaa 1680 
gaagacgtga acaaaacaaa cgaacagaac 174 0 
gctaaagcca ttgaaaaatt aactagtcaa 1800 
atgcagagaa taataacaga gagggatttt 1860 
agagttctcg gggtctattt tacattaaaa 1920 
acagataaag aatcaaggga tctggaggtg 1980 
gctaatgatc tcaatacaat actggaaagg 204 0 
ggaacattat acactaaatt tacagaggca 2100 
aggattgaaa taagtaacag atcaagatct 2160 
gatactaatt ttactaatag ttccgacaag 2220 
ataaatgagg ctatgaatat tttgaaagaa 2280 
attgatgtca ccaatctatt aacagtttca 2340 
gtgtattctg atgtgtcccg aatggattct 2400 
agtaaaatat ccccttattt taaaggcgat 24 60 
gacagtattt cctctcagat aaaatcaaac 2520 
cacgatatca atacaacctc aagaatacgt 2580 
ttatcatctc cagacttttt aaatgctgtt 2 64 0 
ggaggttctc aatttactta tgatgtcctt 2700 
gcatcaaaaa ccactgaaaa ggttacagaa 2760 
gtgatccata aaaatgcttt gagtttaaat 2820 
acatctatgg aagaaagtgg ttcattggct 2880 
atatcagatt ctagcgctga acttagtcgc 2940 
caacaacagc aacaacaaca gcaacaattc 3000 
caacaaaatc aacaacaatt attacaacaa 3060 
cagcagcaac agcagcaaag ggaccaacaa 3120 
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caacaggaac 
caaagggaac 
ttgcaacaac 
gacgatggtg 
cgtaaggttg 
cgagaagaaa 
gccacccctc 
acaacccagt 
gagaggattg 
attataacta 
ttcaatcaaa 
tcagcttctt 
agcatgcaag 
gcaattactc 
gcggctgagc 
aaagtggatc 
agtacttcca 
cataatgaaa 
gccaatctta 
ctccgcacat 
ttccgccaat 
aaggcactgg 
agtgtcaaaa 
atatttgcta 
aaatttttaa 
ctgtgtgaag 
cccttagccg 
gacacgtcaa 
gtggcttatg 
tcgggtctgg 
tcccaccggt 
caactacaaa 
tgtgttgccg 
caatccatac 
gcagaaaaaa 
gcaaaggaac 
aagttggcga 
aataaagccg 
attccaattg 
gtaattaaac 
tcttcagcct 
attttagatg 
tcaggatatg 
aaattccctg 
gaaagcgtat 
ctcttaatac 
agcagcagca 
caagttgttg 
agacgtgatg 
aaaggagtat 
cttaaaactt 
agacctatat 
ctcgaaggat 
tcatccttga 
cttgccacaa 
ggacaaaaac 
caacaacatc 
gatcaagcta 
tccatgagaa 
agggatgggg 
ataaaccctg 



aacagcaaag 
aacaacagca 
agcaacaatt 
atgaagaaag 
cagaatcagt 
ataacttcca 
cttcacctat 
gtttttcaaa 
tagaagtatg 
ccgcacaaac 
ttgaagagtt 
ctactaatcc 
tttgtgagtt 
tggccaatgc 
gagaaattgc 
taagttcagg 
aacaagaatt 
tggatattgg 
ggctcgtgaa 
ggctaggaga 
acattgatcg 
gtgtttcttt 
atgatgcatc 
tcgacggagg 
ggcaatctat 
cgacaatagc 
ataaagtaga 
ccggagacat 
tgaacctatg 
caaacaaaaa 
ttggatctta 
aacagaatat 
ccatggcagc 
tcttcccact 
atgtcgatat 
taaattcata 
gagtgtgcag 
agagtttagt 
atgataccaa 
catggatggt 
atttctcgtt 
catgggcttg 
aaaatcatat 
ccgacggacg 
tggaagattt 
cgccagatcc 
gcagcaagga 
gacaagaaca 
caaatattga 
caatttcagt 
ccatcatcaa 
ttgatcatca 
cagatttaaa 
ctctggcctc 
aacaagtaga 
ttaatgaaaa 
aatattctga 
gtaatggtgg 
aggcaaaact 
aaaattcatc 
gagcactttt 



ggaacaacag 
gcaacagcag 
tcagcaatta 
agaggaaaga 
agcgacaaaa 
atctaaaata 
catgaacttg 
actaacagct 
caagagtaag 
agaaatggaa 
ttcaaacaat 
tcctcctcct 
ggatgatcaa 
tgtacttcaa 
tctaaagcta 
agctactggt 
ggaagatttc 
attaattttg 
acctcatgga 
aacagatata 
caattggaag 
atcagatacg 
aaaatcagta 
aatgatttct 
gacagataaa 
gtctcttttc 
accacgcctt 
gtttcgtata 
caataatcgt 
agtcgtggcc 
tgatgtcaca 
ctctttactt 
cttcactgcc 
agaaggggga 
ggttcaagaa 
ctacaactgg 
gatgattatc 
agacactaat 
aactctggca 
ttcgttcaag 
tcaaaatatc 
cgcgcccggt 
caaattaaaa 
aggcactgtg 
taccgacttt 
tttactttct 
taatacaatt 
agcaccacca 
accaataata 
actggatgat 
tgatgccata 
aacacagaaa 
gaatggacaa 
cgattggaat 
gaaggaagaa 
attatcattt 
aaggggagga 
catagatgat 
ggcattagcc 
agctaaaccg 
attacgtaga 



cagcaacagc 
caacagagtg 
ctacaacaac 
gaagaagggg 
tatactgctg 
gcatcagcaa 
acaaaattga 
gaaaataata 
aatctcaatt 
gatcgagtga 
gtaaaacaac 
tctactcctt 
cgtaccctag 
actacaaaat 
gagaatggga 
gtttccgacc 
attgcagaag 
gatgccaaga 
ataaatgtgc 
ttggatgaag 
gtggaagaac 
ttggcacaca 
ccatttgcgt 
agcctttcaa 
gaggttgctc 
acggcatgta 
caagaaaaat 
cgagtttgtc 
ataaactata 
ggtaaaacaa 
tatgattttt 
ttagagaagg 
gacccttctc 
gagattgtta 
ttgtggaagg 
ttcatataca 
ggtatagtta 
gctctttctg 
ataaatattg 
caaatgttta 
caacaacaaa 
aagttaacaa 
aaagatgatc 
gtcgaaggat 
tcaattgaag 
tccatgttcg 
attggaaagg 
a taaacacgt 
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1505 1510 1515 1520 

Asp Lys Glu Val Ala Gin Gly Pro Val Arg Ser Gin Leu Cys Glu Ala 

1525 1530 1535 

Thr lie Ala Ser Leu Phe Thr Ala Cys Ser Asn Leu Leu Arg Ser Ser 

1540 1545 1550 

Pro Leu Ala Asp Lys Val Glu Pro Arg Leu Gin Glu Lys Leu Ala Ala 

1555 1560 1565 

Ala Ala Ala Val Asp Thr Ser Thr Gly Asp Met Phe Arg lie Arg Val 

1570 1575 1580 

Cys His Leu Met Tyr Asn Phe lie Val Ala Tyr Val Asn Leu Cys Asn 
1585 1590 1595 1600 

Asn Arg lie Asn Tyr Thr Leu Asn Val Leu Arg Ala Ser Gly Leu Ala 

1605 1610 1615 

Asn Lys Lys Val Val Ala Gly Lys Thr Thr Lys Gly His Thr Ser Ser 

1620 1625 1630 

Ser His Arg Phe Gly Ser Tyr Asp Val Thr Tyr Asp Phe Ser Val Leu 

1635 1640 1645 

Tyr Lys lie Leu Gin Leu Gin Lys Gin Asn lie Ser Leu Leu Leu Glu 

1650 1655 1660 

Lys Gly Phe Asn Ala Trp Glu Ser Cys Val Ala Ala Met Ala Ala Phe 
1665 1670 1675 1680 

Thr Ala Asp Pro Ser Leu Ser lie Ser Asp Ala Asp Gin Ser lie Leu 

1685 1690 1695 

Phe Pro Leu Glu Gly Gly Glu lie Val lie Glu Lys His Glu Asn Asp 

1700 1705 1710 

Ala Glu Lys Asn Val Asp Met Val Gin Glu Leu Trp Lys Glu Thr Ala 

1715 1720 1725 

Leu Thr Leu Met Ala Lys Glu Leu Asn Ser Tyr Tyr Asn Trp Phe lie 

1730 1735 1740 

Tyr Ser Lys Asp Thr Asp Met Glu Lys Leu Ala Arg Val Cys Arg Met 
1745 1750 1755 1760 

lie lie Gly lie Val Lys Ala Val Leu Arg Leu Thr Asn Lys Ala Glu 

1765 1770 1775 

Ser Leu Val Asp Thr Asn Ala Leu Ser Asp lie Phe Lys Leu Pro Val 

1780 1785 1790 

lie Pro lie Asp Asp Thr Lys Thr Leu Ala lie Asn lie Val Val Phe 

1795 1800 1805 

Thr Leu Asn Asn Val lie Lys Pro Trp Met Val Ser Phe Lys Gin Met 

1810 1815 1820 

Phe Arg Gin Lys Asp Gly Gly Val Ser Ser Ala Tyr Phe Ser Phe Gin 
1825 1830 1835 1840 

Asn lie Gin Gin Gin Lys His Gin Gin Thr Ala Ser lie Leu Asp Ala 

1845 1850 1855 

Trp Ala Cys Ala Pro Gly Lys Leu Thr Lys Ala Ala His Val Phe lie 
1860 1865 1870 
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Ser Gly Tyr Glu Asn His lie Lys Leu Lys Lys Asp Asp Leu Leu Trp 

1875 1880 1885 

Gly Ala Ser Met Lys Phe Pro Ala Asp Gly Arg Gly Thr Val Val Glu 

1890 1895 1900 

Gly Trp Ala Gin Gin Tyr Asn Asn Glu Ser Val Leu Glu Asp Phe Thr 
1905 1910 1915 1920 

Asp Phe Ser lie Glu Val Asn Ala Pro Ala Ser Gly Leu Leu lie Pro 

1925 1930 1935 

Pro Asp Pro Leu Leu Ser Ser Met Phe Gly Lys Gly Asn Gly Gly Ser 

1940 1945 1950 

Ser Ser Ser Ser Ser Lys Asp Asn Thr lie lie Gly Lys Gly Gly Leu 

1955 1960 1965 

lie Leu Asn Arg Gin Val Val Gly Gin Glu Gin Ala Pro Pro lie Asn 

1970 1975 1980 

Thr Ser Ser Asp Thr Lys Lys lie Arg Arg Asp Ala Asn lie Glu Pro 
1985 1990 1995 2000 

lie lie Gly Thr Pro Tyr Ser Val lie Lys Ala Ser Lys Gly Val Ser 

2005 2010 2015 

lie Ser Val Leu Asp Asp Phe Asn Glu Asp Ser Pro Glu Asp Phe Ala 

2020 2025 2030 

Leu Lys Thr Ser lie lie Asn Asp Ala lie Arg Glu lie Gly Gin Arg 

2035 2040 2045 

Met Thr Tyr Trp lie Phe Asp His Gin Thr Gin Lys Asn lie His Tyr 

2050 2055 2060 

Ser Ser Pro Lys lie lie Gly Ser Asp Leu Lys Asn Gly Gin Arg Ser 
2065 2070 2075 2080 

Gly Gin Ser Trp Ala Pro Ser Ser Ser Ser Leu Thr Leu Ala Ser Asp 

2085 2090 2095 

Trp Asn Leu Pro Ser Leu Glu Leu Leu Tyr Arg Glu Leu Ala Thr Lys 

2100 2105 2110 

Gin Val Glu Lys Glu Glu Glu Glu Lys Ser Glu Glu Asp Lys Gly Gin 

2115 2120 2125 

Lys Leu Asn Glu Lys Leu Ser Phe Val Val Asn Lys Ala lie Gly Thr 

2130 2135 2140 

lie Gin Gin Gin His Gin Tyr Ser Gly Gly Gly Met Lys Arg Tyr Gin 
2145 2150 2155 2160 

Gin His Ser Ala Asp Gin Asn Gly Gly lie Asp Asp lie Glu Leu Met 

2165 2170 2175 

Asn Ser Lys Asp Ala Thr Ser Met Arg Lys Ala Lys Leu Ala Val Thr 

2180 2185 2190 

Asn Lys lie Ala Ala Ala Ala Ala Arg Asp Gly Glu Asn Ser Ser Ala 

2195 2200 2205 

Lys Pro Ser Asn Phe Gly Asn Arg Leu Asp Glu Ala lie Asn Pro Gly 

2210 2215 2220 

Ala Leu Leu Leu Arg Arg Gly Gly Gly Val Arg Gly Gly Gin Thr Pro 
2225 2230 2235 2240 

Gin Ser Ser Met Leu Thr Met Phe Arg Pro Gly Gin Thr Gly Gly Asn 

2245 2250 2255 

Ser Ser Trp Trp Thr Thr Asn Thr Pro Leu lie Gin Arg Thr Thr Ser 

2260 2265 2270 

Val Gly Asn Asn Leu Val Val Leu Val Pro Asn Leu Leu Asp Ser His 

2275 2280 2285 

Pro Pro Thr Phe Asn 
2290 



<210> 240 
<211> 4311 
<212> DNA 
<213> SHRIMP 
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<400> 240 

atggattttg 

gtgaagaaag 

aaggttatta 

gatggacaaa 

aagagcactg 

agtgtgcgca 

aaagtgtttt 

aagttaaacg 

aaacacaacg 

gattgtttta 

ggaaatacat 

gtaccagttt 

gccaagcttc 

tacatcactt 

tgtcacaccg 

catgtggccg 

gagtatgaaa 

ttttctaacg 

gatgttactg 

gacaggcgtc 

tcatcgtcat 

gttgtggagg 

ggttcagaag 

agaagggtta 

agtatcaata 

ccccaaactg 

aatattttgt 

atgagcagtt 

agtatccgtc 

ttgtttgatg 

tctgaaatat 

ggacaaaaga 

acggaatcta 

acagtcatac 

cgcaagcgta 

atgatgaaaa 

tctacaccag 

ctcagtttca 

atattcggtc 

aagacttttg 

tatgagaggg 

gaaaatgaaa 

ccgatcgctt 

aacaccccta 

aggacatatg 

attaagagag 

ctgtggtacc 

aacaacaata 

gaggaggata 

gggagaaaga 

aaaagtttgt 

ataagctgga 

acccttgatg 

ggaacagatg 

gtaggcccct 

ggcaaattgt 

gattcattaa 

ttgcatttgt 

tataatattt 

aagatctcat 



aaggaactac 
ttgcagagca 
aggacctgga 
aactggacaa 
cagataattc 
gaaccaagat 
ctgtcgagga 
ccaactgttc 
tctcctcatt 
tgcttgaaac 
atgcagtttt 
tcgacatgaa 
ctgttctgtg 
caattgagac 
aagatttgga 
tgtacgacta 
acacggctag 
aatgtgcaat 
ttaataatga 
tttttgatgc 
cgtcttcctc 
gggaggaaga 
atgctcttgt 
agaatgaacc 
acgaagatga 
aagctacagt 
atgaaactat 
accgtgtatt 
gccgtccaac 
aggttgtcac 
ttggaggaat 
agagtctgta 
atgtgaataa 
ctaacgtatt 
cctctagtgc 
atgatgaagg 
aacaacaaca 
taaaaatgcc 
aacgtctttg 
aagatcttga 
aggtaactcg 
tggatgtttt 
actttgaaga 
agcgcaaaaa 
aagaatacca 
aattggaagg 
tgactgtaat 
atgataacga 
aaatggaaga 
agggtcaaaa 
gtgaaaattc 
tgaacaagaa 
aggtctcaag 
ataaatatga 
ttagtatacc 
acgttaacac 
acgatgagga 
ctaagcatga 
ctcctagctc 
ctggtatgag 



cagttctacc 
ttcctttgcc 
aggggagagg 
ggctatgttg 
cattgaatcg 
tgaaagtgaa 
tttagaaatc 
aagaattgga 
tgagaacaac 
aacctatcct 
cttgactggt 
tgcaggtatt 
catgtttgga 
gcagtctttt 
gaggaagaag 
cagtggagac 
ttggtgcaac 
tagtgatatg 
agaacatgaa 
tagtccttcc 
tccaacagtt 
agaggaagaa 
tatgaagaaa 
tgcactcaaa 
tggtgatgat 
gaaaaattcc 
gcaaatttct 
caccaacaaa 
catttggact 
atcaattgtg 
gtttaacgcc 
tggaaacaag 
tgcactattt 
ctccttcaaa 
ttcatcatct 
ggaaaaggtt 
acaagctggt 
acgcagtaat 
tggactgtct 
aaatgaaatc 
cttgtatgag 
gtctcaccaa 
agtactggag 
caagtatgca 
tgcgtttagc 
agacaactat 
gaccaggaac 
agaaaccgat 
aagtatggac 
attcaacagt 
gatggtagtt 
aatccctccc 
gtttgtgttt 
aactgttttt 
tgttgatttt 
catagacgga 
tgttgattct 
cacctttttc 
ttctctcact 
caataataat 



ccctcaaaaa 
aatcttcatg 
aagaaaatgt 
gacgatatta 
accatcaagg 
gccaagaaca 
tactccaagg 
ggcaagtatg 
aacaaccaag 
cttgttgggt 
gttgggctag 
caaaccctaa 
cgtacagaat 
gacgaagagg 
cgcatgaatg 
gggaaagaac 
gcaaacggtg 
aatgacttgt 
gaacgttcta 
cccatcaaga 
cctcctccta 
attgatgaag 
ttagccaagc 
attacttctg 
gacgatgccg 
ttcatggccc 
cttgctaaaa 
ctccaagagt 
gaagaaagtc 
gcacatcaga 
aactctacca 
cacatttctt 
gcgtgggtga 
atggcatcag 
aacgatgaac 
gcacaagaat 
catgacaagg 
gtcaatggct 
gatgcttcca 
atgaggagct 
aaatgcaggt 
ggggaattgt 
aatattaaga 
aaggtactgg 
aagtttgtac 
acccatgacg 
aggatttgcg 
attgtggagg 
gtagaacaac 
attggggatc 
tctattgcaa 
ggtttcttga 
agcgatgtca 
ggagtcagta 
tcaagcgcag 
aagggcattt 
actacaacag 
aatattaata 
gaaaagaaaa 
ggcatgtgtg 



tgtcccagtt 
acaaggctac 
ctaccccaaa 
tcaacgagta 
aaattgaaaa 
gtgtaacttc 
ggcgagtgtg 
cagtgtcaat 
ttttctctga 
tcgaaacttc 
aaagatctct 
acatgactgg 
atgacaactt 
aaaatgatgc 
acgcaccagc 
aattgctcta 
tggtcacatc 
gctgttttgc 
tgaatattgt 
cggaagaaga 
caccatacga 
acgaaagtag 
tttctactat 
ggggtaacaa 
ttgacgctac 
caaacgacga 
tttgtaataa 
gtttgaatac 
aacaatttgc 
tggctcaaga 
acattaaggg 
cctcgtgctt 
aatcgaaact 
aaaagccctc 
accaagaacc 
catcatctcc 
aaactatcaa 
cggcttcata 
gcacatttaa 
cattcactag 
ctcaagctgt 
ttgccgagt t 
gttggagcct 
tgagcgttaa 
caatgttct t 
ttcacttttc 
atgtgctcca 
aagaggagga 
agaagcaagt 
aagtcattag 
ttaatagttt 
aggattctag 
aaatcaatag 
cgcgtgtgga 
gactagataa 
tgacaatttc 
acaagctaga 
agaataaggt 
agacaaaat t 
tacaaactcc 



gtattcatca 
tcttgcatca 
gtcctc ttct 
tcaggccgtt 
tgtacttgaa 
cagcccagaa 
caaaggtctc 
gagtatcaaa 
agaacccagg 
tacagaagat 
acctaaatat 
tttgaggatg 
ggaagatttt 
cagaatgagg 
gattacacct 
tatgataacc 
tgacagtgga 
tgactgcatc 
ggtcgaatct 
tggagaaaat 
aggtaacgca 
caagtatgaa 
gaaacaaatg 
tagtagcagt 
tgcattatgc 
gagaactgaa 
tccatcatct 
catggacgat 
taagggtttg 
tatttgcaag 
taaatatgaa 
caaaaccaac 
ccattctggc 
aaaaatgaag 
a tcaacaaaa 
ttcttcatcg 
t ttaattccc 
tttgtctgaa 
gagaatgtgc 
actgactaga 
agatattgag 
ct tggaggac 
agaaaacgtt 
tgctattcgt 
gttcaatctg 
ttccacttgc 
gtacatcaac 
aggagaagga 
tcgcaaggga 
aaaa tttgtg 
gatctctgga 
cacaatcaat 
gaaaatcaat 
ttcacatatt 
ggcctcatgt 
acccaaatat 
gaaggatatt 
tct tccattc 
caataggaag 
ttctagttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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aattcagtct 
tgctcccttt 
aggaacatgt 
catagacacg 
attattgata 
agcgccttta 
agtgatttga 
ttgacccatg 
aactatacat 
gctgctgctg 
gctgctgcaa 
attagaggaa 



cttccgtctc 
cttctacaaa 
ggaggtgtgg 
ctgtaaaact 
ggaataggaa 
ccggcgagtc 
acctatgtat 
aagagtacca 
ttaagagtat 
ctgatgatga 
tccaagactt 
agtacaagaa 



gtctattgta 
gaaaaagagc 
atttgttgta 
tgtagctgaa 
gattagattt 
tacatatttg 
ttatacttca 
ggacggaaat 
caaatcctgc 
tggttctact 
tatgcatgta 
atctttggca 



<210> 241 
<211> 1430 
<212> PRT 
<213> SHRIMP 



gctccttcat 
atctggaacg 
ccacccaaac 
actgcaccta 
aacggtctaa 
ctcaataaga 
tctttaaatg 
gctttggatg 
tcgtccaaag 
acttctacct 
atgattaaga 
aagaaaacaa 



cttctgttct 
agaacatgtt 
tttgcagttt 
aaacaaagtt 
agaaggtatg 
atatgactgc 
acccattgta 
actatggtgc 
atgaaaccgc 
cttcttcaac 
aaattgatgc 
agaagcatta 



ggctctatct 
tttgacatct 
tattgttaac 
gtgtaggaat 
caagagtgtt 
aacttcacct 
tacttgtaaa 
agtttttgta 
tgacgacaat 
tgatactgat 
catgaaagac 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4311 



<400> 241 



Met 


Asp 


Phe 


Glu 


Gly 


Thr 


Thr 


Ser 


Ser 


Thr 


Pro 


Ser 


Lys 


Met 


Ser 


Gin 


1 








5 










10 










15 




Lys 


Ser 


Val 


Lys 


Lys 


Val 


Ala 


Glu 


His 


Ser 


Phe 


Ala 


Asn 


Leu 


His 


Asp 








20 










25 










30 






Lys 


Ala 


Thr 


Leu 


Ala 


Ser 


Lys 


Val 


He 


Lys 


Asp 


Leu 


Glu 


Gly 


Glu 


Arg 






35 










40 










45 








Lys 


Lys 


Met 


Ser 


Thr 


Pro 


Lys 


Ser 


Ser 


Ser 


Asp 


Gly 


Gin 


Lys 


Leu 


Asp 




50 










55 










60 










Lys 


Ala 


Met 


Leu 


Asp 


Asp 


He 


He 


Asn 


Glu 


Tyr 


Gin 


Ala 


Val 


Lys 


Ser 


65 










70 










75 










80 


Thr 


Ala 


Asp 


Asn 


Ser 


He 


Glu 


Ser 


Thr 


lie 


Lys 


Glu 


He 


Glu 


Asn 


Val 










85 










90 










95 




Leu 


Glu 


Ser 


Val 


Arg 


Arg 


Thr 


Lys 


He 


Glu 


Ser 


Glu 


Ala 


Lys 


Asn 


Ser 








100 










105 










110 






Val 


Thr 


Ser 


Ser 


Pro 


Glu 


Lys 


Val 


Phe 


Ser 


Val 


Glu 


Asp 


Leu 


Glu 


He 






115 










120 










125 








Tyr 


Ser 


Lys 


Gly 


Arg 


Val 


Cys 


Lys 


Gly 


Leu 


Lys 


Leu 


Asn 


Ala 


Asn 


Cys 




130 










135 










140 










Ser 


Arg 


He 


Gly 


Gly 


Lys 


Tyr 


Ala 


Val 


Ser 


Met 


Ser 


He 


Lys 


Lys 


His 


145 










150 










155 










160 


Asn 


Val 


Ser 


Ser 


Phe 


Glu 


Asn 


Asn 


Asn 


Asn 


Gin 


Val 


Phe 


Ser 


Glu 


Glu 










165 










170 










175 




Pro 


Arg 


Asp 


Cys 


Phe 


Met 


Leu 


Glu 


Thr 


Thr 


Tyr 


Pro 


Leu 


Val 


Gly 


Phe 








180 










185 










190 






Glu 


Thr 


Ser 


Thr 


Glu 


Asp 


Gly 


Asn 


Thr 


Tyr 


Ala 


Val 


Phe 


Leu 


Thr 


Gly 






195 










200 










205 








Val 


Gly 


Leu 


Glu 


Arg 


Ser 


Leu 


Pro 


Lys 


Tyr 


Val 


Pro 


Val 


Phe 


Asp 


Met 




210 










215 










220 










Asn 


Ala 


Gly 


He 


Gin 


Thr 


Leu 


Asn 


Met 


Thr 


Gly 


Leu 


Arg 


Met 


Ala 


Lys 


225 










230 










235 










240 


Leu 


Pro 


Val 


Leu 


Cys 


Met 


Phe 


Gly 


Arg 


Thr 


Glu 


Tyr 


Asp 


Asn 


Leu 


Glu 










245 










250 










255 




* Asp 


Phe 


Tyr 


He 


Thr 


Ser 


He 


Glu 


Thr 


Gin 


Ser 


Phe 


Asp 


Glu 


Glu 


Glu 








260 










265 










270 






Asn 


Asp 


Ala 


Arg 


Met 


Arg 


Cys 


His 


Thr 


Glu 


Asp 


Leu 


Glu 


Arg 


Lys 


Lys 






275 










280 










285 








Arg 


Met 


Asn 


Asp 


Ala 


Pro 


Ala 


He 


Thr 


Pro 


His 


Val 


Ala 


Val 


Tyr 


Asp 




290 










295 










300 










Tyr 


Ser 


Gly 


Asp 


Gly 


Lys 


Glu 


Gin 


Leu 


Leu 


Tyr 


Met 


He 


Thr 


Glu 


Tyr 


305 










310 










315 










320 


Glu 


Asn 


Thr 


Ala 


Ser 


Trp 


Cys 


Asn 


Ala 


Asn 


Gly 


Val 


Val 


Thr 


Ser 


Asp 










325 










330 










335 
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Ser Gly Phe Ser Asn Glu Cys Ala lie Ser Asp Met Asn Asp Leu Cys 

340 345 350 

Cys Phe Ala Asp Cys He Asp Val Thr Val Asn Asn Glu Glu His Glu 

355 360 365 

Glu Arg Ser Met Asn He Val Val Glu Ser Asp Arg Arg Leu Phe Asp 

370 375 380 

Asp Ser Pro He Lys Thr Glu Glu Asp Gly Glu Asn Ser Ser Ser Ser 
385 390 395 400 

Ser Ser Ser Pro Thr Val Pro Pro Pro Thr Pro Tyr Glu Gly Asn Ala 

405 410 415 

Val Val Glu Gly Glu Glu Glu Glu Glu Glu He Asp Glu Asp Glu Ser 

420 425 430 

Ser Lys Tyr Glu Gly Ser Glu Asp Ala Leu Val Met Lys Lys Leu Ala 

435 440 445 

Lys Leu Ser Thr Met Lys Gin Met Arg Arg Val Lys Asn Glu Pro Ala 

450 455 460 

Leu Lys He Thr Ser Gly Gly Asn Asn Ser Ser Ser Ser He Asn Asn 
465 470 475 480 

Glu Asp Asp Gly Asp Asp Asp Asp Ala Val Asp Ala Thr Ala Leu Cys 

485 490 495 

Pro Gin Thr Glu Ala Thr Val Lys Asn Ser Phe Met Ala Pro Asn Asp 

500 505 510 

Glu Arg Thr Glu Asn He Leu Tyr Glu Thr Met Gin He Ser Leu Ala 

515 520 525 

Lys He Cys Asn Asn Pro Ser Ser Met Ser Ser Tyr Arg Val Phe Thr 

530 535 540 

Asn Lys Leu Gin Glu Cys Leu Asn Thr Met Asp Asp Ser lie Arg Arg 
545 550 555 560 

Arg Pro Thr He Trp Thr Glu Glu Ser Gin Gin Phe Ala Lys Gly Leu 

565 570 575 

Leu Phe Asp Glu Val Val Thr Ser He Val Ala His Gin Met Ala Gin 

580 585 590 

Asp He Cys Lys Ser Glu He Phe Gly Gly Met Phe Asn Ala Asn Ser 

595 600 605 

Thr Asn He Lys Gly Lys Tyr Glu Gly Gin Lys Lys Ser Leu Tyr Gly 

610 615 620 

Asn Lys His Ser Ser Cys Phe Lys Thr Asn Thr Glu Ser Asn Val Asn 
625 630 635 640 

Asn Ala Leu Phe Ala Trp Val Lys Ser Lys Leu His Ser Gly Thr Val 

645 650 655 

He Pro Asn Val Phe Ser Phe Lys Met Ala Ser Glu Lys Pro Ser Lys 

660 665 670 

Met Lys Arg Lys Arg Thr Ser Ser Ala Ser Ser Ser Asn Asp Glu His 

675 680 685 

Gin Glu Pro Ser Thr Lys Met Met Lys Asn Asp Glu Gly Glu Lys Val 

690 695 700 

Ala Gin Glu Ser Ser Ser Pro Ser Ser Ser Ser Thr Pro Glu Gin Gin 
705 710 715 720 

Gin Gin Ala Gly His Asp Lys Glu Thr He Asn Leu He Pro Leu Ser 

725 730 735 

Phe He Lys Met Pro Arg Ser Asn Val Asn Gly Ser Ala Ser Tyr Leu 

740 745 750 

Ser Glu He Phe Gly Gin Arg Leu Cys Gly Leu Ser Asp Ala Ser Ser 

755 760 765 

Thr Phe Lys Arg Met Cys Lys Thr Phe Glu Asp Leu Glu Asn Glu He 

770 775 780 

Met Arg Ser Ser Phe Thr Arg Leu Thr Arg Tyr Glu Arg Glu Val Thr 
785 790 795 800 

Arg Leu Tyr Glu Lys Cys Arg Ser Gin Ala Val Asp He Glu Glu Asn 

805 810 815 

Glu Met Asp Val Leu Ser His Gin Gly Glu Leu Phe Ala Glu Phe Leu 
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820 










825 










830 






1 ii 

\3 JL ii 




Pro 

835 


He 


Ala 


±yr 


trl 1c 


V? J-U 

840 


VJL Li 


VOX 




Glu 


Asn 
845 


He 




Ser 


Trp 


•JCJL 

ocn 


Leu 


UlU 


Asn 


vox 


Asn 

Djj 


X III 




T \/c 
Xjy o 


Arg 


Lys 

OOU 


Asn 


Lys 


lyr 


Ala 


Lys 


Val 


Leu 


val 


Oct 


vai 


Asn 


nla 


lie 


Arg 


Arg 


i nr 


Tyr 


UiU 


olU 


Tyr 


865 










870 










875 










880 


MIS 


n!3 


rile 


Ser 


Lys 
885 


rne 


vai 


Pro 


rae c. 


irne 
890 


Leu 


OK e» 

rne 


Asn 


Leu 


Tl * 

a le 
895 


Lys 


Arg 


bill 


Leu 


olU 

ann 
yuu 


ft i » 
iaxy 


Asp 


Asn 


Tyr 


inr 
yuo 


HIS 


Asp 


vai 


HIS 


rne 

qi n 
yiu 


oer 


ber 


i nr 


Cys 


Leu 
yio 


Trp 


Tyr 


Leu 


inr 


Val 
920 


Met 


inr 


Arg 


Asn 


Arg 
925 


He 


Cys 


Asp 


val 


Leu 
930 


Gin 


Tyr 


lie 


Asn 


Asn 
935 


Asn 


Asn 


Asn 


Asp 


Asn 
940 


Glu 


Glu 


i nr 


Asp 


He 


Val 


Glu 




/Tin 
olu 


Glu 


Glu 


Gly 


Glu 


Liiy 


Glu 


Glu 


Asp 


Lys 




ulU 


945 










950 










955 












Glu 


Ser 


Met 


Asp 


Val 
965 


Glu 


Gin 


Gin 


Lys 


Gin 
970 


val 


Arg 


Lys 


Gly 


Gly 
975 


Arg 


Lys 


Lys 


Gly 


Gin 
980 


Lys 


Phe 


Asn 


Ser 


He 
985 


Gly 


Asp 


Gin 


Val 


He 
990 


Arg 


Lys 


Phe 


Val 


Lys 


Ser 


Leu 


Cys 


Glu 


Asn 


Ser 


Met 


Val 


Val 


Ser 


He 


Ala 


He 






995 










1000 








1005 






Asn 


Ser 


Leu 


He 


Ser 


Gly 


He 


Ser 


Trp 


Met 


Asn 


Lys 


Lys 


He 


Pro 


Pro 




1010 








1015 








1020 








Gly 


Phe 


Leu 


Lys 


Asp 


Ser 


Ser 


Thr 


He 


Asn 


Thr 


Leu 


Asp 


Glu 


Val 


Ser 


1025 








1030 








1035 








1040 


Arg 


Phe 


Val 


Phe 


Ser 


Asp 


Val 


Lys 


He 


Asn 


Arg 


Lys 


He 


Asn 


Gly 


Thr 










1045 








1050 








1055 


Asp 


Asp 


Lys 


Tyr 


Glu 


Thr 


Val 


Phe 


Gly 


Val 


Ser 


Thr 


Arg 


Val 


Asp 


Ser 








1060 








1065 








1070 




His 


He 


Val 


Gly 


Pro 


Phe 


Ser 


He 


Pro 


Val 


Asp 


Phe 


Ser 


Ser 


Ala 


Gly 






1075 








1080 








1085 






Leu 


Asp 


Lys 


Ala 


Ser 


Cys 


Gly 


Lys 


Leu 


Tyr 


Val 


Asn 


Thr 


He 


Asp 


Gly 




1090 








1095 








1100 








Lys 


Gly 


He 


Leu 


Thr 


He 


Ser 


Pro 


Lys 


Tyr 


Asp 


Ser 


Leu 


Asn 


Asp 


Glu 


1105 








1110 








1115 








1120 


Asp 


Val 


Asp 


Ser 


Thr 


Thr 


Thr 


Asp 


Lys 


Leu 


Glu 


Lys 


Asp 


He 


Leu 


His 










1125 








1130 








1135 


Leu 


Ser 


Lys 


His 


Asp 


Thr 


Phe 


Phe 


Asn 


He 


Asn 


Lys 


Asn 


Lys 


Val 


Leu 








1140 








1145 








1150 




Pro 


Phe 


Tyr 


Asn 


He 


Ser 


Pro 


Ser 


Ser 


Ser 


Leu 


Thr 


Glu 


Lys 


Lys 


Lys 






1155 








1160 








1165 






Thr 


Lys 


Phe 


Asn 


Arg 


Lys 


Lys 


He 


Ser 


Ser 


Gly 


Met 


Ser 


Asn 


Asn 


Asn 




1170 








1175 








1180 








Gly Met 


Cys 


Val 


Gin 


Thr 


Pro 


Ser 


Ser 


Ser 


Asn 


Ser 


Val 


Ser 


Ser 


val 


1185 








1190 








1195 








1200 


Ser 


Ser 


He 


Val 


Ala 


Pro 


Ser 


Ser 


Ser 


Val 


Leu 


Ala 


Leu 


Ser 


Cys 


Ser 










1205 








1210 








1215 


Leu 


Ser 


Ser 


Thr 


Lys 


Lys 


Lys 


Ser 


He 


Trp 


Asn 


Glu 


Asn 


Met 


Phe 


Leu 








1220 








1225 








1230 




Thr 


Ser 


Arg 


Asn 


Met 


Trp 


Arg 


Cys 


Gly 


Phe 


Val 


Val 


Pro 


Pro 


Lys 


Leu 






1235 








1240 








1245 






Cys 


Ser 


Phe 


He 


Val 


Asn 


His 


Arg 


His 


Ala 


Val 


Lys 


Leu 


Val 


Ala 


Glu 




1250 








1255 








1260 








Thr 


Ala 


Pro 


Lys 


Thr 


Lys 


Leu 


Cys 


Arg Asn 


He 


He 


Asp 


Arg 


Asn 


Arg 


1265 








1270 








1275 








1280 


Lys 


He 


Arg 


Phe 


Asn 


Gly 


Leu 


Lys 


Lys 


Val 


Cys 


Lys 


Ser 


Val 


Ser 


Ala 










1285 








1290 








1295 


Phe 


Thr 


Gly 


Glu 


Ser 


Thr 


Tyr 


Leu 


Leu 


Asn 


Lys 


Asn 


Met 


Thr 


Ala 


Thr 



1300 1305 1310 
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Ser Pro Ser Asp Leu Asn Leu Cys He Tyr Thr Ser Ser Leu Asn Asp 

1315 1320 1325 

Pro Leu Tyr Thr Cys Lys Leu Thr His Glu Glu Tyr Gin Asp Gly Asn 

1330 1335 1340 

Ala Leu Asp Asp Tyr Gly Ala Val Phe Val Asn Tyr Thr Phe Lys Ser 
1345 1350 1355 1360 

He Lys Ser Cys Ser Ser Lys Asp Glu Thr Ala Asp Asp Asn Ala Ala 

1365 1370 1375 

Ala Ala Asp Asp Asp Gly Ser Thr Thr Ser Thr Ser Ser Ser Thr Asp 

1380 1385 1390 

Thr Asp Ala Ala Ala He Gin Asp Phe Met His Val Met He Lys Lys 

1395 1400 1405 

He Asp Ala Met Lys Asp He Arg Gly Lys Tyr Lys Lys Ser Leu Ala 

1410 1415 1420 

Lys Lys Thr Lys Lys His 
1425 1430 



<210> 242 
<211> 909 
<212> DNA 
<213> SHRIMP 



<400> 242 

atggtgtcgt 

gtttttacaa 

gacgttattg 

ggtgacgacg 

cttggacgca 

cgcaaaaagc 

aagcgtgccg 

gccgcacctc 

cctccacctg 

cctgaggatg 

gatgaagaag 

gaagaggatg 

gatgatttct 

ttctaccgca 

cgcaaaaagc 

aagcgttaa 



ctattaccca 
ctgaaggagc 
accccgacca 
acgacgacga 
aaaagcgtgc 
gtgccgcacc 
cacctccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgagtt 
agttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 



cctctctctg 
tagtgtgaga 
ccgctgccaa 
tgacgatgga 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
agaggatgag 
tgagttctac 
ctaccgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 



ttgttcgtcg 
gtgaaacggt 
gggcgactgt 
ggaactttcg 
cctgaggatg 
gatgaagaag 
gaagaggatg 
gatgagttct 
ttctaccgca 
cgcaaaaagc 
aagcgtgccg 
gccgcacctc 
cctccacctg 
cctgaggatg 
gatgaagaag 



tggctgtagt 
gtgctgttag 
gccgcaggtc 
atacagtagg 
aagaagagga 
aggatgattt 
agttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 
cacctccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgattt 



agcttccgtc 
cccgtgcccc 
tactcgagga 
gtctggtata 
tgatttctac 
ctaccgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
agaggatgat 
tgatttctac 
ctaccgcaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

909 



<210> 243 
<211> 302 
<212> PRT 
<213> SHRIMP 



<400> 243 



Met 


Val 


Ser 


Ser 


He 


Thr 


His 


Leu 


Ser 


Leu 


Leu 


Phe 


Val 


Val 


Ala 


Val 


1 








5 










10 










15 




Val 


Ala 


Ser 


Val 
20 


Val 


Phe 


Thr 


Thr 


Glu 
25 


Gly 


Ala 


Ser 


Val 


Arg 
30 


Val 


Lys 


Arg 


Cys 


Ala 
35 


Val 


Ser 


Pro 


Cys 


Pro 
40 


Asp 


Val 


He 


Asp 


Pro 
45 


Asp 


His 


Arg 


Cys 


Gin 


Gly 


Arg 


Leu 


Cys 


Arg 


Arg 


Ser 


Thr 


Arg 


Gly 


Gly 


Asp 


Asp 


Asp 


• 


50 










55 










60 










Asp 


Asp 


Asp 


Asp 


Asp 


Gly 


Gly 


Thr 


Phe 


Asp 


Thr 


Val 


Gly 


Ser 


Gly 


He 


65 










70 










75 










80 


Leu 


Gly 


Arg 


Lys 


Lys 
85 


Arg 


Ala 


Ala 


Pro 


Pro 
90 


Pro 


Glu 


Asp 


Glu 


Glu 
95 


Glu 


Asp 


Asp 


Phe 


Tyr 
100 


Arg 


Lys 


Lys 


Arg 


Ala 
105 


Ala 


Pro 


Pro 


Pro 


Glu 
110 


Asp 


Glu 
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olU 


ulU 


Asp 


Asp 






j. j. j 




Asp 


W>XU 


rM ii 






-X J w 






Pro 


bill 


Asp 




145 








Pro 


Pro 


Pro 


Glu 


ax a. 


Ala 




Pro 








i on 


Lys 


Arg 


Axa 


A±a 






195 




Arg 


Lys 


Lys 


Arg 




210 






Phe 


Tyr 


Arg 


Lys 


225 








Asp 


Asp 


Phe 


Tyr 


Glu 


Glu 


Asp 


Asp 








260 


Asp 


Glu 


Glu 


Glu 






275 




Pro 


Glu 


Asp 


Glu 



290 



Phe Tyr Arg Lys 
120 

Asp Glu Phe Tyr 
135 

Glu Glu Asp Glu 
150 

Asp Glu Glu Glu 
165 

Pro Glu Asp Glu 

Pro Pro Pro Glu 
200 

Ala Ala Pro Pro 
215 

Lys Arg Ala Ala 
230 

Arg Lys Lys Arg 
245 

Phe Tyr Arg Lys 

Asp Asp Phe Tyr 
280 

Glu Glu Asp Asp 
295 



Lys Arg Ala Ala 

Arg Lys Lys Arg 
140 

Phe Tyr Arg Lys 
155 

Asp Glu Phe Tyr 
170 

Glu Glu Asp Glu 
185 

Asp Glu Glu Glu 

Pro Glu Asp Glu 
220 

Pro Pro Pro Glu 
235 

Ala Ala Pro Pro 
250 

Lys Arg Ala Ala 
265 

Arg Lys Lys Arg 

Phe Tyr Arg Lys 
300 



Pro Pro Pro Glu 
125 

Ala Ala Pro Pro 

Lys Arg Ala Ala 
160 

Arg Lys Lys Arg 
175 

Phe Tyr Arg Lys 
190 

Asp Glu Phe Tyr 
205 

Glu Glu Asp Glu 

Asp Glu Glu Glu 
240 

Pro Glu Asp Glu 
255 

Pro Pro Pro Glu 
270 

Ala Ala Pro Pro 
285 

Lys Arg 



<210> 244 
<211> 1119 
<212> DNA 
<213> SHRIMP 



<400> 244 

atggataact 

cagttggata 

aatagtgaac 

gagatacaac 

gatgatatca 

ataattgaag 

atagcggaca 

agacactcta 

tgtgacgact 

aaattcatta 

ataattttca 

gtaaaagaag 

gacaacctca 

tttgatcacg 

tacttgaaaa 

ttgacttttc 

tctgatagag 

caactcccca 

tctctttctg 



tgaaagggga 
gatctatatt 
aaacagctaa 
aattcaagga 
tctacatttt 
cgagggaata 
tgttaacctt 
gtgaacaagc 
tgtgcaatca 
ctttccatgc 
aagatgtcat 
gtgtggccat 
aggataatta 
ggggacattt 
cactcatggg 
taacagtgta 
agatcaataa 
aacctaaaca 
tccaaaaaca 



atttgttgcg 
ggtatttgtt 
aaaggaaggt 
cgaaataaac 
tgatcatgga 
ctctaaaccg 
gacttttatg 
cactaatagt 
attaccaaaa 
gaacctatac 
acaacaactt 
tagatgtgcg 
caatacattt 
tgaagaagta 
attaaaatgc 
cactaatgtt 
ttatcttgtc 
ataccgtccc 
tctgagtggg 



cttaaaacag 
gatgttgttg 
ctagcaacta 
aacaaatata 
gggagtttca 
ctgagggaat 
actgtgtaca 
ataaatgtca 
gaagaggaag 
atgctggaca 
cacgtgattt 
aaacagatga 
tcaaacattt 
aaacacaaag 
atgttcaaac 
atagcagaat 
caacttgtaa 
ctcagtttga 
tttctttag 



acctcaccca 
gtagattata 
gagtggcaaa 
atgctctaac 
aaagagcaaa 
tagagtgcat 
ccaatatcat 
ccctcggacg 
aagaggaaga 
cacgcctaaa 
tgcaaaagga 
acgaaataag 
tgaatgaaat 
ccataactct 
gcatatccga 
ttataaacgc 
catgtaacga 
tagataacat 



ttacaaaaca 
tgttatagta 
gcaagccaca 
aaatactttg 
acataaggcc 
gtttacgcgt 
tactgaatt t 
tcttttcttg 
tttgaaacag 
gaaagat ttg 
tacctatgct 
tcaatacagg 
tgtctacatt 
gactagaaat 
aatgttgtca 
tagcaatatt 
attgtgcaac 
agcttatttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 245 
<211> 368 
<212> PRT 
<213> SHRIMP 



<400> 245 

Met Asp Asn Leu Lys Gly Glu Phe Val Lys Thr Asp Leu Thr His Tyr 

15 10 15 

Lys Thr Gin Leu Asp Arg Ser lie Leu Val Phe Val Asp Val Val Gly 
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Arg 


Leu 


Tyr 


Val 






35 




Leu 


Ala 


Thr 


Arg 




50 






Asp 


Glu 


He 


Asn 


65 








He 


He 


Tyr 


He 


Lys 


Ala 


He 


He 








100 


Glu 


Cys 


Met 


Phe 






115 




Thr 


Val 


Tyr 


Thr 




130 






Ala 


Thr 


Asn 


Ser 


145 








Asp 


Leu 


Cys 


Asn 


Lys 


Gin 


Lys 


Phe 








180 


Arg 


Leu 


Lys 


Lys 






195 




His 


Val 


He 


Leu 




210 






He 


ftt y 


Cys 


Ala 


225 








Leu 


L ys 


Asp 


Asn 


Tyr 


He 


Phe 


Asp 








260 


He 


Thr 


Leu 


Thr 






275 




Met 


Phe 


Lys 


Arg 




290 






Tyr 


Thr 


Asn 


Val 


305 








He 


Asn 


Asn 


Tyr 


Gin 


Leu 


Pro 


Lys 








340 


He 


Ala 


Tyr 


Phe 






355 





He 


Val 


Asn 


Ser 








40 


Val 


Ala 


Lys 


Gin 






55 




Asn 


Lys 


Tyr 


Asn 




70 






Phe 


Asp 


His 


Glv 


85 








Glu 


Ala 


Arg 


Glu 


Thr 


Arg 


He 


Ala 








120 


Asn 


He 


He 


Thr 






135 




He 


Asn 


Val 


Thr 




150 






Gin 


Leu 


Pro 


Lvs 


165 








He 


Thr 


Phe 


His 


Asp 


Leu 


He 


He 








200 


Gin 


Lys 


Asp 


Thr 






215 




Lys 


Gin 


Met 


Asn 




230 






Tvr 


Asn 


Thr 


Phe 


245 








His 


Gly 


Gly 


His 


Arg 


Asn 


Tyr 


Leu 








280 


lie 


Ser 


Glu 


Met 






295 




He 


Ala 


Glu 


Phe 




310 






Leu 


Val 


Gin 


Leu 


325 








Pro 


Lys 


Gin 


Tyr 


Ser 


Leu 


Ser 


Val 



360 



25 

Glu Gin Thr Ala 

Ala Thr Glu He 
60 

Ala Leu Thr Asn 
75 

Gly Ser Phe Lys 
90 

Tyr Ser Lys Pro 
105 

Asp Met Leu Thr 

Glu Phe Arg His 
140 

Leu Gly Arg Leu 
155 

Glu Glu Glu Glu 
170 

Ala Asn Leu Tyr 
185 

Phe Lys Asp Val 

Tyr Ala Val Lys 
220 

Glu He Ser Gin 

235 

Ser Asn He Leu 
250 

Phe Glu Glu Val 
265 

Lys Thr Leu Met 

Leu Ser Leu Thr 
300 

He Asn Asn He 
315 

Val Thr Cys Asn 
330 

Arg Pro Leu Ser 
345 

Gin Lys His Leu 



30 



Lvs 


Lys 


Glu 


Gly 


45 








Gin 


Gin 


Phe 


Lys 


Thr 


Leu 


Asp 


Asp 








80 


Arg 


Ala 


Lys 


His 






95 




Leu 


Arg 


Glu 


Leu 




110 






Leu 


Thr 


Phe 


Met 


125 








Ser 


Ser 


Glu 


Gin 


Phe 


Leu 


Cys 


Asp 








160 


Glu 


Glu 


Asp 


Leu 






175 




Met 


Leu 


Asp 


Thr 




190 






He 


Gin 


Gin 


Leu 


205 








Glu 


Gly Val 


Ala 


Tyr 


Arg 


Asp 


Asn 








240 


Asn 


Glu 


He 


Val 






255 




Lys 


His 


Lys 


Ala 




270 






Gly 


Leu 


Lys 


Cys 


285 








Phe 


Leu 


Thr 


Val 


Ser 


Asp 


Arg 


Glu 








320 


Glu 


Leu 


Cys 


Asn 






335 




Leu 


He 


Asp 


Asn 




350 






Ser 


Gly 


Phe 


Leu 


365 









<210> 246 
<211> 1545 
<212> DNA 
<213> SHRIMP 

<400> 246 

atggatagtt gttgcctgat atcgaggata acaccagaac tggctggcaa gttaacctgg 60 

atcttcatac cagaaaacaa cttcaagatt gtccagaact cactcccaga cgaccaagtt 120 

atctcccaat tcagatattt cgaccataga cattgctata cgtttatgga gattttgatg 180 

gcaaacatta aaatccaaga caggaaacaa aacaccacag ccatatgtga attgacaact 240 

ggaagagaag gacttttatg tagaagaacc atacctgtat ttttgggttc agaggaaaaa 300 

cgagaagagt tattggggaa tctccctgaa ggtgcagaaa ttttcaggcc tagagaagtt 360 

atgcaagtaa ttggtactct cttggacaag aaactagaaa ttgacgacgg tatagcttct 4 20 

gtaaaggctg ccctctgtgc tggttcatca tcgttatacc taatcatgag ccacatagtg 4 80 

aaaatgacct tttctgctat cacaaacatg aaggatataa acgaagaata tttcgtagac 54 0 

tttatatttc gtcataaaca attcctcaac cctgaattct tcaagcacct tatatctttg 600 
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ctcaagaatt ccaggaagga acatgttgcc catctagtaa gacgtctaga acactttctc 660 

atgctatgga ccctttccaa gatgaggttc acagaaatgg aagaaaacta cttcccaatc 720 

tccagcgata gtgattacgg catctgtgaa aaatgtgcac gaaaaactcc caaatacaag 780 

ctccgtattt ttagggaacg aaaatgctgc gatagatgtt gccgtcttta tcaccaacaa 840 

ccgcctccgg aggtgtataa ttgggatgga aaaataaccc aacaatccaa taaaggctac 900 

attaatgcag gcgatgaaat tatcggcatg ctaaactcaa atgataaggg aaaaacattc 960 

cctcctatac ctaagatggt tgtacgaaga gtggtggacg gtgtctacgg gcaaggaact 1020 

atcctgtcaa agattttgaa gttcaggcag gcaaatatcc ccacgtgtct attcgtgaca 1080 

tgcaataaat gcaataggat tttcaggctc actatcttag ggcctacaag aaacatcctt 1140 

tgcccacctt gcagaaagaa aagtgttgca gtaaatacac aacagaaagg agaaaataaa 1200 

ccttcgtttg tgcaaaaagg aacaaaacgt ctacgagtgg ataccggtag caacaagaac 1260 

acgttagaaa aattctgttc ctgggaaaga ttcaatactg aagttttgct cccttggctt 1320 

ggctacacta ttgagtctaa gtggcagaac tgggaatctt ttctgggtta ttcgagtacc 1380 

agatataagg aactgtgggc ctttgtgaac aaacaggaaa tatcttccat gaaagactcc 144 0 

tacataaaaa ttgaagacat cgaccagtta ttgaggagta tcttgcaaga ccagaagggt 1500 

gtatttgaga ccgtctgcaa aataaagagc agagatggtt tgtga 154 5 



<210> 247 
<211> 514 
<212> PRT 
<213> SHRIMP 



<400> 247 



Met 


Asp 


Ser 


Cys 


1 








Lys 


Leu 


Thr 


Trp 








20 


Asn 


Ser 


Leu 


Pro 






35 




His 


Arg 


His 


Cys 




50 






lie 


Gin 


Asp 


Arg 


65 








Gly 


Arg 


Glu 


Gly 


Ser 


Glu 


Glu 


Lys 








100 


Glu 


He 


Phe 


Arg 






115 




Asp 


Lys 


Lys 


Leu 




130 






Leu 


Cys 


Ala 


Gly 


145 








Lys 


Met 


Thr 


Phe 


Tyr 


Phe 


Val 


Asp 








180 


Phe 


Phe 


Lys 


His 






195 




Val 


Ala 


His 


Leu 




210 






Leu 


Ser 


Lys 


Met 


225 








Ser 


Ser 


Asp 


Ser 


Pro 


Lys 


Tyr 


Lys 








260 


Cys 


Cys 


Arg 


Leu 






275 




Asp 


Gly 


Lys 


lie 




290 







Cys 


Leu 


He 


Ser 


5 








He 


Phe 


He 


Pro 


Asp 


Asp 


Gin 


Val 








40 


Tyr 


Thr 


Phe 


Met 






55 




Lys 


Gin 


Asn 


Thr 




70 






Leu 


Leu 


Cys 


Arg 


85 








Arg 


Glu 


Glu 


Leu 


Pro 


Arg 


Glu 


Val 








120 


Glu 


He 


Asp 


Asp 






135 




Ser 


Ser 


Ser 


Leu 




150 






Ser 


Ala 


He 


Thr 


165 








Phe 


He 


Phe 


Arg 


Leu 


He 


Ser 


Leu 








200 


Val 


Arg 


Arg 


Leu 






215 




Arg 


Phe 


Thr 


Glu 




230 






Asp 


Tyr 


Gly 


He 


245 








Leu 


Arg 


He 


Phe 


Tyr 


His 


Gin 


Gin 








280 


Thr 


Gin 


Gin 


Ser 






295 





Arg lie Thr Pro 
10 

Glu Asn Asn Phe 
25 

lie Ser Gin Phe 

Glu He Leu Met 
60 

Thr Ala He Cys 
75 

Arg Thr He Pro 
90 

Leu Gly Asn Leu 
105 

Met Gin Val He 

Gly He Ala Ser 
140 

Tyr Leu He Met 
155 

Asn Met Lys Asp 
170 

His Lys Gin Phe 
185 

Leu Lys Asn Ser 

Glu His Phe Leu 
220 

Met Glu Glu Asn 
235 

Cys Glu Lys Cys 
250 

Arg Glu Arg Lys 
265 

Pro Pro Pro Glu 

Asn Lys Gly Tyr 
300 



Glu Leu Ala Gly 
15 

Lys He Val Gin 
30 

Arg Tyr Phe Asp 
45 

Ala Asn He Lys 

Glu Leu Thr Thr 
80 

Val Phe Leu Gly 
95 

Pro Glu Gly Ala 
110 

Gly Thr Leu Leu 
125 

Val Lys Ala Ala 

Ser His He Val 
160 

He Asn Glu Glu 
175 

Leu Asn Pro Glu 
190 

Arg Lys Glu His 
205 

Met Leu Trp Thr 

Tyr Phe Pro He 
240 

Ala Arg Lys Thr 
255 

Cys Cys Asp Arg 
270 

Val Tyr Asn Trp 
285 

lie Asn Ala Gly 
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Asp 


GlU 


He 


He 


Gly Met 


Leu 


Asn 


t> r\ c 

305 










310 






Pro 


Pro 


He 


Pro 


Lys 


Met 


Val 


Val 










o o c 

325 








Gly 


Gin 


Gly 


Thr 


He 


Leu 


Ser 


Lys 








340 










lie 


Pro 


Thr 


Cys 


Leu 


Phe 


Val 


Thr 






355 










360 


Arg 


Leu 


Thr 


He 


Leu 


Gly 


Pro 


Thr 




370 










375 




Arg 


Lys 


Lys 


Ser 


IT . 1 

Val 


Ala 


Val 


Asn 


385 










390 






Pro 


Ser 


Phe 


Val 


Gin 


Lys 


Gly 


Thr 










4 05 








Ser 


Asn 


Lys 


Asn 


Thr 


Leu 


Glu 


Lys 








420 










Thr 


Glu 


Val 


Leu 


Leu 


Pro 


Trp 


Leu 






435 










440 


Gin 


Asn 


Trp 


Glu 


Ser 


Phe 


Leu 


Gly 




450 










455 




Leu 


Trp 


Ala 


Phe 


Val 


Asn 


Lys 


Gin 


4 65 










470 






Tyr 


He 


Lys 


He 


Glu 


Asp 


He 


Asp 










485 








Asp 


Gin 


Lys 


Gly Val 


Phe 


Glu 


Thr 








500 










Gly 


Leu 















Ser 


Asn 


Asp 
315 


Lys 


Gly 


Lys 


inr 


Fne 
320 


Arg 


Arg 


Val 


val 


Asp 


Gly 


val 


Tyr 


He 


Leu 


Lys 


Phe 


Arg 


Gin 


Ala 


Asn 


345 










350 






Cys 


Asn 


Lys 


Cys 


Asn 
365 


Arg 


He 


Phe 


Arg 


Asn 


He 


Leu 
380 


Cys 


Pro 


Pro 


Cys 


Thr 


Gin 


Gin 


Lys 


Gly Glu 


Asn 


Lys 






395 










400 


Lys 


Arg 

41U 


Leu 


Arg 


Val 


Asp 


Thr 

4 ID 


Gly 


Phe 


Cys 


Ser 


Trp 


Glu 


Arg 


Phe 


Asn 


425 










430 






Gly 


Tyr 


Thr 


He 


Glu 
445 


Ser 


Lys 


Trp 


Tyr 


Ser 


Ser 


Thr 
460 


Arg 


Tyr 


Lys 


Glu 


Glu 


He 


Ser 
475 


Ser 


Met 


Lys 


Asp 


Ser 
480 


Gin 


Leu 
490 


Leu 


Arg 


Ser 


He 


Leu 
495 


Gin 


Val 


Cys 


Lys 


He 


Lys 


Ser 


Arg 


Asp 


505 










510 







<210> 248 
<211> 1242 
<212> DNA 
<213> SHRIMP 



<400> 248 

atggagtcaa 

gctgaagaaa 

tttgacaaat 

agagaaaacc 

tacaaaaaag 

aaagattggg 

tttgcatcct 

cagattccag 

aacgtctacg 

tttaacgctg 

atgcaaagca 

ttctttagtg 

ctcacctcct 

atgctgttga 

actgaagccg 

ggaatgaact 

gaaatgggac 

tccctcgaga 

gtcatggctt 

ctcccaattc 

gaagacggcg 

<210> 249 
<211> 409 
<212> PRT 



tcaaactgtt 
tcaagtcaga 
tcaccaaaaa 
ccaaccgttt 
aggttgcctg 
agaaactcac 
ctgacggaat 
aagcgaggag 
gagaactgat 
cacaacactt 
ataacgattt 
gtgcattcgc 
ccaatgagtt 
aaaagggttt 
tccgaattga 
gcaagttgat 
tagaaaagca 
ataagaccaa 
ctatcaataa 
tgactgcacc 
tcggggacta 



caccgttgct 
atataaaacc 
actcattatg 
tgtatcccgc 
tttttggaca 
tcaagatgag 
tgtaattgaa 
tttctttgac 
tgatagactg 
ccccgccatc 
ggcggaacta 
atccattttc 
catttctagg 
tgttgatacc 
acaagaattt 
gagccagtac 
ctataatgtt 
cttttttgaa 
gatcaagaag 
tcctccagtc 
catcagttat 



ggtctgaata 
gaggaggaaa 
caagtagata 
cccattgtcc 
ttggaagaga 
aaggatttca 
aatcttacaa 
ttccaagttg 
gtgcccgacg 
aagaagaagg 
attgttgcct 
tggatcaaga 
gacgaaggtc 
ccatcaagag 
ctcacagttt 
attgaatttg 
accaacccct 
aagagagtcg 
gaccaacaaa 
tcttcctcat 
gacgattttt 



tggagcaagc 
agaggattgc 
cgtctaaaca 
atgaagatct 
ttgatttcga 
ttctccagat 
cacg tcttcg 
gaatggagag 
aaaaagacaa 
agcagtgggc 
ttgctgcagt 
acaggggtat 
ttcatcgcga 
aaaggattct 
ccctgcctgt 
tggcagataa 
tcccattcat 
ccgagtatca 
cccaagaaac 
catccgaaca 
ag 



caaccaagtg 
ccaggaagtg 
cttacttaca 
ctgggaaatg 
aagggatcct 
tctggcgttc 
tcaagtggcg 
tattcatggc 
ggctatcttg 
tattaattgg 
tgaaggaatc 
tttgcctggt 
ctttgcatgc 
tgaaattgtc 
taaattagtg 
actattggtt 
ggacaatatt 
acgtgcccag 
tggttctcct 
agaagatgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1242 



WO 01/38351 



344 



PCT/USOO/28888 



<213> SHRIMP 



<400> 249 



Mot- 


UlU 


Car 

OtJ x 


Tip 
11C 


Lys 


Leu 


Phe 


Thr 


Vai 


Ala 


Gin 


Met 


Glu 


Gin 


Ala 


Asn 


1 


















1 0 










15 




uJ.n 


Vai 


nl cl 


V3l u 


V3l U 


Tip 
1 Xti 


Lys 




wl U 


xyr 


Lys 


Thr 


Glu 


Glu 


Glu 


Lys 




























30 






Arg 


lie 


nlo 


V31I1 




v a. -L 




Asp 


Lys 


Phe 


Thr 

X I1X 


Lys 


Lys 


Leu 


He 


Met 






JO 










40 


















Ui.Il 


V a X 


Asp 


Th r 

J. UX 


OCX. 


Lys 


His 


Leu 


Leu 


Thr 

X 11X 


Arg 


Glu 


Asn 


Pro 


Asn 


Arg 




*>U 










55 




















rue 


vai 


oer 


Arg 


riO 


He 


Vai 


His 


Glu 


Asp 


Leu 


Trp 




L Jfcr l_ 


Tyr 


Lys 


DO 










70 




















80 


Lys 


bill 


Vol 


Ala 
nlo 




Phe 


Trp 


Thr 


Leu 


UJ.U 


f51 n 
Ul u 


Tip 

IXC 




Phe 


Glu 


Arg 










ft ^ 










qn 














Asp 


Pro 


Lys 


Asp 


irp 


Glu 


Lys 


Leu 


Thr 


win 


Asp 


uXu 


Lys 


Asp 


Php 
rile 


Tip 

11C 


















105 










1 1 n 

X X \J 






Leu 


bin 


lie 


Leu 


Ala 


Phe 


Phe 


Ala 


Ser 


Cor 


Asp 


vjx _y 


Tip 

X Xti 


Vai 


He 


Glu 






1 1 c 

113 










120 










1 9^ 








Asn 


Leu 


i nr 


inr 


Arg 


Leu 


Arg 


Gin 


Vai 


nla 


bill 


Tip 
1 lt= 


r> f-r^i 
nu 


rn ii 


Ala 


Arg 














135 










1 4f| 
14U 










Ssr 


DKa 

rue 


fne 


Asp 




Gin 


Vai 


Gly Met 


/tin 


O t5 X 


Tip 

1 Its 


uio 
ii X o 


ox y 


Asn 


Vai 


IXC 










150 










1 EC 

1 DO 










i fin 

1 DU 


Tyr 


L»iy 


olU 


Leu 


Ti- 
ne 


Asp Arg 


Leu 


Vai 


Pro 


Asp 


bill 


Lys 


Asp 


x*ys 


a l a 

Ala 










1 DO 










l f\j 










I/O 




lie 


Leu 


rne 


Asn 


Aid 


Ala 


Gin 


His 


Phe 


pro 


nio 


1 16 


Lys 


Lys 


Lys 


CZ 1 ii 
bxu 








loU 










185 










t on 






Gin 


Trp 


Ala 


l le 


Asn 


Trp 


Met 


Gin 


Ser 


Asn 


Asn 


Asp 


Leu 


Aid 


ii 

OlU 


Leu 






19o 










200 










o n c 

zuo 








lie 


vai 


Ala 


Phe 


Ala 


Ala 


Vai 


Glu Gly 


lie 


pne 


pne 


i>er 




a 1 n 


rne 




2. 10 










215 










*? *> a 










Ala 


O _ 

ser 


i le 


Pne 


Trp 


He 


Lys 


Asn 


Arg 


Liiy 


i ie 


Leu 


pro 


biy 


Leu 


l nr 


225 










230 










o c 










z 4 u 


Ser 


Ser 


Asn 


Glu 


Pne 


He 


Ser 


Arg Asp 


blU 


La±y 


Leu 


nls 


Arg 


Asp 


pne 










Z4o 










oc n 
*£OU 










^ oo 




Ala 


Cys 


Met 


Leu 


Leu 


Lys 


Lys 


Gly Phe 


vai 


Asp 


i nr 


Pro 


ber 


Arg 


blU 








o c 










265 










9 "7 fl 






Arg 


lie 


T 1 n 

i le 


vai 


i nr 


Glu 


Ala 


Vai 


Arg 


Tin 

ne 


olu 


r*l n 
bin 


bill 




Leu 


TH r- 
X 11X 






£. IZ> 










280 










ZOO 








Va 1 
v ax 


Ser 


Leu 


Pro 


Vai 


Lys 


Leu 


Vai 


Gly 


Met 


Asn 


Cys 


Lys 


Leu 


Met 


Ser 




290 










295 










300 










Gin 


Tyr 


lie 


Glu 


Phe 


Vai 


Ala 


Asp Lys 


Leu 


Leu 


Vai 


Glu 


Met 


Gly 


Leu 


305 










310 










315 










320 


Glu 


Lys 


His 


Tyr 


Asn 


Vai 


Thr 


Asn 


Pro 


Phe 


Pro 


Phe 


Met 


Asp 


Asn 


He 










325 










330 










335 




Ser 


Leu 


Glu 


Asn 


Lys 


Thr 


Asn 


Phe 


Phe 


Glu 


Lys 


Arg 


Vai 


Ala 


Glu 


Tyr 








340 










345 










350 






Gin 


Arg 


Ala 


Gin 


Vai 


Met 


Ala 


Ser 


He 


Asn 


Lys 


He 


Lys 


Lys 


Asp 


Gin 






355 










360 










365 








Gin 


Thr 


Gin 


Glu 


Thr 


Gly 


Ser 


Pro 


Leu 


Pro 


He 


Leu 


Thr 


Ala 


Pro 


Pro 




370 










375 










380 










Pro 


Vai 


Ser 


Ser 


Ser 


Ser 


Ser 


Glu 


Gin 


Glu 


Asp 


Vai 


Glu 


Asp 


Gly 


val 


385 










390 










395 










400 


Gly 


Asp 


Tyr 


He 


Ser 


Tyr 


Asp 


Asp 


Phe 

















405 



<210> 250 
<211> 915 
<212> DNA 
<213> SHRIMP 
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<400> 250 

atgaatttac 

tctaccatct 

gaagaggtgg 

ggtgtggatc 

aaaggtcctt 

gaggattgta 

gtcttttccc 

aaccctggta 

aaatactctt 

tatcaagaga 

gctcgaggcc 

tttgctctag 

attgaaaatc 

taccaacaca 

gccctctaca 

atgtcttgga 



ttccaatatt 
gccttttacc 
tataccctac 
ttt tgaccct 
gtccaaggat 
aagggtttag 
tttgttgggt 
tagtaactga 
ctaaaatttt 
ggattaagat 
acttggctcc 
agaatgcagt 
gtgcaagaac 
agaagaagga 
agattgtgta 
aataa 



cctgacaacc 
tgatgggaag 
agatgtgtgt 
ctgtatagga 
caataacgct 
aaaggggttc 
agacagagat 
tgatgaagat 
tggtaaagga 
attaaagtct 
ggctggagat 
acctcaaata 
tacgccaggt 
atatctagac 
cgacaagaat 



ttttttgttg 
aaacaaccct 
gggccaaagg 
ggtaaaaaca 
gtcgttgaac 
cgaattcctg 
atgcacgcca 
ttggtagatt 
ttcaatccga 
cattttaaca 
tttttcctcg 
cagaaccata 
gccgcgtggg 
aagaagaaga 
aacaaggaat 



cggtagatgc 
tggtttttga 
gagctggcga 
atggaggtga 
gagattactc 
gcactgacca 
agtgggtgcg 
ctggtattag 
gacctcttta 
agaggacggg 
cttcagagag 
acaatggtga 
ctgagactgg 
agtacatccc 
tgttccgtgt 



atgctcttgc 
ttctgtatta 
attattcact 
atggtcagga 
ccttgacgag 
ttttcatact 
caacaaaata 
gactaaattt 
ctccctcgac 
taatttcttt 
atgggcaact 
atggaaagat 
accaatattt 
tatccctcat 
acagagtgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

915 



<210> 251 
<211> 298 
<212> PRT 
<213> SHRIMP 



<400> 251 



Met 


Asn 


Leu 


Leu 


Pro 


lie 


Phe 


Leu 


Thr 


Thr 


Phe 


Phe 


Val 


Ala 


Val 


Asp 


1 








5 










10 










15 




Ala 


Cys 


Ser 


Cys 


Ser 


Thr 


He 


Cys 


Leu 


Leu 


Pro 


Asp 


Gly 


Lys 


Lys 


Gin 








20 










25 










30 






Pro 


Leu 


Val 


Phe 


Asp 


Ser 


Val 


Leu 


Glu 


Glu 


Val 


Val 


Tyr 


Pro 


Thr 


Asp 






35 










40 










45 








Val 


Cys 


Gly 


Pro 


Lys 


Gly 


Ala 


Gly 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Asp 


Leu 




50 










55 










60 










Leu 


Thr 


Leu 


Cys 


lie 


Gly 


Gly 


Lys 


Asn 


Asn 


Gly 


Gly 


Glu 


Trp 


Ser 


Gly 


65 










70 










75 










80 


Lys 


Gly 


Pro 


Cys 


Pro 


Arg 


He 


Asn 


Asn 


Ala 


Val 


Val 


Glu 


Arg 


Asp 


Tyr 










85 










90 










95 




Ser 


Leu 


Asp 


Glu 


Glu 


Asp 


Cys 


Lys 


Gly 


Phe 


Arg 


Lys 


Gly 


Phe 


Arg 


He 








100 










105 










110 






Pro 


Gly 


Thr 


Asp 


His 


Phe 


His 


Thr 


Val 


Phe 


Ser 


Leu 


Cys 


Trp 


Val 


Asp 






115 










120 










125 








Arg 


Asp 


Met 


His 


Ala 


Lys 


Trp 


Val 


Arg 


Asn 


Lys 


He 


Asn 


Pro 


Gly 


He 




130 










135 










140 










Val 


Thr 


Asp 


Asp 


Glu 


Asp 


Leu 


Val 


Asp 


Ser 


Gly 


He 


Arg 


Thr 


Lys 


Phe 


145 










150 










155 










160 


Lys 


Tyr 


Ser 


Ser 


Lys 


He 


Phe 


Gly 


Lys 


Gly 


Phe 


Asn 


Pro 


Arg 


Pro 


Lys 










165 










170 










175 




Leu 


Asp 


Tyr 


Gin 


Glu 


Arg 


He 


Lys 


He 


Leu 


Lys 


Ser 


His 


Phe 


Asn 


Lys 








180 










185 










190 






Arg 


Thr 


Gly 


Asn 


Phe 


Phe 


Arg 


His 


Leu 


Ala 


Pro 


Ala 


Gly 


Asp 


Phe 


Phe 






195 










200 










205 








Leu 


Ala 


Ser 


Trp 


Ala 


Thr 


Phe 


Ala 


Leu 


Glu 


Asn 


Ala 


Val 


Pro 


Gin 


He 




210 










215 










220 










Gin 


Asn 


His 


Asn 


Asn 


Gly 


Glu 


Trp 


Lys 


Asp 


He 


Glu 


Asn 


Arg 


Ala 


Arg 


225 










230 










235 










240 


Thr 


Thr 


Pro 


Gly 


Ala 


Ala 


Trp 


Ala 


Glu 


Thr 


Gly 


Pro 


He 


Phe 


Tyr 


Gin 










245 










250 










255 




His 


Lys 


Lys 


Lys 


Glu 


Tyr 


Leu 


Asp 


Lys 


Lys 


Lys 


Lys 


Tyr 


He 


Pro 


He 








260 










265 










270 






Pro 


His 


Ala 


Leu 


Tyr 


Lys 


He 


Val 


Tyr 


Asp 


Lys 


Asn 


Asn 


Lys 


Glu 


Leu 



275 



280 



285 
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Phe Arg Val Gin Ser Asp Met Ser Trp Lys 
290 295 



<210> 252 
<211> 789 
<212> DNA 
<213> SHRIMP 



<400> 252 

atgtcgttgg 

acctctgtat 

tacataattt 

ttcgatgaag 

tgcgcacaga 

atgggggaag 

gttaatttta 

gatactacta 

ccttttttca 

catgataacc 

cataaagctc 

actctcgatt 

gactatattg 

ctccgttga 



ctgtgacaga 
atcacacccc 
ctcgtcgatt 
aaacaggagc 
aaggaggaat 
taacgcgcct 
aacatgggca 
ctgttactcc 
cacgcactct 
caaaaagatc 
cttctgttcc 
gttcaatgga 
ctaacgaccc 



agattacggg 
tctattaggt 
catgaactac 
agttatagct 
atatgatgca 
atttactatt 
gacaaataat 
tcctgttact 
catatccgag 
ttcatttaag 
taccttcccc 
gattttttca 
cgtaccattt 



cacaatgaaa 
gcagaccatg 
acaaatttat 
aata tctgtc 
ccagaagatg 
ataggaggta 
cctgcctatg 
cctcctccat 
tcgtcttcag 
gtgtatgata 
cctaaaacct 
cgagacaggg 
ttagtggatg 



agttgatcaa 
taatgaaatc 
taaaacaagt 
tgttaaaaat 
ttgcattctt 
ggcccaatat 
gttatctcac 
ctccagctgc 
ttgaccatta 
ttcacgcaga 
cgtttgaaat 
atgttttaga 
ttgtgcaccg 



acggttacaa 
catatcagac 
tgaatatgtt 
cctagaaaga 
caattctaag 
gacggtgcgg 
agatgataat 
aagaagatcc 
tgtattgatg 
aacctttccc 
ttctgacgtg 
caatgttcac 
tggatctagt 



<210> 253 
<211> 260 
<212> PRT 
<213> SHRIMP 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

789 



<400> 253 



Met 


Ser 


Leu 


Ala 


Val 


Thr 


Glu 


Asp 


Tyr 


Gly 


His 


Asn 


Glu 


Lys 


Leu 


He 


1 








5 










10 










15 




Lys 


Arg 


Leu 


Gin 


Thr 


Ser 


Val 


Tyr 


His 


Thr 


Pro 


Leu 


Leu 


Gly 


Ala 


Asp 




20 










25 










30 






His 


Val 


Met 


Lys 


Ser 


He 


Ser 


Asp 


Tyr 


He 


He 


Ser 


Arg 


Arg 


Phe 


Met 






35 










40 










45 








Asn 


Tyr 


Thr 


Asn 


Leu 


Leu 


Lys 


Gin 


Val 


Glu 


Tyr 


Val 


Phe 


Asp 


Glu 


Glu 




50 










55 










60 










Thr 


Gly 


Ala 


Val 


He 


Ala 


Asn 


He 


Cys 


Leu 


Leu 


Lys 


He 


Arg 


Cys 


Ala 


65 










70 










75 










80 


Gin 


Lys 


Gly 


Gly 


He 


Tyr 


Asp 


Ala 


Pro 


Glu 


Asp 


Val 


Ala 


Phe 


Phe 


Asn 










85 










90 










95 




Ser 


Lys 


Met 


Gly 


Glu 


Val 


Thr 


Arg 


Leu 


Phe 


Thr 


He 


He 


Gly 


Gly 


Arg 








100 










105 










110 






Pro 


Asn 


Met 


Thr 


Val 


Arg 


Val 


Asn 


Phe 


Lys 


His 


Gly 


Gin 


Thr 


Asn 


Asn 






115 










120 










125 








Pro 


Ala 


Tyr 


Gly 


Tyr 


Leu 


Thr 


Asp 


Asp 


Asn 


Asp 


Thr 


Thr 


Thr 


Val 


Thr 




130 










135 










140 










Pro 


Pro 


Val 


Thr 


Pro 


Pro 


Pro 


Ser 


Pro 


Ala 


Ala 


Arg 


Arg 


Ser 


Pro 


Phe 


145 










150 










155 










160 


Phe 


Thr 


Arg 


Thr 


Leu 


He 


Ser 


Glu 


Ser 


Ser 


Ser 


Val 


Asp 


His 


Tyr 


Val 








165 










170 










175 




Leu 


Met 


His 


Asp 


Asn 


Pro 


Lys 


Arg 


Ser 


Ser 


Phe 


Lys 


Val 


Tyr 


Asp 


He 








180 










185 










190 






His 


Ala 


Glu 


Thr 


Phe 


Pro 


His 


Lys 


Ala 


Pro 


Ser 


Val 


Pro 


Thr 


Phe 


Pro 






195 










200 










205 








Pro 


Lys 


Thr 


Ser 


Phe 


Glu 


He 


Ser 


Asp 


Val 


Thr 


Leu 


Asp 


Cys 


Ser 


Met 




210 










215 










220 










Glu 


lie 


Phe 


Ser 


Arg 


Asp 


Arg 


Asp 


Val 


Leu 


Asp 


Asn 


Val 


His 


Asp 


Tyr 


225 










230 










235 










240 
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lie Ala Asn Asp Pro Val Pro Phe L u Val Asp Val Val His Arg Gly 
245 250 255 

Ser Ser Leu Arg 
260 



<210> 254 
<211> 2571 
<212> DNA 
<213> SHRIMP 



<400> 254 

atggcatcca 

ccacaaccca 

gctgtggata 

ctttctagta 

aaacatcttc 

cctattgaaa 

atatcattat 

tccaacgcgt 

cgtaaaaagt 

catggtcctc 

aacaggctgg 

ataattaata 

atatcgaggg 

gacagtaatc 

aatacatttt 

ttagtgcata 

tcattatcgg 

aaggtgattt 

tattctaagg 

acgtttttgc 

attatatgta 

ctccctctct 

tttagagatt 

cagtctgctt 

gatgatgatg 

cccttaataa 

agagacaagc 

aatacactct 

gagaatgttt 

ggaacattat 

acaacacctt 

ggtct ttaca 

gagtttgtcc 

tcctacgata 

tacgaaatag 

actatgaaca 

caaaagaata 

tctgaatgtg 

tgcggtcatt 

aatggcagcc 

ctcggaaaaa 

aaaaatgtcg 

agatatgaat 



cttcttcatc 
aaaagaaaaa 
aaaaggtggt 
ccgaactgca 
tagacgaagg 
caatatacga 
caatgttgat 
gcgttcaaaa 
atctgatatt 
tttctcacat 
tgcgttactt 
gtgtcttgtg 
gtgttttcat 
ggccacaaaa 
catccagaat 
actttgatta 
acagcgtcaa 
ccgactggaa 
ggttgatgac 
gcgccttgaa 
tactgaaaaa 
tcaaacactt 
acagtctcat 
cacttttcat 
aaggaggaaa 
gccaacttat 
actttactcc 
tgtgccttat 
tgcatatagc 
ccagcgaaca 
taatgatcgc 
agcccaaaat 
tcttaaaggg 
ttaacatcat 
cagcagcagg 
cgtggaactt 
ggcagaaatt 
caatatgctt 
gcttccacaa 
gcgcaagagg 
gactaggggt 
tgccttcggt 
ttcttgtagg 



aactaagaaa 
atcaaagaaa 
aaatctcata 
aatgataact 
agaatttaac 
aatactctct 
ccacatactt 
tgttttgggg 
ggaagactta 
gctccattct 
tagcacgtat 
gggaattgat 
tgtgtcgtac 
tactccttca 
ctcagtgtac 
cggttccaaa 
cacttttgtg 
aatgccttat 
attatcagtg 
ggaaagggaa 
gtctttgttc 
tttaaaaatt 
gtgcaaaact 
ggaagaattc 
gggacatacc 
ttcacaccca 
cctcatgcac 
aataaatgga 
gattgaaaat 
aattgaaaaa 
cttggcgagg 
aaaggtccgt 
cctaaaggaa 
aaaggatgca 
actgcgaggc 
tttcaccaaa 
tgtaaagatt 
ggatagtctg 
cgtctgttgg 
aggaggaata 
tgccgactat 
agaagaagga 
tggagtgtgg 



agagtacatg 
gtactaccat 
cacaagatat 
gaatgtaatg 
cctactataa 
tcttctgctg 
ttcttcgctg 
aacgactata 
ctgaacggcg 
tcaatcccta 
gatggagatg 
aaaagtgttc 
gtaccgatgc 
caaaatatgt 
agaaacaata 
gatgcatcat 
aggttgtaca 
gggttcttta 
tctgaatata 
attttaatct 
ggatttaatt 
gaagaggtta 
caaaaagatt 
ccttggcttg 
ctattaacat 
attttaaaat 
ctcgccaaca 
gctaaaccag 
gttaactatg 
atggtcaatg 
gagaatattg 
ttcggttctt 
tcagttgcat 
gtaatggaca 
aataactgcg 
aattcaacca 
gtggatggta 
gacggggatc 
ttatccttga 
aaatgcccgt 
gatattgaaa 
agaagggaat 
acaaatgaaa 



aagaagatga 
ttcctgttga 
tagatcaaga 
gtgcgcgaga 
ttgaagtagt 
acgacaagaa 
ataagggtac 
aagtggaatt 
tttcaaatca 
ttgtacaaga 
ctcaattcga 
tcaacgaatt 
gtgtacgaac 
cagcactagg 
cctttaaaaa 
catcatctcc 
ccaactatga 
agaaaacttt 
cactcaaaaa 
ataaaatgga 
tcaggtgttt 
aacatatagc 
tgcagagttt 
caaaaacttg 
ttgctatagt 
cgttagtgaa 
cgtctataat 
agttcataaa 
gagtcatcac 
taagaagaat 
tactcgctca 
caaagaggct 
atttggaaac 
acagtctttt 
accctgaagc 
aatgggcaag 
tgaataggac 
ttccttcagg 
taaggatgag 
cctgcagaca 
cagaggaaga 
ggaggaagat 
taaaactata 



aaatctcatc 
caagtataga 
aaaggaccac 
agatctgctt 
atcatccatg 
gtttgtacag 
tatgtgggta 
tgaaaatata 
ttggtctgaa 
catgttattg 
tatatcattc 
gacacaattg 
accttcaaag 
tatgaaactc 
actaaccgag 
tcctcctcct 
catattctta 
tgacgtcctt 
agagttggtt 
gaagagagac 
aaaacaatta 
acgttttgtc 
ccctgccata 
gatcgacgac 
gcacagatat 
tactacatgt 
gtaccaatgc 
caagttcaac 
tgaattgaga 
gatggataat 
gctttttgac 
aaggatacca 
gaggaatata 
tgaagaggag 
agacgagaag 
ctctattttc 
atatgaagac 
gagaacaacg 
cgggccgaat 
agtcacctgc 
acgtgacacg 
tggtgttgac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2571 



<210> 255 
<211> 846 
<212> PRT 
<213> SHRIMP 
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<400> 255 



Met 


Ala 


Ser 


Thr 


1 








G±U 


Asn 


Leu 


lie 








20 


Pro 


Phe 


Pro 


Val 






35 




Leu 


lie 


His 


Lys 




50 






Glu 


Leu 


Gin 


Met 


65 








Lys 


His 


Leu 


Leu 


Val 


Ser 


Ser 


Met 








100 


Ala 


Asp 


Asp 


Lys 






115 




lie 


Leu 


Phe 


r*» W j~ 

Pne 




130 






Val 


Gin 


Asn 


Val 


145 








Arg 


Lys 


Lys 


Tyr 


Ser 


Glu 


His 


Gly 








180 


Val 


Gin 


Asp 


Met 






195 




Asp 


Gly 


Asp 


Ala 




210 






Trp 


Gly 


He 


Asp 


225 








Arg 


Gly 


Val 


Phe 


Ser 


Lys 


Asp 


Ser 








260 


Ala 


Leu 


Gly 


Met 






275 




Arg 


Asn 


Asn 


Thr 




290 






Tyr 


Gly 


Ser 


Lys 


305 








Ser 


Asp 


Ser 


Val 


Phe 


Leu 


Lys 


Val 








340 


Lys 


Thr 


Phe 


Asp 






355 




Tyr 


Thr 


Leu 


Lys 




370 






Arg 


Glu 


He 


Leu 


385 








Leu 


Lys 


Lys 


Ser 


Leu 


Pro 


Leu 


Phe 








420 


Ala 


Arg 


Phe 


Val 






435 




Asp 


Leu 


Gin 


Ser 




450 






Glu 


Phe 


Pro 


Trp 


465 









Ser Ser Ser Thr 
5 

Pro Gin Pro Lys 

Asp Lys Tyr Arg 
40 

He Leu Asp Gin 
55 

He Thr Glu Cys 
70 

Asp Glu Gly Glu 
85 

Pro He Glu Thr 

Lys Phe Val Gin 
120 

Ala Asp Lys Gly 
135 

Leu Gly Asn Asp 
150 

Leu He Asp Leu 
165 

Pro Leu Ser His 

Leu Leu Asn Arg 
200 

Gin Phe Asp He 
215 

Lys Ser Val Leu 
230 

He Val Ser Tyr 
245 

Asn Arg Pro Gin 

Lys Leu Asn Thr 
280 

Phe Lys Lys Leu 
295 

Asp Ala Ser Ser 
310 

Asn Thr Phe Val 
325 

He Ser Asp Trp 

Val Lys Lys Gly 
360 

Lys Glu Leu Val 
375 

He Tyr Lys Met 
390 

Leu Phe Gly Phe 
405 

Lys His Phe Leu 

Phe Arg Asp Tyr 
440 

Phe Pro Ala He 
455 

Leu Ala Lys Thr 
470 



Lys 


Lys 


Arg 


Vdl 




1 U 






Lys 


Lys 


Lys 


Ser 


25 








TV 1 

a j. a 


val 


Asp 


Lys 


(aJ-U 


Lys 


Asp 


MIS 








60 


Asn 


Gly 


Ala 


Arg 






75 




Phe 


Asn 


Pro 


Thr 




90 






He 


Tyr 


Glu 


He 


105 








He 


Ser 


Leu 


Ser 


Thr 


Met 


Trp 


Val 








140 


Tyr 


Lys 


Val 


Glu 






155 




Leu 


Asn 


Gly Val 




170 






Met 


Leu 


His 


Ser 


185 








Leu 


Val 


Arg 


Tyr 


Ser 


Phe 


He 


He 








220 


Asn 


Glu 


Leu 


Thr 






235 




Val 


Pro 


Met 


Arg 




250 






Asn 


Thr 


Pro 


Ser 


265 








Phe 


Ser 


Ser 


Arg 


Thr 


Glu 


Leu 


Val 








300 


Ser 


Ser 


Pro 


Pro 






315 




Arg 


Leu 


Tyr 


Thr 




330 






Lys 


Met 


Pro 


Tyr 


345 








Leu 


Met 


Thr 


Leu 


Thr 


Phe 


Leu 


Arg 








380 


Glu 


Lys 


Arg 


Asp 






395 




Asn 


Phe 


Arg 


Cys 




410 






Lys 


He 


Glu 


Glu 


425 








Ser 


Leu 


Met 


Cys 


Gin 


Ser 


Ala 


Ser 








460 


Trp 


lie 


Asp 


Asp 






475 





His Glu Glu Asp 
15 

Lys Lys Va 1 Leu 
30 

Lys Val Val Asn 
45 

Leu Ser Ser Thr 

Glu Asp Leu Leu 
80 

He He Glu Val 
95 

Leu Ser Ser Ser 
110 

Met Leu He His 
125 

Ser Asn Ala Cys 

Phe Glu Asn He 
160 

Ser Asn His Trp 
175 

Ser He Pro He 
190 

Phe Ser Thr Tyr 
205 

Asn Ser Val Leu 

Gin Leu He Ser 
240 

Val Arg Thr Pro 

255 

Gin Asn Met Ser 
270 

He Ser Val Tyr 
285 

His Asn Phe Asp 

Pro Pro Ser Leu 
320 

Asn Tyr Asp He 
335 

Gly Phe Phe Lys 
350 

Ser Val Ser Glu 
365 

Ala Leu Lys Glu 

He He Cys He 
400 

Leu Lys Gin Leu 
415 

Val Lys His He 
430 

Lys Thr Gin Lys 
445 

Leu Phe Met Glu 

Asp Asp Asp Glu 
480 
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Gly 


oiy 


Lys 


C 1 w 

Cjiy 


nlS 


lnr 


Leu 


Leu 










HOD 








Pro 


Leu 


J. ie 


O ~ 


v?in 


Leu 


i ie 


Ser 








jUU 










Asn 


Tnr 


Tnr 


Cys 


Arg 


Asp 


Lys 


nis 
















^*?n 


Asn 


Thr 


Ser 


i le 


Met 


-Tyr 


tjin 


Cys 














jjj 




Asn 


Gly 


Ala 


Lys 


pro 


tjlU 


pne 


lie 


545 
















His 


lie 


Ala 


He 


Glu 


Asn 


vai 


Asn 


















Gly 


Thr 


Leu 


Ser 


Ser 


Glu 


Cain 


Ti- 
ne 






Do U 










Met 


Met 


Asp 


Asn 


inr 


Tnr 


Pro 


Leu 






595 










bUU 


Leu 


Ala 


Gin 


Leu 


pne 


Asp 


Gly 


Lys 




blU 










CI c 

bio 




Ser 


Ser 


Lys 


Arg 


Leu 


Arg 


lie 


Pro 


















Lys 


Glu 


Ser 


17-. 1 

val 


TV 1 

Ala 


Tyr 


Leu 


CalU 










Of* j 








Asn 


lie 


lie 


Lys 


Asp 


Ala 


vai 


Met 








dan 
bou 










Tyr 


Glu 


He 


Ala 


Ala 


Ala 


Gly 


Leu 
















con 


Ala 


Asp 


Glu 


Lys 


Thr 


Met 


Asn 


Thr 


















Thr 


Lys 


Trp 


Ala 


Ser 


Ser 


He 


Phe 


705 










-11 A 

710 






Lys 


He 


Val 


Asp 


Gly 


Met 


Asn 


Arg 










f <ZD 








lie 


Cys 


Leu 


Asp 


Ser 


Leu 


Asp 


Gly 








74 0 










Cys 


Gly 


His 


Cys 


Phe 


His 


Asn 


Val 






755 










760 


Ser 


Gly 


Pro 


Asn 


Asn 


Gly 


Ser 


Arg 




770 










775 




Cys 


Arg 


Gin 


Val 


Thr 


Cys 


Leu 


Gly 


785 










790 






Asp 


He 


Glu 


Thr 


Glu 


Glu 


Glu 


Arg 










805 








val 


Glu 


Glu 


Gly 


Arg 


Arg 


Glu 


Trp 








820 










Glu 


Phe 


Leu 


Val 


Gly 


Gly 


Val 


Trp 






835 










840 



X III. 


£r Jlt= 




Tip 
X 1C 


Val 


His 


Arg 


xy t 




490 










495 




Hid 
no. o 


Pro 


Tip 

lie 


- 

Leu 






Leu 


vox 












510 






Xr lie 


Thr 

X I IX. 


Pro 


l^eu 


Met 


His 


Leu 


Ala 










525 








Asn 


inr 


lieu 


Leu 


Uys 


T 

Leu 


Tip 


Tip 


















Asn 


Lys 


pne 


Asn 


bill 


Asn 


vol 


Leu 
















560 


Tyr 


uiy 


vai 


lie 


inr 


ulu 


lieu 


Arg 




O f u 










3 / D 






Lys 


ncT. 


vai 


Asn 


vai 


Arg 


Arg 


-/ O J 










•J -7U 






net 


lie 




Arg 


olU 


Asn 


Tic* 

lie 


vai 










DUO 








pro 




116 


Lys 


vai 


Arg 


IT lie 


oiy 








O £. \3 










IjIU 


pne 


vai 


Leu 


Leu 


Lys 


diy 


Leu 
















640 


inr 


Arg 


Asn 


lie 


Ser 


Tyr 


Asp 


lie 


















Asp 


Asn 


OCX 


Leu 


Phe 


Glu 


blU 


ulu 


D Oj 










670 






Arg 


Csiy 


Asn 


Asn 


Cys 


Asp 


pro 


C1 1 ti 
OlU 










685 








Trp 


Asn 


pne 


pne 


Thr 


Lys 


Asn 


Cor 

t>er 








■?nn 










Cain 


Lys 


Asn 


Arg 


Gin 


Lys 


pne 


vai 






/ID 












Tnr 


Tyr 


GlU 


ASp 


Ser 


Glu 


Cys 


7V 1 — > 

Ala 




/ JU 














Asp 


Leu 


Pro 


Ser 


Gly Arg 


Tnr 


Tnr 












750 






Cys 


Trp 


Leu 


Ser 


Leu 


He 


Arg 


Met 










765 








Arg 


Gly 


Gly 


He 


Lys 


Cys 


Pro 


Ser 








780 










Lys 


Arg 


Leu 


Gly 


Val 


Ala 


Asp 


Tyr 






795 










800 


Asp 


Thr 


Lys 


Asn 


Val 


Val 


Pro 


Ser 




810 










815 




Arg 


Lys 


He 


Gly 


Val 


Asp 


Arg 


Tyr 


825 










830 






Thr 


Asn 


Glu 


He 


Lys 


Leu 







845 



<210> 256 
<211> 930 
<212> DNA 
<213> SHRIMP 



<400> 256 

atggacaatc 

gteggctget 

gacagaacta 

aggaaeggga 

aaggacctgg 

acccctcaag 

cccaggccag 



ttatcaccaa 
ccatgacta t 
etacttgeat 
gcaacaccga 
acatctctga 
tcacaaaact 
ttgecaatae 



cgacaacatt 
tgggctcgea 
ttcttgetea 
atccagtttc 
acccatgctc 
ggtaacattt 
acctcaacaa 



atcctcgtca 
ctggccatga 
ccgtgggaga 
atcagccacg 
aaatctacca 
tctggtccaa 
caaccaacaa 



ctttcctgag 
acatgetegt 
agaataagaa 
tccggttcaa 
cttacgatct 
ectatgetag 
gtacaaataa 



tggattggca 60 
gaagtgcatc 120 
taagaagaac 180 
cactccagat 240 
ggccaatgtt 300 
tccgcctaca 360 
agaggaagaa 420 
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agtgtctata tgccaatgtc gagctgctcg tcgtcatttt cttctgacaa tagtcttcct 4 80 

ctgccaacac cgccgccatc tccacctaga agcaatggcg gtgattacgt gtcatatgta 540 

aacggacgac atctgaagct tccttcaaac ccaccttctc ccatcttcaa tatcaagaat 600 

gaggagggag aggatgataa tgtggaagaa catgtctacg aatacgtgcc agaagtacct 660 

caacaatctc catctatcca gaagtgtatc caggaattga aggagatgaa acacaagaaa 720 

aacaccctaa ccaggagcag tagtaacaac aacaacaatg ctccacgtat aacccaagtt 780 

acgtttaaga aattcccacc taacaataat aacatgtggg agaatcatgt gtatggaaac 840 

actacaattg tgtcttccac accttctcct acctttattc cttcacctaa aagtatcata 900 

aggaaattgt catttaagag gaaacaataa 930 

<210> 257 
<211> 305 
<212> PRT 
<213> SHRIMP 



<400> 257 



Met 


Asp 


Asn 


Leu 


He 


Thr 


Asn 


Asp 


Asn 


He 


He 


Leu 


Val 


Thr 


Phe 


Leu 


1 








D 










10 










15 




Ser 


Gly 


Leu 


Ala 


Val 


Gly 


Cys 


Ser 


Met 


Thr 


He 


Gly 


Leu 


Ala 


Met 


Asn 








20 










25 










30 






Met 


Leu 


Val 


Lys 


Cys 


He 


Asp 


Arg 


Thr 


Thr 


Thr 


Cys 


He 


Ser 


Cys 


Ser 






35 










40 










45 








Pro 


Trp 


Glu 


Lys 


Asn 


Lys 


Asn 


Lys 


Lys 


Asn 


Arg 


Asn 


Gly 


Ser 


Asn 


Thr 




50 










55 










60 










Glu 


Ser 


Ser 


Phe 


He 


Ser 


His 


Val 


Arg 


Phe 


Asn 


Thr 


Pro 


Asp 


Lys 


Asp 


65 










70 










75 










oO 


Leu 


Asp 


He 


Ser 


Glu 


Pro 


Met 


Leu 


Lys 


Ser 


Thr 


Thr 


Tyr 


Asp 


Leu 


Ala 










85 










90 










95 




Asn 


Val 


Thr 


Pro 


Gin 


Val 


Thr 


Lys 


Leu 


Val 


Thr 


Phe 


Ser 


Gly 


Pro 


Thr 








100 










105 










110 






Tyr 


Asp 


Pro 


Thr 


Pro 


Arg 


Pro 


Val 


Ala 


Asn 


Thr 


Pro 


Gin 


Gin 


Gin 


Pro 






115 










120 










125 








Thr 


Ser 


Thr 


Asn 


Lys 


Glu 


Glu 


Glu 


Ser 


Val 


Tyr 


Met 


Pro 


Met 


Ser 


Ser 




130 










135 










140 










Cys 


Ser 


Ser 


Ser 


Phe 


Ser 


Ser 


Asp 


Asn 


Ser 


Leu 


Pro 


Leu 


Pro 


Thr 


Pro 


145 










150 










155 










160 


Pro 


Pro 


Ser 


Pro 


Pro 


Arg 


Ser 


Asn 


Gly 


Gly 


Asp 


Tyr 


Val 


Ser 


Tyr 


Val 










165 










170 










175 




Asn 


Gly 


Arg 


His 


Leu 


Lys 


Leu 


Pro 


Ser 


Asn 


Pro 


Pro 


Ser 


Pro 


He 


Phe 








180 










185 










190 






Asn 


He 


Lys 


Asn 


Glu 


Glu 


Gly 


Glu 


Asp 


Asp 


Asn 


Val 


Glu 


Glu 


His 


Val 






195 










200 










205 








Tyr 


Glu 


Tyr 


Val 


Pro 


Glu 


Val 


Pro 


Gin 


Gin 


Ser 


Pro 


Ser 


He 


Gin 


Lys 




210 










215 










220 










Cys 


He 


Gin 


Glu 


Leu 


Lys 


Glu 


Met 


Lys 


His 


Lys 


Lys 


Asn 


Thr 


Leu 


Thr 


225 










230 










235 










240 


Arg 


Ser 


Ser 


Ser 


Asn 


Asn 


Asn 


Asn 


Asn 


Ala 


Pro 


Arg 


He 


Thr 


Gin 


Val 










245 










250 










255 




Thr 


Phe 


Lys 


Lys 


Phe 


Pro 


Pro 


Asn 


Asn 


Asn 


Asn 


Met 


Trp 


Glu 


Asn 


His 








260 










265 










270 






Val 


Tyr 


Gly Asn 


Thr 


Thr 


He 


Val 


Ser 


Ser 


Thr 


Pro 


Ser 


Pro 


Thr 


Phe 






275 










280 










285 








lie 


Pro 


Ser 


Pro 


Lys 


Ser 


He 


He 


Arg 


Lys 


Leu 


Ser 


Phe 


Lys 


Arg 


Lys 




290 










295 










300 











Gin 

305 



<210> 258 
<211> 549 
<212> DNA 



WO 01/38351 



351 



PCT/US00/28888 



<213> SHRIMP 



<40O> 258 

atgggagaaa gtattttcga tgctgtaagt 
tccagaaaca agaagttatt gcgggaactc 
tttctccagt gtcggatgat agattttcac 
tgtcattctg taaacgtgcc agatgttgtg 
gaagaggacc gtgccaataa ccccgggcta 
gtcgaacaag gtgaattatg catcatcaac 
gtgtccttgc ataaggactt atttggtgaa 
agggaagaat ctcggtctat ccacctatat 
atcccaagac cagaagggac aaatactgtg 
acggtataa 



ctggcaacaa ataatcctaa aaagtcaaac 60 

aagaatatgc ggaaagattt tccttcaacc 120 

ttttctggtg acattataga taaacattac 180 

cctaatacaa tatttgcagt tttcttacct 240 

tacgattcta ttgaaggagt atgtataaca 300 

aagtcaagcg ttcacgagtt caatattctg 360 

gatatcttgg atggaataga aactgcatca 420 

ctggaggctg ggcagagtat cagaacccca 4 80 

aactacacta tcgttttttc taaccaagtg 540 
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<210> 260 
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<212> DNA 
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<400> 260 

atggaagcgg 

gaggacgata 

gaggaaatga 

gacacgggag 

tttgtgagag 

gaggaagagg 

gaggaagaag 

gctgaccctg 

aaccagtgga 



catcaaatct 
tcaataataa 
acgaggatga 
ttagaaatgg 
atgtcactaa 
aagatgaaga 
aaggagatga 
cgttattggc 
aaagtatagt 



gcgtataact 
tgttgattat 
ggaggaggag 
aagaaataaa 
tgacatgtac 
gggagaagaa 
atatgaagat 
gctagcggca 
gaacaccccc 



gaaggtgcgg. 
agtaacttgt 
gaggaggagg 
gatcctccat 
gatgatgatg 
gggggtgaat 
gacaatgaag 
caacaagaag 
tcgccagtag 



gagtgttgga 
atgaagacga 
attacgaaga 
cttctaaaaa 
atgaagaaga 
atgacggaaa 
gtgaagggga 
atgcgacaat 
ggccaaatag 



tatcgacaac 
agaggatgag 
tgaagatgag 
acaaagtaaa 
agaggaagag 
tctagaagac 
agaagatgag 
tatacctgaa 
gcaagttctt 



60 
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240 

300 

360 

420 

480 
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cccatgttga actttttact tgaaaatgtg 
cagaagaaca aagaagatga taaccaacaa 
gaagaggaag aacaggagga ggaagaggaa 
gaaaaagaac ctatagaaca agaaaaaaat 
aatgaaagcg tgcactctca tagagttgaa 
gatggaatgg attatttctt ctcttcaata 
gaagaagagg atgaagaaga gggagaggaa 
gaagagcatg tagaaactaa agagtctgtt 
gaagaatatg aagaatatga agatgaatct 
actatgcatc aatttaacaa tgcgcccagg 
gagtgcgagg atgcagttgt gttccctcca 
ataaaggaac cttcaccaaa agcacctaga 
gaacaatatg accaactaaa cgatatagca 
cctccagaca atgaaatggg agaagaaagt 
atgcctcctc ctcctcctcc acctccacca 
accaatattc ttcttcctcc ccctccacct 
aacaacaaca atccatcctt tttgagtaca 
gggaaagagg ccgaagaaag attacacaag 
gtaaaaactc ttctagaaac aacaaagaac 
ttccaagatc cagaaaaccc agtaaaacca 
atcatcgctg cagaattaac catgaaggca 
aaatctgccg agctgttcag gaaaacaaat 
atcatgtcca accctctttt ttcagcagca 
gctagcaaaa taagaccatc gactcctgct 
gaattaaacg aggatgaaac ttccatgtct 
gacgagtcta ttgctggagg aggaggagga 
cattcttcat tttatagtaa tcaaactcaa 
ctgaaagaag acgacgataa tcaaccatgt 
ttcctaaaca acctcatatc ctttaaaaca 
aatatgcttt cggacatagt gaccaaggca 
gcccaagagg attttgaaaa acatcaaact 
tcatctaaac taaaacagat gagtaaagaa 
ggttctatag gggcagaaat ttgtttcggg 
aacaaactat gtatggacgt gggaaggtta 
ttaaattggc caaaagagtt cactttgtca 
atttctagtt gttcttctaa aatggctgtc 
tttgataagg cagtcgatga agaggatgaa 
agtgaagacg gcaacatgtt ttcagagaag 
gataatataa gatacgggga cagtaatgac 
gcgtattctg gagtgagtaa caataacagc 
ggcgggggaa ggtttggtgt agcatacagt 
cctatttttg ataatgcatt aaatacacta 
ctaagcgcag tagtgatcaa acttttaaag 
ataaaatata agctaaatca agcatctgag 
tctacatctg ttgtgcctat gcgaaactta 
tctccctcca ccccagcagc agcaacagca 
gttggtagga aaagaacgtt ttctttttca 
agcagtgtgt atattgacca agaatcttcc 
cttttgaata acaaatctag cgtcaattct 
acaaaatatt acaattcttc atccaattca 
taa 



aacgccatgg gcggatcagc aggtgaagaa 600 
atagaacccg tagaagagga agaggatgaa 660 
gaggaagaag aacaggagga ggaagaggaa 720 
gaacccgaga aggatgaaga tgcaatagaa 7 80 
tccagcccta tgagtgaagg aggtaatgat 840 
gcaggtggcg gcaatgataa tgaagaggat 900 
gaggaggagg aggaacctgc acaaaaatct 960 
caatctcaca cggaatatat tgaagaagag 1020 
cggcacacac tcgaggatga agaaataagt 1080 
gttcgaagat ctccccctcc agatatacaa 1140 
atcatgaaag agacagatat tctcccccag 1200 
atgttttcta ttcttggtag tggaggtgag 1260 
ccaccgcctg ttccatctat cgtgactttt 1320 
agagacatca tggaccaaga ttcgatgctg 1380 
ccacctcatc aacctcctca actcaaacct 1440 
ccacctacaa atcagtcctt attttcaaac 1500 
gtggttggaa aggttaacaa cacattaggt 1560 
actatggagt ctataatctt aaaaacaagg 1620 
cttcaatgct ctgaactggt gaaagttgtg 1680 
tccgagaaag taatggagcg cc ttaaaaat 17 40 
tttttagatt ctgctgcggt gacagacatt 1800 
gagaaattgg agttgtttca gaggaagcaa 1860 
tacgcgtcaa cttatataat gggggaaaga 1920 
ccttctctta aaaaggttga aagtatatct 1980 
tcatctgctg gtggagtatg tgctgaaggc 2040 
ggaggaggag gaggaggaga ggtggtggag 2100 
gcaaatttgc acatggaact gataaatata 2160 
caaacgtata agcttggtca aagactggcg 2220 
tcaagtgctg tttcttggtc tagattagtc 2280 
tcagtggcac tgttcggaga caccaacaaa 2340 
gaaacaaatg atgtatcaga tttgtctacc 24 00 
tctgcaaaca taatggaaga aatggggtta 24 60 
gcaatttcta caatcataga gaaacacatc 2520 
acaattttct tgaacatacc aattgtacta 2580 
aaagattata aagttcttct tttggatagt 2640 
cctcccattt acgtcttaaa cagtatacaa 2700 
gatggtaatg gaagtgaggc agagaaaaga 27 60 
gacaagaagg aagcaattcg tcgagtatac 2820 
cgtacatcat taaaccactt tttcggtgac 2880 
aagaatagta tgtttgatct ccagacacaa 2940 
gcaggctcat caatcatcga acacagatcc 3000 
gtcaatttca tggacaagag gaaacacttg 3060 
aaagctaaat tgtccattga agtgtactgc 3120 
aaatacaata aaaagggtaa gcacggaaaa 3180 
atgtaccgtc cttctaaaaa tcaagatgtt 3240 
atggacgtgc ctagcagtgt atcttctcat 3300 
aatgacatca attccaacat gagtagcgct 3360 
acaccttcta gacggaggac ttttatggac 3420 
ctagcaaagc aagtgaaaag aatgaagcac 3480 
gaagacgacg atgaagacga ccaatacgag 3540 

3543 



<210> 261 
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<212> PRT 
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<400> 261 

Met Glu Ala Asn Leu Arg lie Thr Glu Gly Ala Gly Val Leu Asp lie 

15 10 15 

Asp Asn Glu Asp Asp lie Asn Asn Asn Val Asp Tyr Ser Asn Leu Tyr 
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1010 1015 1020 

Lys Ala Lys Leu Ser lie Glu Val Tyr Cys lie Lys Tyr Lys Leu Asn 
1025 1030 1035 1040 

Gin Ala Ser Glu Lys Tyr Asn Lys Lys Gly Lys His Gly Lys Ser Thr 

1045 1050 1055 

Ser Val Val Pro Met Arg Asn Leu Met Tyr Arg Pro Ser Lys Asn Gin 

1060 1065 1070 

Asp Val Ser Pro Ser Thr Pro Ala Ala Ala Thr Ala Met Asp Val Pro 

1075 1080 1085 

Ser Ser Val Ser Ser His Val Gly Arg Lys Arg Thr Phe Ser Phe Ser 

1090 1095 1100 

Asn Asp He Asn Ser Asn Met Ser Ser Ala Ser Ser Val Tyr He Asp 
1105 1110 1115 1120 

Gin Glu Ser Ser Thr Pro Ser- Arg Arg Arg Thr Phe Met Asp Leu Leu 

1125 1130 1135 

Asn Asn Lys Ser Ser Val Asn Ser Leu Ala Lys Gin Val Lys Arg Met 

1140 1145 1150 

Lys His Thr Lys Tyr Tyr Asn Ser Ser Ser Asn Ser Glu Asp Asp Asp 

1155 1160 1165 

Glu Asp Asp Gin Tyr Glu 
1170 



<210> 262 
<211> 786 
<212> DNA 
<213> SHRIMP 

<400> 262 

atgtctaatg gcgcaactat aagcgatgag aggttgatcc tcatccttga taaaattgta 60 

gaaagacgtg gtgtttctaa tctatctgaa ttgttgatac accccataac caaacacata 120 

aacgaattgt tgaagaacac tgtaagacat ggagacagag tttacatgaa ggatgcagaa 180 

ctggatgtga gatctcgcct agaagacata aaaaaggatt gtgttttaaa ggcaattgaa 240 

aaacaaggaa tagatgttag acaaataata actgattact tggctaaacg aaaactaacg 300 

caaaatcttg tacattggta tcggccccca atatcttgca cagatataga cgaaaaaatt 360 

caacaagaaa ctggtcaagt agggcggtgt agtgttgcta cgtacaattt gagaattggt 420 

ggtgacgatg gagaatttac aaggtacgat ttctccattc ccttgggaga ttttaaaata 480 

acggcaaaat tgtttcgttc cataaatgat gaggatgtag atgcagtgat tcttgtgtct 540 

cgtagtgacg tagttaatga cgtgctaagc tttgaagcat ttaatcgaac aggagaacgc 600 

gtagtcatat tctttaatgt gattgttgaa gggaagagta aagatattga tattgtatgt 660 

aaatctagat ataaacacac ccatatacta aacggagaat ctgcaacata cgctgttaaa 720 

cgtataaaaa gaggcgatac aagggacgat atattgtttg caatcactgc ttttaaggag 780 
gagtaa 786 

<210> 263 
<211> 261 
<212> PRT 
<213> SHRIMP 

<400> 263 

Met Ser Asn Gly Ala Thr lie Ser Asp Glu Arg 

15 10 
Asp Lys He Val Glu Arg Arg Gly Val Ser Asn 

20 25 
He His Pro He Thr Lys His He Asn Glu Leu 

35 40 
Arg His Gly Asp Arg Val Tyr Met Lys Asp Ala 

50 55 
Ser Arg Leu Glu Asp He Lys Lys Asp Cys Val 



Leu He Leu He Leu 
15 

Leu Ser Glu Leu Leu 
30 

Leu Lys Asn Thr Val 
45 

Glu Leu Asp Val Arg 
60 

Leu Lys Ala He Glu 
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65 

Lys Gin Gly lie 

Arg Lys Leu Thr 
100 

Cys Thr Asp lie 
115 

Arg Cys Ser Val 
130 

Glu Phe Thr Arg 
145 

Thr Ala Lys Leu 

lie Leu Val Ser 
180 

Ala Phe Asn Arg 
195 

Val Glu Gly Lys 
210 

Lys His Thr His 
225 

Arg lie Lys Arg 

Ala Phe Lys Glu 
260 



70 

Asp Val Arg Gin 
85 

Gin Asn Leu Val 

Asp Glu Lys lie 
120 

Ala Thr Tyr Asn 
135 

Tyr Asp Phe Ser 
150 

Phe Arg Ser lie 
165 

Arg Ser Asp Val 

Thr Gly Glu Arg 
200 

Ser Lys Asp lie 
215 

lie Leu Asn Gly 
230 

Gly Asp Thr Arg 

245 

Glu 







75 




Tip 


Tip 


Thr 


ASp 










nis 


Trp 


Tyr 


Arg 


105 








Gin 


Gin 


Glu 


Thr 


Leu 


Arg 


j. j.e 


ui y 










lie 


Pro 


Leu 


oiy 






J. _) -J 




Asn 


Asp 


v-LU 


Asp 




170 






Val 


Asn 


Asp 


Val 


185 








Val 


Val 


He 


Phe 


Asp 


He 


Val 


Cys 








220 


Glu 


Ser 


Ala 


Thr 






235 




Asp 


Asp 


He 


Leu 




250 







80 

Tyr Leu Ala Lys 
95 

Pro Pro He Ser 
110 

Gly Gin Val Gly 
125 

Gly Asp Asp Gly 

Asp Phe Lys He 
160 

Val Asp Ala Val 
175 

Leu Ser Phe Glu 
190 

Phe Asn Val He 
205 

Lys Ser Arg Tyr 

Tyr Ala Val Lys 
240 

Phe Ala He Thr 
255 



<210> 264 
<211> 852 
<212> DNA 
<213> SHRIMP 



<400> 264 

atgtcgtcta 

caattgggag 

tatcgcttag 

gatcaagtaa 

tacgaagacg 

gttatggaaa 

actgaaattt 

ttttctttca 

ataaacgata 

gatttaagtt 

cgcaaatcca 

ctaagaaata 

tatattagtc 

acaattcaca 

tttgtttttt 



acggagatga 
ctgctcacca 
gtgcaattgc 
ggcaagaaat 
ctttcaatcc 
aagtatacga 
taaaagagga 
agtacaatac 
aacacatgtt 
tatcgttcaa 
cgttcgctgg 
gtgattcttc 
atgatacaaa 
tgcctgaaat 
ag 



gcctgctgtg 
tgacaattct 
ctattcggta 
caatagccat 
tgttatctgt 
tgtgaataga 
gcggtggaga 
tattgccttt 
atcaatcact 
aatagattct 
tattgtaaac 
tgataacata 
tggcataatc 
cgaaaacgcg 



<210> 265 
<211> 283 
<212> PRT 
<213> SHRIMP 



actgaagctg 
tggatcacaa 
gcaaaaaatg 
ttaactaatg 
gaagcaattt 
cggatcatgg 
agatatatac 
accgactctt 
ccccctggag 
tcagccaaaa 
agaccaaaaa 
ggtgattatc 
aacccatccg 
agtgatgatt 



aaatcgcttc 
gaaagagtga 
cctctataaa 
taatgact tt 
ttgagaaagg 
aacccaggga 
ctggttttta 
caacttcatt 
ctgctcaagg 
ctctcacgtt 
gtgtagtgat 
taaagagaaa 
aggattcggc 
tatacataga 



agtggaggct 
ccaattaaag 
atatatagag 
tgaacatctt 
aataccagtg 
agatttcata 
tcatacatca 
tagtgtacca 
ggatttaatt 
agaatttaac 
attatcaaat 
tgatcctata 
ctctctcatt 
tttcaatctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

852 



<400> 265 

Met Ser Ser Asn Gly Asp Glu Pro Ala Val Thr Glu Ala Glu He Ala 

15 10 15 

Ser Val Glu Ala Gin Leu Gly Ala Ala His His Asp Asn Ser Trp II 

20 25 30 

Thr Arg Lys Ser Asp Gin Leu Lys Tyr Arg Leu Gly Ala He Ala Tyr 
35 40 45 
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Ser 


Val 


Ala 


Lys 


Asn 


Ala 


Ser 


Tip 
l 1 e 




50 










55 




Gin 


Glu 


He 


Asn 


Car 


Hie; 


T.en 

liCU 


X III 


65 










70 






lyx 


Glu 


Asp 


Ala 


Phe 


Asn 


Pro 


V Ctl. 










85 








uxy 


Tip 

lie 


ITX U 


val 


Val 


ric; L 


r* l 
biu 


Lys 








100 












OX u 


Pro 


Arg 


uiu 


Asp 


r fie 


Tin 

x lc 






1 1 t 










x^ u 


Trp 


Arg 


Arg 


Tyr 


X 1c 




uiy 


file 




130 










135 




Tyr 


Asn 


1 fix 


T 1 <=v 

lie 


nlcl 


rue 


Thr 


Asp 


145 










1 SO 

1 






Tip 


Asn 


Asp 


Lys 


Hie: 

nib 


net 


Leu 


Ser 










X DO 








uiy 


Asp 


Leu 


Tie* 

x xe 


Asp 


Leu 


Ser 


Leu 








180 










Lys 


Thr 


Leu 


Thr 


Leu 


Glu 


Phe 


Asn 






195 










200 


Val 


Asn 


Arg 


Pro 


Lys 


Ser 


Val 


Val 




210 










215 




Asp 


Ser 


Ser 


Asp 


Asn 


He 


Gly Asp 


225 










230 






Tyr 


lie 


Ser 


His 


Asp 


Thr 


Asn 


Gly 










245 








Ala 


Ser 


Leu 


He 


Thr 


He 


His 


Met 








260 










Asp 


Leu 


Tyr 


He 


Asp 


Phe 


Asn 


Leu 



275 280 



Lys 


xyx 


Tip 
1 ic 


Ul u 


Asp 


Ulll 


vol 


r\ i y 








fin 

DU 










Asn 


VOX 


Met 

l ie L. 


1111 


triit; 


1 u 


nio 


Leu 






/ O 










ft n 


T 1 1= 

lie 


cy s 


ulU 




Tie 
1 lc 


rr I ltr 


oiu 


Lys 


















vajL 


Tyr 


Asp 


vax 


Asn 


Arg 


Arg 


Tin 

± ie 


lu J 










i i n 

1 1 u 






x ni 




T1 0 

ne 


Leu 


Lys 




vjIU 


Arg 










i / j 








Tyr 


nis 


xnr 


oer 


rfle 


Oar 


rtie 


Lys 








±HU 










oer 


Ser 


inr 


Ser 


Fne 


O y-» V- 

oei 


vax 


fro 






1DO 










1 bU 


ixe 


i nr 


r> -w- /-v 

fro 


Fro 






Ala 


Gi_Ln 




1 / u 










x to 




Ser 


rile 


Liys 


lie 


ASp 


ber 


Ser 


Ala 


loo 










1 90 






Arg 


Lys 


Ser 


Thr 


Phe 


Ala 


Gly 


He 










205 








He 


Leu 


Ser 


Asn 


Leu 


Arg 


Asn 


Ser 








220 










Tyr 


Leu 


Lys 


Arg 


Asn 


Asp 


Pro 


He 






235 










240 


He 


He 


Asn 


Pro 


Ser 


Glu 


Asp 


Ser 




250 










255 




Pro 


Glu 


He 


Glu 


Asn 


Ala 


Ser 


Asp 


265 










270 






Phe 


Val 


Phe 













<210> 266 
<211> 1302 
<212> DNA 
<213> SHRIMP 



<400> 266 

atggcacttt 

ggagcttcta 

aatgccaaaa 

gaaaaaactg 

atcaatgaga 

gctgaggatg 

aacgtatggt 

ataaaegtec 

acgaaccatg 

atcacttacg 

gttgatcctt 

gaaatggtga 

caacctacac 

ggaggttt ta 

aacatgaaac 

gttggat tag 

aaagcccgga 

aacaaaacag 

tcagactatt 

aataaaaatg 

ctggacattt 

agacctat ta 



caaacaatgg 
ttgccctctt 
caaaaaagag 
acgacgataa 
cagcgcttca 
aattcttcac 
taataaagga 
ccgatgtcgg 
acgaagtcaa 
aacaactcgt 
ctgatgaaaa 
tgttgacaac 
ctgattacct 
tcagtgatcg 
agaggacatt 
aacccgtgct 
tagagttttc 
aggactgttt 
ggataagctc 
atgaaacaag 
ttatgacaat 
cagataataa 



aggaatatac 
etttgetate 
taagaaatat 
taataataat 
aegtcaaacg 
caaattagca 
taaaataaca 
acaggcaatt 
ggaaacgatg 
aatagacttg 
tggggatgaa 
aacaccttct 
tgcccggtac 
tgatatgege 
taatgttatt 
acaaaaacaa 
atttgtagta 
ttgtagttta 
tgcaagcaca 
tttcttctat 
agattgtgca 
tataatggtt 



attgtttttg 
tegggegtag 
aaattagact 
ggaggaggag 
agagagcatt 
gatcaggaat 
gatggaaaag 
gctgatgaaa 
gatgaagttg 
accaatattt 
agectacaga 
tcacaactag 
tcaaaggaat 
acttggcaag 
agtgcagcaa 
ggtagagcag 
gaaggtaacc 
ctccccaact 
gctaaggaaa 
aactttgagg 
cctaatcttc 
gcactgtcat 



cggttattgt 
ggaagggaac 
ctaaatacac 
ggggaggaac 
ttgeaagaac 
ttgacacata 
tttcaatccc 
acttgttcga 
ttgcacaaaa 
tattgtttgg 
gatcaacaga 
ctagacaaca 
tggtgataaa 
gacgaatgtc 
cgaatctgga 
ctgtgggagg 
gtgtacgggt 
gttataatgt 
aaaegtaett 
aaggtgttga 
ctttcattaa 
aa 



tttggtaata 
tctacattca 
tgacgatgat 
agttgatgtt 
tcttgaaaaa 
caaatcagaa 
tgaaggtgac 
tctcataggg 
at ctaccaat 
tacagtaaca 
cccagacgca 
acaacctcct 
taatatacga 
tgtacatgtc 
ttctctacaa 
aegtattgaa 
ataegctaca 
taaaaaggca 
gtttattget 
agaaattgac 
aaatttacca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1302 
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<210> 267 




























<211> 431 




























<212> PRT 




























<213> SHRIMP 


























<400> 267 




























Wet 


Ala 


Leu 


Ser 


Asn 


Asn 


Giy 


oiy 


j. ie 


Tyr 


He 


Val 


pne 


Ala 


v ai 


lie 


i 
j. 








D 










1 n 














val 


Leu 


vai 


He 


uiy 


Aia 


061 


x j.e 


A±a 


Leu 


Phe 


Phe 


Ala 


lie 


Car 

ber 


Giy 








20 










4.0 
















vai 


Giy 


iiys 


Gly 


Thr 


T A1 4 

lieu 


HIS 


ber 


Asn 


A 1 ^ 

Ala 


Lys 


Thr 


Lys 


Lys 


Ser 


Lys 
















^ u 










A Q 
HO 








Xjys 


Tyr 


Lys 


Leu 


Asp 


oer 


Lys 


Tyr 


Tk ■*- 

inr 


Asp 


Asp 


Asp 


GlU 


Lys 


Tnr 


Asp 














D D 










60 










ASp 


Asp 


Asn 


Asn 


Asn 


Asn 


Gly 


Gly 


Gly 


Gly 


Gly Gly 


Thr 


Val 


Asp 


val 












1 C\ 

1 u 










75 










0 U 


lie 


Asn 


GlU 


Thr 


TV 1 

Ala 


Leu 


Gin 


Arg 


Gin 


Thr 


Arg 


Glu 


MIS 


Pne 


Ala 


Arg 




















90 










yb 




Thr 


Leu 


GlU 


Lys 


Ala 


GlU 


Asp 


Glu 


Fne 


Phe 


Thr 


Lys 


Leu 


Ala 


Asp 


Gin 








100 










1 nc 










Tin 
11U 






Glu 


Pne 


Asp 


Thr 


Tyr 


Tim 

Lys 


O A V 

ser 


GlU 


Asn 


Val 


Trp 


Leu 


lie 


Lys 


Asp 


Lys 






tic 

113 










1 2 u 










1 oc 
12b 








lie 


Thr 


Asp 


Gly 


Lys 


IF- 1 

val 


ber 


1 le 


Pro 


Glu Gly Asp 


lie 


Asn 


val 


Pro 




130 










loo 










140 










Asp 


Val 


Gly 


Gin 


Ala 


1 le 


Ala 


Asp 


GlU 


Asn 


Leu 


Phe 


Asp 


Leu 


He 


Gly 


14 5 










I jU 










155 










1 DU 


Thr 


Asn 


His 


Asp 


Glu 


val 


Lys 


GlU 


Thr 


Met 


Asp 


Glu 


Val 


Val 


Ala 


Gin 










1 OD 










170 










1 /b 




Lys 


Ser 


Thr 


Asn 


He 


Thr 


Tyr 


Glu 


Gin 


Leu 


Val 


He 


Asp 


Leu 


Thr 


Asn 








180 










IOC 

lob 










190 






lie 


Leu 


Leu 


Phe 


Gly 


Thr 


Val 


Thr 


Val 


Asp 


Pro 


Ser 


Asp 


Glu 


Asn 


Gly 






195 










0 /\ 
200 










205 








Asp 


Glu 


Ser 


Leu 


Gin 


Arg 


Ser 


Thr 


Asp 


Pro 


Asp Ala 


Glu 


Met 


Val 


Met 




210 










2 Id 










220 










Leu 


Thr 


Thr 


Thr 


Pro 


Ser 


Ser 


Gin 


Leu 


Ala 


Arg Gin 


Gin 


Gin 


Pro 


Pro 


o o c 

225 










0 *5 
2JO 










235 










2 40 


Gin 


Pro 


Thr 


Pro 


Asp 


Tyr 


Leu 


Ala 


Arg 


Tyr 


Ser 


Lys 


Glu 


Leu 


Val 


He 










O A C 










250 










2bb 




Asn 


Asn 


lie 


Arg 


Gly 


Gly 


Phe 


I le 


Ser 


Asp 


Arg 


Asp 


Met 


Arg 


Thr 


Trp 








260 










0 c c 
2ob 










270 






Gin 


Gly 


Arg 


Met 


Ser 


Val 


His 


Val 


Asn 


Met 


Lys 


Gin 


Arg 


Thr 


Phe 


Asn 






2 /b 










*> q n 

2oU 










285 








Val 


lie 


Ser 


Ala 


Ala 


rnv _ 

Tnr 


Asn 


T «i 1 

Leu 


Asp 


Ser 


Leu 


Gin 


Val 


Gly 


Leu 


Glu 




290 










29b 










300 










Pro 


Val 


Leu 


Gin 


Lys 


Gin 


Gly 


Arg 


Ala 


Ala 


Val 


Gly 


Gly Arg 


lie 


Glu 


305 




















315 










"3 *> n 
J2U 


Lys 


Ala 


Arg 


He 


blU 


Fne 


ber 


pne 


va l 


Val 


Glu 


Gly 


Asn 


Arg 


val 


Arg 










•joe 

J2b 










330 










Job 




Val 


Tyr 


Ala 


Thr 


Asn 


Lys 


Thr 


Glu 


Asp 


Cys 


Phe 


Cys 


Ser 


Leu 


Leu 


Pro 








340 










345 










350 






Asn 


Cys 


Tyr 


Asn 


Val 


Lys 


Lys 


Ala 


Ser 


Asp 


Tyr 


Trp 


He 


Ser 


Ser 


Ala 






355 










360 










365 








Ser 


Thr 


Ala 


Lys 


Glu 


Lys 


Thr 


Tyr 


Leu 


Phe 


He 


Ala 


Asn 


Lys 


Asn 


Asp 




370 










375 










380 










Glu 


Thr 


Ser 


Phe 


Phe 


Tyr 


Asn 


Phe 


Glu 


Glu 


Gly 


Val 


Glu 


Glu 


He 


Asp 


385 










390 










395 










400 


Leu 


Asp 


lie 


Phe 


Met 


Thr 


He 


Asp 


Cys 


Ala 


Pro 


Asn 


Leu 


Pro 


Phe 


He 










405 










410 










415 




Lys 


Asn 


Leu 


Pro 


Arg 


Pro 


He 


Thr 


Asp 


Asn 


Asn 


He 


Met 


Val 


Ser 










420 










425 










430 
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<210> 268 
<211> 207 
<212> DNA 
<213> SHRIMP 

<400> 268 

atgtctgata tgaccagaaa catcatcgtc 
atggtcgctt tcatgctttc tgttactcct 
gtatcagcac taggagttac actctttgga 
aatgctacaa tcaaccccgt ggcataa 



ggcttggccg ttgtcgtcat cgcattgagc 60 
gcacttaccg gattcctcct aggtttgggt 120 
tgtcccacta tgaaatctcc agggggagga 180 

207 



<210> 269 
<211> 68 
<212> PRT 
<213> SHRIMP 



<400> 269 

Met Ser Asp Met Thr Arg Asn He 

1 5 
He Ala Leu Ser Met Val Ala Phe 
20 

Thr Gly Phe Leu Leu Gly Leu Gly 

35 40 
Phe Gly Cys Pro Thr Met Lys Ser 

50 55 
Asn Pro Val Ala 
65 



lie Val Gly Leu Ala Val Val Val 

10 15 
Met Leu Ser Val Thr Pro Ala Leu 
25 30 
Val Ser Ala Leu Gly Val Thr Leu 
45 

Pro Gly Gly Gly Asn Ala Thr lie 
60 



<210> 270 
<211> 552 
<212> DNA 
<213> SHRIMP 



<400> 270 

atgttccaga 

acatgtgtat 

acatcgcatt 

tcctctccca 

gacgaccata 

ggagtacaat 

ttccctgttc 

gaaaatgaag 

gtgtccaaag 

tcaaacaaat 



aatggtttga 
gctcagttta 
cgcatgaggg 
ccagtggaaa 
cgagatggta 
tcatggtata 
acctccaaat 
aagatatgag 
acacagtgct 
aa 



atcgtttctg 
ttcatatttt 
tataaaaatc 
gatgtgtaac 
tgacaaaaat 
cgagttccat 
acacaacagg 
ggtcaggaac 
tgtagagaat 



gattcttccc 
tccccttgtc 
catcctccat 
caccaccaca 
acatcatgca 
ctaacaccaa 
aatactgaga 
attcatccaa 
gggtttcggt 



gacctagata 
ggaaacatat 
caatattgaa 
agagattgta 
tctatcttga 
agaacaatca 
aaacatccct 
aatccaagat 
acaaggtgaa 



tctggatacg 
aaaattttcc 
ccataatact 
cctgagcact 
agatattgga 
actattctcc 
cctcgtattt 
attgatcccc 
aattgtatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

552 



<210> 271 
<211> 183 
<212> PRT 
<213> SHRIMP 



<400> 271 
Met Phe Gin Lys 
1 

Tyr Leu Asp Thr 
20 

Cys Arg Lys His 
35 

Lys He His Pro 
50 



Trp Phe Glu Ser 
5 

Thr Cys Val Cys 

He Lys Phe Ser 
40 

Pro Ser He Leu 
55 



Phe Leu Asp Ser 
10 

Ser Val Tyr Ser 
25 

Thr Ser His Ser 

Asn His Asn Thr 
60 



Ser Arg Pro Arg 
15 

Tyr Phe Ser Pro 
30 

His Glu Gly He 
45 

Ser Ser Pro Thr 
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JCi. 


OAy 


T. v/ o 


Met 


Cys 


Asn 


His 


His 


His 


Lys 


Arg 


Leu 


Tvr 


Leu 


Ser 


Thr 


65 










70 










75 










80 


Asp 


Asp 


His 


Thr 


Arg 
85 






Asp 


Lys 


Asn 
90 


Thr 


Ser 


Cys 


He 


Tvr 
95 


Leu 


Glu 


Asp 


He 


Gly Gly 


Val 


Gin 


Phe 


Met 


Val 


Tyr 


Glu 


Phe 


His 


Leu 


Thr 








100 










105 










110 






Pro 


Lys 


Asn 


Asn 


Gin 


Leu 


Phe 


Ser 


Phe 


Pro 


Val 


His 


Leu 


Gin 


He 


His 




115 










120 










125 








Asn 


Arg 
130 


Asn 


Thr 


Glu 


Lys 


Thr 
135 


Ser 


Leu 


Leu 


Val 


Phe 
140 


Glu 


Asn 


Glu 


Glu 


Asp 


Met 


Arg 


Val 


Arg 


Asn 


He 


His 


Pro 


Lys 


Ser 


Lys 


He 


Leu 


He 


Pro 


145 










150 










155 










160 


Val 


Ser 


Lys 


Asp 


Thr 
165 


Val 


Leu 


Val 


Glu 


Asn 
170 


Gly 


Phe 


Arg 


Tyr 


Lys 
175 


Val 


Lys 


He 


Val 


Leu 


Ser 


Asn 


Lys 





















180 



<210> 272 
<211> 684 
<212> DNA 
<213> SHRIMP 



<400> 272 

atggactcac ttataagcaa attggaaaac atattctcca ttgccgagca ggactttttc 60 
aacgcggaca gcatgttcat gcaaaccatg ctcctcccta ccgacgccat gttcaccgat 120 
tgcgagtctc cattgtacaa gaacaagtcg ggagggaaga atattgtcac cgatgttgga 180 
gagagtgtac tgtcttcttc ttcggacgaa aagatgagct tcaaagtgct gtcccacgta 24 0 
ctcaggcgat tccctgtcct acttcattgc aactacaagc agacgaatac gcccctgtgg 300 
aaggagcttt acaagcacgg gaagtttgcc ctcctcggcg acctggtgtt attctccaac 360 
ccattccacc ccaatatccc cgccatgccg tttgataaat cccccatttg tgacaccact 420 
ggaaaatcta tcattatgag tgaagtcatg accaaggagc ttttgtacaa gttggccgac 480 
aaagatattg gccaattctt tgctgtattg aatgtaacta accccattac tggagattct 540 
ttcctccatt actttgcagg aggaaatacc atgagggatg gggaagggga taaaatctgc 600 
acatctgctg atgtgttacg cattattgct gagataacaa tacagaaaac tggcaagatg 660 
ccatatgaat tgatgaagaa ataa 684 

<210> 273 
<211> 227 
<212> PRT 
<213> SHRIMP 



<400> 273 



Met 


Asp 


Ser 


Leu 


He 


Ser 


Lys 


Leu 


Glu 


Asn 


lie 


Phe 


Ser 


He 


Ala 


Glu 


1 






5 










10 










15 




Gin 


Asp 


Phe 


Phe 


Asn 


Ala 


Asp 


Ser 


Met 


Phe 


Met 


Gin 


Thr 


Met 


Leu 


Leu 






20 










25 










30 






Pro 


Thr 


Asp 
35 


Ala 


Met 


Phe 


Thr 


Asp 
40 


Cys 


Glu 


Ser 


Pro 


Leu 
45 


Tyr 


Lys 


Asn 


Lys 


Ser 
50 


Gly 


Gly 


Lys 


Asn 


He 
55 


Val 


Thr 


Asp 


Val 


Gly 
60 


Glu 


Ser 


Val 


Leu 


Ser 


Ser 


Ser 


Ser 


Asp 


Glu 


Lys 


Met 


Ser 


Phe 


Lys 


Val 


Leu 


Ser 


His 


Val 


65 










70 










75 










80 


Leu 


Arg 


Arg 


Phe 


Pro 
85 


val 


Leu 


Leu 


His 


Cys 
90 


Asn 


Tyr 


Lys 


Gin 


Thr 
95 


Asn 


Thr 


Pro 


Leu 


Trp 
100 


Lys 


Glu 


Leu 


Tyr 


Lys 
105 


His 


Gly 


Lys 


Phe 


Ala 
110 


Leu 


Leu 


Gly 


Asp 


Leu 
115 


Val 


Leu 


Phe 


Ser 


Asn 
120 


Pro 


Phe 


His 


Pro 


Asn 
125 


He 


Pro 


Ala 


Met 


Pro 
130 


Phe 


Asp 


Lys 


Ser 


Pro 
135 


He 


Cys 


Asp 


Thr 


Thr 
140 


Gly 


Lys 


Ser 


He 
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He Met Ser Glu 
145 

Lys Asp lie Gly 

Thr Gly Asp Ser 
180 

Asp Gly Glu Gly 
195 

He Ala Glu He 

210 
Met Lys Lys 
225 



Val Met Thr Lys 
150 

Gin Phe Phe Ala 
165 

Phe Leu His Tyr 

Asp Lys He Cys 
200 

Thr He Gin Lys 
215 



Glu Leu Leu Tyr 
155 

Val Leu Asn Val 
170 

Phe Ala Gly Gly 
185 

Thr Ser Ala Asp 

Thr Gly Lys Met 
220 



Lys Leu Ala Asp 
160 

Thr Asn Pro lie 
175 

Asn Thr Met Arg 
190 

Val Leu Arg lie 
205 

Pro Tyr Glu Leu 



<210> 274 
<211> 2193 
<212> DNA 
<213> SHRIMP 

<400> 274 

atggagggtg gggaccaacg gacaaaactt 
tcgaaaacgc caggagaagg agaaggagga 
gccatagctg tgaaatcttg ttgctctaaa 
tctccttatt ctcttaggcc catgaagaga 
aaagcaccgc ctcctgtgac tttgaggctc 
tttaatagaa ataagaagaa gaggcctatt 
aatccaacgt atgcgacaga cattatcaag 
tctgtgttga ggaagcaccg cgccaatgcc 
ccaaattgtg caaaattctc tactgtcaat 
gtcctccgtt cccatgtcaa ggggccaaaa 
gaaacaaatg tagtaaagag gaacttttct 
cccccgtgtt actttgcagt ggcagaggct 
ccgttccatt tgattagaca ggccgcaaaa 
tcgtccaact acgagaccat aaaacagagc 
tttttgtggt ctaacaaccg tactaaacaa 
ggtagtaaaa acctatccgt tttagacact 
aagaataatg gtaaagggtg gataaaaacc 
tttaagcttt cccctggagc atgtgaatct 
gaggtcactg cccaatgtaa gaaatgggaa 
cacgtccaac cagaatatgc ttctcaagtc 
gaagttaaag tcggggtata ttacatgctt 
gacaaggaaa tgtactgtga atttgtattt 
agaatgaaag gcgcagtagg tgtgagaaag 
gattttgcag gcatttctgt gattgatgta 
gagaatgttg tacagcaacc taacccctcg 
atgcaagatc atggcagctt gtgtaggatg 
caagctactg gccgcctgcc tgaaggtcta 
tcaatttcag ccatagctat cgttggaaaa 
tttgttttgg tagaaattgt aaatattgtc 
ccggacataa aaagtgacaa tatagtaatt 
gattttggtt taattgtacc atgtaaaaag 
gagaggttct ttagtaacca tcctcataca 
tcagaaactg ccatgacttt tgggttggct 
attaagagaa ctgcagattt gtctgccaat 
attgtatcta aaatgtatga ccaggaaaag 
cctgtaattg gtgcatgttt cccgttcaag 
aaacattcat tgtatagcaa gaaggttaag 



acgccagcaa ccgtgatggg actttaccaa 60 
gaaggaggag ggcaattcaa gataccttca 120 
aacgctactc gccgatcccc tccctcagat 180 
ctaaagaaga ataatggaga ggtgggagga 240 
cgcgaggact acgagagcac accttacaac 300 
actattgatg aaaatcaatt tgcaacatta 360 
aagcagcaat tgccttctgt tagtgccgcg 420 
gacacccagt acagaaaaag attctctcat 4 80 
ttgaaggcta gagactatac tccactgtct 540 
cacttgaaat cttcttgtga taccgtgact 600 
tccattgaca agtgggtcaa gctagaaaaa 660 
gataccaata ttgcagccgg tctagaatct 720 
ttaggcctca tttctgacgt gcaagatgtg 780 
tgtattgacg caaaggaaaa agcgtccaag 840 
cccccttcat cttggtggcc tgttgggttt 900 
agccctctct tgaactggaa caggttatgc 960 
atgagcatcg atcacatggc aaagaatgtt 1020 
atattggaga agaaaactac actcttgggg 1080 
agttaccgca gaaatattcc tgtaccagca 1140 
gtaatgattg gaccatctga attatatctc 1200 
gaaactggaa aagttatcaa gtttatgacg 1260 
gaaactgttt ttagtcacgc tcttgaggga 1320 
atgtgtgttg aaggtttttg tgtcgagatg 1380 
ttaaatggag acctgaaatg taaaatggac 1440 
actacttcct ccaagccagc cgctgagctc 1500 
agggatactc tgtacggtgt taggatgctt 1560 
caatctaaat gcaagaaacc cattacggat 1620 
atgagggaga gaatgttaaa ccaattgccc 1680 
actcggttgt ctcaacaagg attagtgaat 1740 
gatggaataa ctggtcaacc taagatgatt 1800 
tactacaatt ttaaatgttg gggaactgat 1860 
gctcctgaat ttattaacag tgagttgtgt 1920 
tatttgttaa tagacatgtt gtccattttg 1980 
tctatctata caaacattcc atttttgtct 2040 
accaataggc cgagagcgta tgaaattgcg 2100 
gataatattg ctaaactttt ccagtcacct 2160 
tag 2193 



<210> 275 
<211> 724 
<212> PRT 
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<213> SHRIMP 



<400> 275 



Met 


Glu 


Gly 


Gly 


1 








Gly 


Gin 


Ser 


Lys 








20 


Gin 


Phe 


Lys 


He 






35 




Asn 


Ala 


Thr 


Arg 




50 






Pro 


Met 


Lys 


Arg 


65 








Pro 


Pro 


Pro 


Val 


Tyr 


Asn 


Phe 


Asn 








100 


Asn 


Gin 


Phe 


Ala 






115 




Lys 


Gin 


Gin 


Leu 




130 






Arg 


Ala 


Asn 


Ala 


145 








Cys 


Ala 


Lys 


Phe 


Leu 


Ser 


Val 


Leu 








180 


Ser 


Cys 


Asp 


Thr 






195 




Ser 


He 


Asp 


Lys 




210 






Val 


Ala 


Glu 


Ala 


225 








His 


Leu 


lie 


Arg 


Asp 


Val 


Ser 


Ser 








260 


Lys 


Glu 


Lys 


Ala 






275 




Pro 


Pro 


Ser 


Ser 




290 






Val 


Leu 


Asp 


Thr 


305 








Asn 


Gly 


Lys 


Gly 


Asn 


Val 


Phe 


Lys 








340 


Thr 


Leu 


Leu 


Gly 






355 




Arg 


Arg 


Asn 


lie 




370 






Gin 


Val 


Val 


Met 


385 








Gly 


Val 


Tyr 


Tyr 


Asp 


Lys 


Glu 


Met 








420 


Ala 


Leu 


Glu 


Gly 






435 




Val 


Glu 


Gly 


Phe 




450 







Asp 


Gin 


Arg 


xnr 


5 








Thr 


Pro 


Gly 


Glu 


Pro 


ber 


ax a 


T 1 A 










Arg 


Ser 


Pro 


Pro 






55 




Leu 


Lys 


Lys 


Asn 




70 






Thr 


Leu 


Arg 


Leu 


85 








Arg 


Asn 


Lys 


Lys 


Thr 


Leu 


Asn 


Pro 








120 


Pro 


Ser 


Val 


Ser 






135 




Asp 


Thr 


Gin 


Tyr 




150 






Ser 


Thr 


Val 


Asn 


165 








Arg 


Ser 


His 


Val 


Val 


Thr 


Glu 


Thr 








200 


Trp 


Val 


Lys 


Leu 






215 




Asp 


Thr 


Asn 


He 




230 






Gin 


Ala 


Ala 


Lys 


245 








Asn 


Tyr 


Glu 


Thr 


Ser 


Lys 


Phe 


Leu 








280 


Trp 


Trp 


Pro 


Val 






295 




Ser 


Pro 


Leu 


Leu 




310 






Trp 


lie 


Lys 


Thr 


325 








Leu 


Ser 


Pro 


Gly 


Glu 


Val 


Thr 


Ala 








360 


Pro 


Val 


Pro 


Ala 






375 




He 


Gly 


Pro 


Ser 




390 






Met 


Leu 


Glu 


Thr 


405 








Tyr 


Cys 


Glu 


Phe 


Arg 


Met 


Lys 


Gly 








440 


Cys 


Val 


Glu 


Met 






455 





Lys 


Leu 


Thr 


Pro 




10 






Gly 


GlU 


Gly Gly 


25 








Aid 


vai 


Lys 


Ser 


Ser 


Asp 


Ser 


Pro 








60 


Asn 


Gly 


Glu 


Val 






75 




Arg 


Glu 


Asp 


Tyr 




90 






Lys 


Arg 


Pro 


He 


105 








Thr 


Tyr 


Ala 


Thr 


Ala 


Ala 


Ser 


Val 








140 


Arg 


Lys 


Arg 


Phe 






155 




Leu 


Lys 


Ala 


Arg 




170 






Lys 


Gly 


Pro 


Lys 


185 








Asn 


Val 


Val 


Lys 


Glu 


Lys 


Pro 


Pro 








220 


Ala 


Ala 


Gly 


Leu 






235 




Leu 


Gly 


Leu 


He 




250 






He 


Lys 


Gin 


Ser 


265 








Trp 


Ser 


Asn 


Asn 


Gly 


Phe 


Gly 


Ser 








300 


Asn 


Trp 


Asn 


Arg 






315 




Met 


Ser 


lie 


Asp 




330 






Ala 


Cys 


Glu 


Ser 


345 








Gin 


Cys 


Lys 


Lys 


His 


Val 


Gin 


Pro 








380 


Glu 


Leu 


Tyr 


Leu 






395 




Gly 


Lys 


Val 


He 




410 






Val 


Phe 


Glu 


Thr 


425 








Ala 


Val 


Gly 


Val 


Asp 


Phe 


Ala 


Gly 








460 



Ala Thr Val Met 
15 

Glu Gly Gly Gly 
30 

Cys Cys Ser Lys 
45 

Tyr Ser Leu Arg 

Gly Gly Lys Ala 
80 

Glu Ser Thr Pro 
95 

Thr He Asp Glu 
110 

Asp He He Lys 
125 

Leu Arg Lys His 

Ser His Pro Asn 
160 

Asp Tyr Thr Pro 
175 

His Leu Lys Ser 
190 

Arg Asn Phe Ser 
205 

Cys Tyr Phe Ala 

Glu Ser Pro Phe 
240 

Ser Asp Val Gin 
255 

Cys lie Asp Ala 
270 

Arg Thr Lys Gin 
285 

Lys Asn Leu Ser 

Leu Cys Lys Asn 
320 

His Met Ala Lys 
335 

He Lys Lys Thr 
350 

Trp Glu Ser Tyr 
365 

Glu Tyr Ala Ser 

Glu Val Lys Val 
400 

Lys Phe Met Thr 
415 

Val Phe Ser His 
430 

Arg Lys Met Cys 
445 

He Ser Val He 
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Asp 


val 


Leu 


Asn 




Asp 


Leu 


T v/c 












H f \J 






bin 


uin 




Asn 


Pro 




X Hi 


J.IIX 










485 








Via 4- 


bin 


Asp 


nis 


uiy 


Co v 


Leu 


Cys 


















vai 


Arg 


Met 


Leu 


(j_Ln 


Ala 


inr 


t*iy 






ji »j 










j ^ u 


Lys 


Cys 


Lys 


Lys 


Pro 


Tin 

ixe 


inr 


Asp 


















r*i i# 
oiy 


Lys 




Arg 




Arg 




Leu 












CCA 






tjlU 


lie 


T 7 - -I 

va j. 


Asn 


lie 


val 


inr 


Arg 










JOJ 








rxO 


Asp 


T 1 a 

1 le 


Lys 


Ser 


Asp 


Asn 


1 ie 








C Q O 










fro 


i*ys 


Ufa 4- 


lie 


Asp 


rfle 


Liiy 


Leu 


















Asn 


Fne 


Lys 


Cys 


Trp 


CjJLy 


1 nr 


Asp 




oiu 










D J. O 




HIS 


inr 


Ala 


Pro 


c?iu 


Fne 


lie 


Asn 












D J w 






Met 


Thr 


Phe 


Gly 


Leu 


Ala 


Tyr 


Leu 










645 








lie 


Lys 


Arg 


Thr 


Ala 


Asp 


Leu 


Ser 








660 










Pro 


Phe 


Leu 


Ser 


He 


Val 


Ser 


Lys 






675 










680 


Arg 


Pro 


Arg 


Ala 


Tyr 


Glu 


He 


Ala 




690 










695 




Phe 


Lys 


Asp 


Asn 


He 


Ala 


Lys 


Leu 


705 










710 






Lys 


Lys 


Val 


Lys 













T we 

iiys 


Mct 

475 


Asp 


V3-LU 


Asn 


V dX 


VdJ. 

480 


Car 


oer 

4 QO 

"i -7 V 


Lys 


Jrro 


nld 


Ml a. 


UlU 


Leu 


Arg 




Arg 


Asp 


TH r~ 
J. 1 1.L 


Leu 


Tyr 


biy 


505 










510 






Arg 


Leu 


Pro 


r*i ti 
ulU 


biy 

525 


Leu 


bin 


Oft — 




T 1 0 

1 ie 


Ser 


Ala 

540 


lie 


Aia 


1 ie 


vai 


Asn 


oin 


Leu 

555 


Fro 


Fne 


vai 


Leu 


v ai 
560 


Leu 


Ser 

O / U 


bin 


Cain 


oiy 


T ft*1 

Leu 


vai 

O / D 


Asn 


vai 


1 J.e 


Asp 


b-Ly 


T 1 A 

ixe 


inr 


laiy 


Cain 


585 










590 






Ti- 
ne 


vai 


Tim 

Fro 


Cys 


Lys 
605 


Lys 


Tyr 


Tyr 


blU 


Arg 


Fne 


Fne 
620 


£>er 


Asn 


nlS 


Pro 


Ser 


Glu 


Leu 
635 


Cys 


Ser 


Glu 


Thr 


Ala 
640 


Leu 


He 
650 


Asp 


Met 


Leu 


Ser 


He 
655 


Leu 


Ala 


Asn 


Ser 


He 


Tyr 


Thr 


Asn 


He 


665 










670 






Met 


Tyr 


Asp 


Gin 


Glu 
685 


Lys 


Thr 


Asn 


Pro 


Val 


He 


Gly 
700 


Ala 


Cys 


Phe 


Pro 


Phe 


Gin 


Ser 
715 


Pro 


Lys 


His 


Ser 


Lys 
720 



<210> 276 
<211> 615 
<212> DNA 
<213> SHRIMP 



<400> 276 

atgtctagcg 

cttctagatg 

gccggttccc 

agacggggaa 

tcagtatcag 

tttgctcagt 

gttgagagct 

cgcttgtatt 

tacaacaaga 

aagtaccatg 

tctcaaagga 



gaaaagtaac 
gaggtgcagc 
tccacgataa 
aaacaaacag 
ttgtccaaca 
acaatatggg 
gcaagaagca 
actttaacaa 
taatcttccc 
cttctattcg 
tgtaa 



ctacgaaatc 
aatctgtctg 
cctcttcaag 
caaaaatctg 
cccgacagaa 
taatggaaga 
cgaaagggtc 
gtgtcttcac 
tgccagaatt 
agatttctcc 



gttgaagggg 
cagtctaatt 
atgctaggat 
gccataattg 
ccagaaaggt 
aaatgttact 
cacaaatctt 
gcgatcgcca 
gggtgcgcgg 
acaatcattg 



gattgttgaa 
gtgttgcaag 
ttggcgaccc 
aagatagacc 
tttgctccat 
tccctaacga 
ccacagaaat 
aatcacctgc 
cagctggagg 
ataaggaagt 



caacaagtac 60 
aaaacgtcac 120 
ctataaacag 180 
tcaactcggg 240 
gacattctta 300 
caaagagtat 360 
gaaaagattg 420 
aatgaagaag 4 80 
agattgggag 540 
gataatagtg 600 
615 



<210> 277 
<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 277 

Met Ser Ser Gly Lys Val Thr Tyr Glu He Val Glu Gly Gly Leu Leu 
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1 

Asn Asn Lys Tyr 
20 

Asn Cys Val Ala 
35 

Phe Lys Met Leu 
50 

Thr Asn Ser Lys 
65 

Ser Val Ser Val 

Met Thr Phe Leu 
100 

Tyr Phe Pro Asn 
115 

Arg Val His Lys 
130 

Phe Asn Lys Cys 
145 

Tyr Asn Lys lie 

Gly Asp Trp Glu 
180 

lie Asp Lys Glu 
195 



5 

Leu Leu Asp Gly 

Arg Lys Arg His 
40 

Gly Phe Gly Asp 
55 

Asn Leu Ala lie 
70 

Val Gin His Pro 
85 

Phe Ala Gin Tyr 

Asp Lys Glu Tyr 
120 

Ser Ser Thr Glu 
135 

Leu His Ala lie 
150 

lie Phe Pro Ala 
165 

Lys Tyr His Ala 

Val lie lie Val 
200 



10 

Gly Ala Ala He 
25 

Ala Gly Ser Leu 



Pro 


Tyr 


Lys 


oJ.il 








DU 


Tip 




Asp 


A ITT 






75 




Thr 


Glu 


Pro 


Glu 




90 






Asn 


Met 


Gly 


Asn 


105 








Val 


Glu 


Ser 


Cys 


Met 


Lys 


Arg 


Leu 








140 


Ala 


Lys 


Ser 


Pro 






155 




Arg 


He 


Gly 


Cys 




170 






Ser 


He 


Arg 


Asp 


185 








Ser 


Gin 


Arg 


Met 



15 

Cys Leu Gin Ser 
30 

His Asp Asn Leu 
45 

Arg Arg Gly Lys 

Pro Gin Leu Gly 
80 

Arg Phe Cys Ser 
95 

Gly Arg Lys Cys 
110 

Lys Lys His Glu 
125 

Arg Leu Tyr Tyr 

Ala Met Lys Lys 
160 

Ala Ala Ala Gly 
175 

Phe Ser Thr He 
190 



<210> 278 
<211> 828 
<212> DNA 
<213> SHRIMP 



<400> 278 

atgtcttcca 

agatggacaa 

ccccaacaac 

cctccattcc 

gaccagtgca 

agggaaagag 

agatcttccc 

aaccaacaca 

cttgtgaatt 

gctcgcgccc 

gttgtcatcg 

ccagatattc 

gattcttcta 

gtttattctg 



accgattcag 
ctgtcaagaa 
aacaacaaca 
ccatcattag 
gaaattgcga 
ttcgctcatt 
ccttccttaa 
tgtaccccct 
ggcgagaccc 
ctacacccac 
aggaagaaac 
ccgcaataac 
cagtatttct 
cattatttgc 



tcagctgagg 
caggaggaac 
tcaaaaaaga 
ttggggagcc 
tgaaactggc 
aaacaacact 
taagaagaag 
caacaagtac 
ttcacaagaa 
tccccaagaa 
tgcaccagaa 
tgcaactact 
cagaaatgtt 
aaaatgtatt 



ggcaacgagg 
agacagcaac 
acatcaacca 
ctcggcagct 
tattacaatt 
ccaagtgaag 
gacgttgacg 
agtttccgtg 
aaacaggaca 
aaggaaccag 
ccagaaccag 
actactacta 
attctgagta 
agatctaaga 



agatggttgg 
agtattccca 
attctcctcc 
actcaatgta 
tccactctta 
gcatgtggcg 
aagctccacc 
aatatactcc 
agatcttaca 
aagtagaaac 
aaccagcccc 
cagttgcaac 
tcgtgttttg 
aggaataa 



<210> 279 
<211> 275 
<212> PRT 
<213> SHRIMP 



ggactattca 60 
tagtttccgt 120 
tgctccacct 180 
tcgactggat 24 0 
tgatagaaag 300 
gcgcacaagt 360 
tcctcaatca 420 
ttcatcaaag 480 
agaggaagaa 540 
taaagatgat 600 
agttccagac 660 
acgtcacgac 720 
gtttctgggt 780 
828 



<400> 279 

Met Ser Ser Asn Arg Phe Ser Gin Leu Arg Gly Asn Glu Glu Met Val 

1 5 10 15 

Gly Asp Tyr Ser Arg Trp Thr Thr Val Lys Asn Arg Arg Asn Arg Gin 

20 25 30 

Gin Gin Tyr Ser His Ser Phe Arg Pro Gin Gin Gin Gin Gin His Gin 

35 40 45 

Lys Arg Thr Ser Thr Asn Ser Pro Pro Ala Pro Pro Pro Pro Phe Pro 
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50 55 60 



He 


He 


Ser 


Tro 


Gly 


Ala 


Leu 


Glv 


Ser 


Tvr 


Ser 


Met 


Tvr 


Aro 


Leu 


Asd 


65 










70 










75 










80 


Asp 


Gin 


Cys 


Arg 


Asn 


Cys 


Asp 


Glu 


Thr 


Glv 


i yj. 


x yx 


Asn 


Phe 


His 


Ser 










85 










90 










95 




Twr 

y 


Asp 


Arg 


Ly s 


Arg 


Glu 


Arg 


Val 


Arg 


Ser 


Leu 


Asn 


Asn 


Thr 


Pro 


Ser 








100 










1 OS 

-L \J *J 










110 






Glu 


Gly 


Met 


i J- p 


Arg Arg 


Thr 


Ser 


Arg 


OCL 


Ser 


Pro 


Phe 


Leu 


Asn 


Lys 






115 










120 










125 








Lys 


Ly s 


Asp 


Val 


Asp 


Glu 


Ala 


Pro 


Pro 


Pro 


Gin 


•JC-L 


Asn 


Gin 


His 


Met 




130 










-L -J -J 




















xyr 


IT i. V 




noil 


Lys 


Tyr 


Cor 


IT i lt5 


Arg 




Tyr 


1111. 


Pro 




OCX 


Lys 


It J 










150 










1 ss 

j. j j 










1 fiO 


Leu 


Val 


Asn 


Trr> 


Arg 


Asp 


Pro 


Ser 


Gin 


Glu 


Lys 


Gin 


Aso 


Lvs 


He 


Leu 










165 










170 










175 




Gin 


Glu 


Glu 


Glu 


Ala 


Arg 


Ala 


Pro 


Thr 


Pro 


Thr 


Pro 


Gin 


Glu 


Lys 


Glu 








180 










185 










190 






Pro 


Glu 


Val 


Glu 


Thr 


Lys 


Asp 


Asp 


Val 


Val 


He 


Glu 


Glu 


Glu 


Thr 


Ala 






195 










200 










205 








Pro 


Glu 


Pro 


Glu 


Pro 


Glu 


Pro 


Ala 


Pro 


Val 


Pro 


Asp 


Pro 


Asp 


He 


Pro 




210 










215 










220 










Ala 


He 


Thr 


Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Ala 


Thr 


Arg 


His 


Asp 


225 










230 










235 










240 


Asp 


Ser 


Ser 


Thr 


Val 


Phe 


Leu 


Arg 


Asn 


Val 


He 


Leu 


Ser 


He 


Val 


Phe 










245 










250 










255 




Trp 


Phe 


Leu 


Gly 


Val 


Tyr 


Ser 


Ala 


Leu 


Phe 


Ala 


Lys 


Cys 


He 


Arg 


Ser 



260 265 270 



Lys Lys Glu 
275 



<210> 280 
<211> 2025 
<212> DNA 
<213> SHRIMP 



<400> 280 

atggcaggga 

gaacaaggca 

aacttaaaat 

gcgtctacgc 

gacatcaacc 

tctatacgga 

gaaccaatct 

gcaaatgcaa 

atagtgacag 

gctgtaaact 

gggaatgtag 

tcagaaatat 

acccctgaca 

gttcttatag 

ctgttcatat 

aaca tttctt 

aacctgtaca 

tcaaaaaagg 

gttgtggaga 

tcatggcctg 

ttaatgagac 

aactccttga 

aagattttgc 

tcagtcaaag 



atagaaccca 
tggagtgttg 
tgggcgattc 
ccctccttgg 
taatggctca 
gagttgtagt 
cacgccctca 
taagggacaa 
cggccttggg 
acatgtacga 
tctatctttc 
tagataaggt 
attcgttccc 
atctcatcac 
gtaaccattt 
cttcatccga 
atagactgga 
agtccttttc 
gttttcctcc 
ttattaataa 
tagcgataag 
caacagcagt 
tcgggcgagc 
agacgggaga 



gttcgtatcg 
tggcagacaa 
tcttaaagaa 
ggaattaaac 
tttctcctac 
ggctaatcat 
cccgctggga 
gttcatcagc 
ggctattgcc 
tgtagaattt 
caccaaaatg 
atcaaaacga 
ttaccaattg 
caatgcgatc 
aagggcagtt 
.taattatttt 
atggttcatg 
aggagctgac 
cgttctcttg 
cagcaataat 
gacgagtagc 
gaccaaccgt 
tctggacgaa 
agaaaaggat 



tccctgattg 
gcacaggatg 
actgatgtta 
tacgacaaac 
gctgctttgg 
caacgtagaa 
ggggtagaac 
atgggggcgt 
agtgaacgtg 
gcagaaagag 
gacgaagatg 
cccgcaaagg 
atttggggcg 
gtgcctaata 
attaggagta 
gaggatggac 
ttagtagtta 
aatgttaacg 
gacactgaat 
aatagtacac 
ggtgcccgac 
attaccttcc 
gaagaagctg 
aagaacaata 



ctaaatgtat 
cattaatgac 
atttggaata 
aacaatatgc 
gtatagaaag 
ataatggaaa 
ctcctctatc 
tggacagatt 
aactattctt 
atgctgctac 
aagatgacat 
aaggtataga 
atgattctgt 
tttttatggc 
tgagggaaat 
gtaaatggtg 
gatttgtaat 
tgaaaagact 
gggtcaagac 
tccctgtgac 
atcctatttt 
agtctgcaga 
gaacaaaaat 
cgttctcttc 



atcagacgtg 
tcgcctagcc 
cttgagatac 
ggcaacagtt 
tatactgaat 
aaaaccttct 
gtcagagttg 
gaattcagca 
acgtgaaaat 
tacagataca 
aataaagcgt 
ctggcgcccc 
agatgatact 
aaaatttatc 
tttatacggt 
cttctggttg 
tt tcctccac 
tctggtggta 
taatataacg 
agaagacacc 
cgacgaaatt 
attctgcaca 
gctagtaaaa 
atttggttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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ttactgaaga acacaaaaaa tgaagaattg gaaataaaca taggcgataa cgatgatgag 1500 

actacagatg tggcttgttg ggcacgtact tcctcgacat cctttatccg taataggaca 1560 

tatgcgttta aaaaaatatg gggccttgag gatgcaagtg atgtagtcga gctgaagcga 1620* 

gagagtgacg ccattacatc ctttgtcacc gataagagca gtcctctcct atttccgtat 1680 

gtgtccgact ggagttgctt actattacat ccctgttgta aagcaccggc cataattaaa 174 0 

agtgtgtggt tacaaatcct gaaagatttt tcccaggaaa atataaaaac tataaatgaa 1800 

aaggtacaat ctctttcatc tgagatttgt cagaaatcaa acgaccgttt taaaaataaa 1860 

aaaattgctg ccgaacacgt tcgcagtgta aaaaagttat taaatacgat aagcaacagg 1920 

gagcaagaag cagcactatc tacagaacac tgtatttggt taacgatttt gtggaaacaa 1980 

gtcgttcaga acactctcaa ccttctggag aattttcccg tataa 2025 

<210> 281 
<211> 672 
<212> PRT 
<213> SHRIMP 



<400> 281 



Met 


Ala 


Gly 


Asn 


Arg 


Thr 


Gin 


Phe 


Val 


Ser 


Ser 


Leu 


He 


Ala 


Lys 


Cys 


1 








5 










10 










15 




lie 


Ser 


Asp 


Val 


Glu 


Gin 


Gly 


Met 


Glu 


Cys 


Cys 


Gly 


Arg 


Gin 


Ala 


Gin 








20 










25 










30 






Asp 


Ala 


Leu 


Met 


Thr 


Arg 


Leu 


Ala 


Asn 


Leu 


Lys 


Leu 


Gly 


Asp 


Ser 


Leu 






35 










40 










45 








Lys 


Glu 


Thr 


Asp 


Val 


Asn 


Leu 


Glu 


Tyr 


Leu 


Arg 


Tyr 


Ala 


Ser 


Thr 


Pro 




50 










55 










60 










Leu 


Leu 


Gly 


Glu 


Leu 


Asn 


Tyr 


Asp 


Lys 


Gin 


Gin 


Tyr 


Ala 


Ala 


Thr 


Val 


65 










70 










75 










80 


Asp 


He 


Asn 


Leu 


Met 


Ala 


His 


Phe 


Ser 


Tyr 


Ala 


Ala 


Leu 


Gly 


He 


Glu 










85 










90 










95 




Ser 


He 


Leu 


Asn 


Ser 


He 


Arg 


Arg 


Val 


Val 


Val 


Ala 


Asn 


His 


Gin 


Arg 








100 










105 










110 






Arg 


Asn 


Asn 


Gly 


Lys 


Lys 


Pro 


Ser 


Glu 


Pro 


He 


Ser 


Arg 


Pro 


His 


Pro 






115 










120 










125 








Leu 


Gly 


Gly 


Val 


Glu 


Pro 


Pro 


Leu 


Ser 


Ser 


Glu 


Leu 


Ala 


Asn 


Ala 


He 




130 










135 










140 










Arg 


Asp 


Lys 


Phe 


He 


Ser 


Met 


Gly 


Ala 


Leu 


Asp 


Arg 


Leu 


Asn 


Ser 


Ala 


145 










150 










155 










160 


lie 


Val 


Thr 


Ala 


Ala 


Leu 


Gly 


Ala 


He 


Ala 


Ser 


Glu 


Leu 


Phe 


Leu 


Arg 










165 










170 










175 




Glu 


Asn 


Ala 


Val 


Asn 


Tyr 


Met 


Tyr 


Asp 


Val 


Glu 


Phe 


Ala 


Glu 


Arg 


Asp 








180 










185 










190 






Ala 


Ala 


Thr 


Thr 


Asp 


Thr 


Gly 


Asn 


Val 


Val 


Tyr 


Leu 


Ser 


Thr 


Lys 


Met 






195 










200 










205 








Asp 


Glu 


Asp 


Glu 


Asp 


Asp 


He 


He 


Lys 


Arg 


Ser 


Glu 


He 


Leu 


Asp 


Lys 




210 










215 










220 










Val 


Ser 


Lys 


Arg 


Pro 


Ala 


Lys 


Glu 


Gly 


He 


Asp 


Trp 


Arg 


Pro 


Thr 


Pro 


225 










230 










235 










240 


Asp 


Asn 


Ser 


Phe 


Pro 


Tyr 


Gin 


Leu 


He 


Trp 


Gly 


Asp 


Asp 


Ser 


Val 


Asp 










245 










250 










255 




Asp 


Thr 


Val 


Leu 


He 


Asp 


Leu 


He 


Thr 


Asn 


Ala 


He 


Val 


Pro 


Asn 


He 








260 










265 










270 






Phe 


Met 


Ala 


Lys 


Phe 


He 


Leu 


Phe 


He 


Cys 


Asn 


His 


Leu 


Arg 


Ala 


Val 






275 










280 










285 








He 


Arg 


Ser 


Met 


Arg 


Glu 


He 


Leu 


Tyr 


Gly 


Asn 


He 


Ser 


Ser 


Ser 


Ser 




290 










295 










300 










Asp 


Asn 


Tyr 


Phe 


Glu 


Asp 


Gly 


Arg 


Lys 


Trp 


Cys 


Phe 


Trp 


Leu 


Asn 


Leu 


305 










310 










315 










320 


Tyr 


Asn 


Arg 


Leu 


Glu 


Trp 


Phe 


Met 


Leu 


Val 


Val 


Arg 


Phe 


Val 


He 


Phe 










325 










330 










335 




Leu 


His 


Ser 


Lys 


Lys 


Glu 


Ser 


Phe 


Ser 


Gly 


Ala 


Asp 


Asn 


Val 


Asn 


Val 








340 










345 










350 
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Lys 


Arg 


Leu 


Leu 


Val 


Val 


Val 


Val 


Glu 


Ser 


Phe 


Pro 


Pro 


Val 


Leu 


Leu 






355 










360 










365 








Asp 


Thr 


Glu 


Trp 


Val 


Lys 


Thr 


Asn 


He 


Thr 


Ser 


Trp 


Pro 


Val 


He 


Asn 




370 










375 










380 










Asn 


Ser 


Asn 


Asn 


Asn 


Ser 


Thr 


Leu 


Pro 


Val 


Thr 


Glu 


Asp 


Thr 


Leu 


Met 


385 










390 










395 










400 


Arg 


Leu 


Ala 


He 


Arg 


Thr 


Ser 


Ser 


Gly 


Ala 


Arg 


His 


Pro 


He 


Phe 


Asp 










405 










410 










415 




Glu 


He 


Asn 


Ser 


Leu 


Thr 


Thr 


Ala 


Val 


Thr 


Asn 


Arg 


He 


Thr 


Phe 


Gin 








420 










425 










430 






Ser 


Ala 


Glu 


Phe 


Cys 


Thr 


Lys 


He 


Leu 


Leu 


Gly Arg 


Ala 


Leu 


Asp 


Glu 






435 










440 










445 








Glu 


Glu 


Ala 


Gly 


Thr 


Lys 


Met 


Leu 


Val 


Lys 


Ser 


Val 


Lys 


Glu 


Thr 


Gly 




450 










455 










460 










Glu 


Glu 


Lys 


Asp 


Lys 


Asn 


Asn 


Thr 


Phe 


Ser 


Ser 


Phe 


Gly 


Leu 


Leu 


Leu 


4 65 










470 










475 










480 


Lys 


Asn 


Thr 


Lys 


Asn 


Glu 


Glu 


Leu 


Glu 


He 


Asn 


He 


Gly 


Asp 


Asn 


Asp 










485 










490 










495 




Asp 


Glu 


Thr 


Thr 


Asp 


Val 


Ala 


Cys 


Trp 


Ala 


Arg 


Thr 


Ser 


Ser 


Thr 


Ser 








500 










505 










510 






Phe 


He 


Arg 


Asn 


Arg 


Thr 


Tyr 


Ala 


Phe 


Lys 


Lys 


He 


Trp 


Gly 


Leu 


Glu 






515 










520 










525 








Asp 


Ala 


Ser 


Asp 


Val 


Val 


Glu 


Leu 


Lys 


Arg 


Glu 


Ser 


Asp 


Ala 


He 


Thr 




530 










535 










540 










Ser 


Phe 


Val 


Thr 


Asp 


Lys 


Ser 


Ser 


Pro 


Leu 


Leu 


Phe 


Pro 


Tyr 


Val 


Ser 


545 










550 










555 










560 


Asp 


Trp 


Ser 


Cys 


Leu 


Leu 


Leu 


His 


Pro 


Cys 


Cys 


Lys 


Ala 


Pro 


Ala 


He 










565 










570 










575 




lie 


Lys 


Ser 


Val 


Trp 


Leu 


Gin 


He 


Leu 


Lys 


Asp 


Phe 


Ser 


Gin 


Glu 


Asn 








580 










585 










590 






lie 


Lys 


Thr 


He 


Asn 


Glu 


Lys 


Val 


Gin 


Ser 


Leu 


Ser 


Ser 


Glu 


He 


Cys 






595 










600 










605 








Gin 


Lys 


Ser 


Asn 


Asp 


Arg 


Phe 


Lys 


Asn 


Lys 


Lys 


lie 


Ala 


Ala 


Glu 


His 




610 










615 










620 










Val 


Arg 


Ser 


Val 


Lys 


Lys 


Leu 


Leu 


Asn 


Thr 


He 


Ser 


Asn 


Arg 


Glu 


Gin 


625 










630 










635 










640 


Glu 


Ala 


Ala 


Leu 


Ser 


Thr 


Glu 


His 


Cys 


He 


Trp 


Leu 


Thr 


He 


Leu 


Trp 










64 5 










650 










655 




Lys 


Gin 


Val 


Val 


Gin 


Asn 


Thr 


Leu 


Asn 


Leu 


Leu 


Glu 


Asn 


Phe 


Pro 


Val 








660 










665 










670 







<210> 282 
<211> 2535 
<212> DNA 
<213> SHRIMP 



<400> 282 

atgttcactc 

attgacgttc 

aatccggaat 

acaatagttg 

atcaagaagg 

ctctgggtac 

gagattgtga 

aactatcaaa 

attttaatga 

aacaacacca 

atgctccaca 

gctagtagtt 

gcaatccaag 



acttgacccg 
accgagtggt 
catccatata 
gtgattacat 
tcatcgaaca 
accttctgga 
atgccaagta 
ctgggaagag 
gatgtggatc 
actttcttca 
ttgctgaaat 
tccctgatgc 
aagataa tgc 



agccttccgc 
ggccgagtta 
tagaactccg 
cctctctccg 
cttcccagta 
cgaacgccat 
caggggtatt 
tctactgatg 
a tacaaattc 
ctgctgtgca 
gttctttgcc 
tgcagcagag 
tgatgcactc 



aagatgaaca 
tcctatccag 
atatccctct 
aagacggact 
atcttccact 
catcgcctgc 
gttgtcatcc 
agcaaactgg 
atctcattaa 
agtaaatggg 
aaccctacta 
gacgacaagg 
ctgttcctga 



acctagtcaa 
aattcgaaga 
tccaaaacaa 
cattccaagt 
gcacccacaa 
tccagagcct 
catactacag 
cgtccgtaaa 
tgtgcatgat 
gagaagttgg 
ctagccaaca 
ggaaaacacc 
tctccctcta 



tcgcagcttc 
ggatgtaaag 
ggatattgtt 
tctataccca 
taatgcccct 
gctgacgtac 
gcgccccatc 
agttctggac 
caacaagaag 
aagcaagatg 
cctatccgac 
tgcccatcta 
cggcgcaccc 



60 
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180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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tggtttcaag ataacaactc gtacatgaaa tctgcccttg aactcaagtc taacaagtgt 840 

gtcaaggtac tatcctttgc agctgacaag tacgagattt tacccaacat taacaacaat 900 

caactagaac cagataccat gtgtggagtg tgtgcaacat ctgtggaaga agatgaaaat 960 

gaagggaaaa caacaagtct ttcctggtac cagatgaatt gcaagcatta catccattgc 1020 

gaatgcctga tgggaatgtg tgctgctgct ggcaatgtac aatgccccat gtgccgtgag 1080 

gatgtgggcg acgaagtact ggaaagatgc cctcctacaa tatttagatg gttaaaactg 1140 

gctgagagat ctgaacacaa tcgtgtactt tttgaagcaa aaaagcaaga attctataag 1200 

cagatggaag caatgaaacc tcccagagtt gttgttcctc ctcgcaggac atttctcacc 1260 

ccagccagaa gaggcgaacg agccatcaga atcgcaagag aaattgccac caacgccatc 1320 

gctgaagcca cagctcaagg agatgtcaac tcctacttcc ctgttctcat tgacgggagc 1380 

ggagaagaat atgaagaaga gggagaagaa ttcttcaatt ctgaagagga ggcgcttgct 144 0 

tttggaagac catttctgga agatgaggaa gaagccagac aaatacagat gcgccagttt 1500 

gctgaactgt ctagacgagg cgtttctgtc aatattatta acaatgataa tcctcatcga 1560 

cacatctcta cagtaaatat tgtgcaacca gtttatggag ttgaaaagtc acctgctgct 1620 

tccttcatct acaacatgct caagaatgac gtctttgagt ctatacgctc aagagatact 1680 

cgagttggag gagaaagagt gcccgtcatg aacctgtcca atgacaagag ggcattattc 1740 

cacgcagctt cttccatgct ttgtgacttt gccacagaaa caaactctca aattgttgga 1800 

ttggactttc aagcagtgta tgatccccat cacatatcca actatatcga gacgtttggt 1860 

agtcctcttc acgcctaccc aggagccgtc acttttctgg acggggccca ggactattat 1920 

gcagagagta tcagatacga caatgatatt gtctcattct cagaaatggc aagtgagttg 1980 

cacatcaccg aagcattaga tgtctttgag ggtagtttat tatccccact gttcaagaaa 2040 

atcaggactg gaaaatctta ctctaactgg aacgaccatt tgaggcgtag aaattatgct 2100 

cgagatattg ctgaggaatt tgttagggta tgtgaaaact ctctagcttc acgcgaacac 2160 

ccccctgttc atgtacatcc ctttagagat ggagcaatcc ccattctcat tgaatatata 2220 

gtagatttca tccaccactg catcacctgg tctatgcaag ttaatgcact ccattgtatg 2280 

agaaagtaca ttgaacacga gaatacaaat gtgcacctgt taaacttgcg tcctactgat 234 0 

gaaagggtgg aagttttaag ggtatctcaa ctcagatgga gccgcttgtt caatgaacaa 2400 

tacaacacta gaatgtccct cagcaccaaa agattgagcc tcatgaagat cttcaaccat 2460 

gatttgggtg tgtctaaatt tggtgtatac aaactcctag atattattga aatgtactgt 2520 
tttactttaa tctaa 2535 

<210> 283 
<211> 838 
<212> PRT 
<213> SHRIMP 

<400> 283 



Met 


Phe 


Thr 


His 


Leu 


Thr 


Arg 


Ala 


Phe 


Arg 


Lys 


Met 


Asn 


Asn 


Leu 


Val 


1 








5 










10 










15 




Asn 


Arg 


Ser 


Phe 
20 


He 


Asp 


Val 


His 


Arg 

25 


Val 


Val 


Ala 


Glu 


Leu 
30 


Ser 


Tyr 


Pro 


Glu 


Phe 
35 


Glu 


Glu 


Asp 


Val 


Lys 
40 


Asn 


Pro 


Glu 


Ser 


Ser 
45 


He 


Tyr 


Arg 


Thr 


Pro 
50 


He 


Ser 


Leu 


Phe 


Gin 
55 


Asn 


Lys 


Asp 


He 


Val 
60 


Thr 


He 


Val 


Gly 


Asp 


Tyr 


He 


Leu 


Ser 


Pro 


Lys 


Thr 


Asp 


Ser 


Phe 


Gin 


Val 


Leu 


Tyr 


Pro 


65 










70 










75 










80 


lie 


Lys 


Lys 


Val 


He 
85 


Glu 


His 


Phe 


Pro 


Val 
90 


He 


Phe 


His 


Cys 


Thr 
95 


His 


Asn 


Asn 


Ala 


Pro 
100 


Leu 


Trp 


Val 


His 


Leu 
105 


Leu 


Asp 


Glu 


Arg 


His 
110 


His 


Arg 


Leu 


Leu 


Gin 
115 


Ser 


Leu 


Leu 


Thr 


Tyr 
120 


Glu 


He 


Val 


Asn 


Ala 
125 


Lys 


Tyr 


Arg 


Gly 


He 
130 


Val 


Val 


He 


Pro 


Tyr 
135 


Tyr 


Arg 
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Cys 
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680 

Tyr Ala Arg Asp 
695 

Leu Ala Ser Arg 
710 

Gly Ala He Pro 
725 

Cys He Thr Trp 

Tyr He Glu His 
760 

Thr Asp Glu Arg 
775 

Arg Leu Phe Asn 
790 

Arg Leu Ser Leu 
805 

Phe Gly Val Tyr 
Leu He 



He Ala Glu Glu 
700 

Glu His Pro Pro 
715 

He Leu He Glu 
730 

Ser Met Gin Val 
745 

Glu Asn Thr Asn 

Val Glu Val Leu 
780 

Glu Gin Tyr Asn 
795 

Met Lys He Phe 
810 

Lys Leu Leu Asp 
825 



685 

Phe Val Arg Val 

Val His Val His 
720 

Tyr He Val Asp 
735 

Asn Ala Leu His 
750 

Val His Leu Leu 
765 

Arg Val Ser Gin 

Thr Arg Met Ser 
800 

Asn His Asp Leu 
815 

He He Glu Met 

830 - 
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<400> 284 

atggcagcag 

attgecgata 

gcagctaatg 

tggtcccaag 

gaggcattta 

gcaactggac 

tcaaatacaa 

agtgctgaaa 

cccgtgtact 

gcatccaaga 

ctgattgttg 

gaactagaac 

gatgaaatgt 

cttccccatc 

atcaacattg 

tctataccat 

cccaataact 

agtggagatt 

ttctatggcg 

acctgttctt 

acacctgaaa 

ttccattcac 

aacccttttt 

gaagtattgg 

atggacagat 

gatagaataa 

tgtaaaattc 

actgetaegt 

tatttgtgta 

gaagatcaca 

ctcgaaccta 

attaagatgc 



acctcctaga 
ctgaattcct 
gaeggecata 
ataacaggaa 
tagctggttc 
taggataccc 
ctaattacaa 
ttgataagcg 
tgaaactgat 
ggacggccaa 
ccatattaat 
agcaagtatt 
gtgtagtggc 
atttgtacaa 
aaaattcctc 
caagtatggt 
tgatgactgc 
acggatggtt 
accgaaaaca 
attggaacaa 
ataactgtgg 
eggtaacgeg 
ttattagaca 
aaaatgttgg 
ttggtagtct 
agtctctaga 
tagaaaccaa 
ctgatgatcc 
atattctaga 
etaaaacegg 
aacaacattt 
tgagaaatga 



gttggctatc 
caattatctt 
tcttcctaca 
gctgaaactc 
tatagatget 
tctattgaaa 
cttgtacatt 
tccttcatca 
cgacagaaga 
ggaaattgga 
acctgcatac 
ggatgatgea 
catattgagt 
tgcacctttc 
attctttaat 
aataaaacac 
ageagaaegg 
ttctgtaatt 
tctaataaat 
gtatatggat 
ttttegtata 
agtcaacaag 
cacagaacct 
cgtgaccctc 
attacttgga 
aactatttct 
gtataactac 
tataaagatg 
atttgetata 
caatggtggt 
aattgttaaa 
ttgegaaege 



caggaaacaa 
ccccataaaa 
ctagaaatga 
ttggggcatt 
gaaacgtgcg 
aaactagccc 
gatagaaatt 
atacagaaca 
accgagtgtc 
geagecagaa 
acagaaacgt 
tattttaatg 
actttgeaca 
cgtcttcctc 
gaagatacga 
catacgegaa 
tctatatttt 
gtaggatcta 
acagtcaaat 
tgeagatett 
agggctgeaa 
aagaaaacga 
aagtggtaca 
gaacaacacg 
cgagaaaaga 
aacaatctca 
aaatcttctt 
aagattatcg 
atttcatcag 
tgtgcgtttt 
gtgaataaat 
aacaagtgta 



tccagtctga 
ctggcatctg 
ggaacgaagt 
tttgtggcaa 
tgggcttttt 
tgattgcccg 
caatgatgaa 
cttctcacac 
ttgccctgga 
aggtttgttt 
ttgttcttga 
eggagaataa 
atttatttgt 
cctttggaca 
cacctattct 
aaaatagtag 
tacgtggagt 
etataatgee 
ctaataactt 
atgggtttga 
ttgattgetc 
gcattattaa 
acaaaaatgc 
tccgtgttag 
aatggacgtg 
agggtaaaat 
ccttatacta 
cctctataaa 
agaaaaagga 
caaagtataa 
acattgaggc 
ggtttaaaga 



attggaagaa 
cgaagaagct 
tgaccatttc 
cttgtatgtg 
gaggtegcaa 
tgaggataaa 
acaagttttt 
aaaatcttcc 
ttggctggac 
cctccagaac 
tacaggcaac 
agataaagtg 
taggaaaagt 
acaccctatc 
agegtcaatt 
atggagatgt 
tttaacagtc 
tagtgtactt 
ttctgecata 
gattatagac 
gaacacagat 
tgcggtaaag 
catgtgtggt 
tgatgagtat 
taattatcta 
tgacaccatg 
taagcaaata 
caagaggagg 
tgaggtggag 
gaaaaaacaa 
tttttcgtta 
ggctgaaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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agagagtgcg 
cacgatctgg 
aacgctaaca 
aaagattcgc 
ccaataagac 
actgttacct 
gactttttgg 
agactaaaaa 
aattctgata 
agtacagttg 
ttttcaatcg 
tcgagtagtg 
atgaataact 
tgcacagata 
gtaaaaagta 



ccaacgaact 
tattgaagcg 
ctatcatgtc 
gcgatgttaa 
actctatgct 
gttttgaatg 
atcgttacga 
aatgtagtag 
cactttattc 
taaaaaatga 
gtaaaggaga 
acgacgacga 
gcagcagcag 
ttctcaatgt 
tggatgtgtt 



ggtacgagaa 
aactaatgtc 
tttaatacct 
gttgttgaat 
ggagttagta 
gttaaaatgg 
gaaaacgggg 
agatcacact 
ttattctctc 
cactgacgga 
aaaaacaacc 
tgaagaagaa 
catcaagaac 
tgtagaacct 
gaatttgtta 



ctgtataggg 
cacttgacat 
aaatgtaagc 
tttctgagaa 
tacggagagg 
tgctcgaaaa 
gaggaagata 
aaggatataa 
gataaaaatg 
aaaacgtcta 
aaaaaacgaa 
gaggaagaag 
aagagcaaga 
tctctaccta 
tgatgttaa 



cttcggccag 
ggcaacgccc 
tacacacagt 
cgagggatgg 
agtacgctaa 
aaggtgtgat 
aagacgaaag 
aaaaaataga 
tgcaaaccca 
tggtgggctg 
aactggaaac 
atgaaggaaa 
acaagaatgg 
atactttatc 



aagttatgtc 
ttacgatgaa 
attgtacgat 
aaattataac 
agatgtcagt 
taaatatgaa 
ggaattcttt 
aaatgtacta 
cgcatcttct 
ggattatatt 
gatagatata 
aagaatgaaa 
gagaatgtgt 
gttcaattgt 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2799 
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Phe 


Glu 


Trp 


Leu 








740 










lie 


Lys 


Tyr 


Glu 


Asp 


Phe 


Leu 


Asp 






755 










760 


Asp 


Lys 


Asp 


Glu 


Arg 


Glu 


Phe 


Phe 




770 










775 




His 


Thr 


Lys 


Asp 


He 


Lys 


Lys 


He 


785 










790 











315 










320 


Glu 


Arg 


Ser 


He 


Phe 


Leu 


Arg 


Gly 




330 










335 




Gly 


X X£> 


Phe 


Ser 


Val 


He 


Val 


Glv 
oiy 


345 










350 






Phe 


x yi 


Gly 


Asp 


Arg 


Lys 


His 


Leu 










365 








Phe 


Ser 


Ala 


He 


Thr 


Cys 


Ser 


xyi 








380 










Ser 


T \/ T~ 

x yi 


Gly 


Phe 


Glu 


He 


He 


Asp 






395 










4 00 




Tip 

x le 


Arg 


Ala 


rtlci 


Tip 

lie 


ASp 






din 

11U 










4 15 
1 X -J 




v a.1 


Thr 
l til 


Arg 


v ct 1 


Asn 


Lys 


Lys 


Xiyo 


425 










430 






Asn 


Pro 


rile 


XT lie 


Tip 
lie 


Arg 


ni 0 


Thr 

X III 










44 5 

*i t *J 








Ala 


Met 


Cys 


vaiy 


G\ 11 

vlU 


v 01 


X<eu 


ni] 11 

OlU 








4 60 










His 


v di 


Arg 


Vol 




ASp 


Glu 


xyr 






475 










480 


Leu 


wiy 


Arg 


uxu 


Lys 


Lys 


xrp 


TVi r- 

X XI XT 




4 90 










4 95 




Oei 


Leu 


<J 1 U 


x 1 1 X 


Tip 

lie 


Car 

oeir 


Asn 


Asn 


5(i5 










51 n 








Lys 


Tic* 
lie 


Thr 

nil 




Tyr 


Asn 


xyr 










595 








bill 


Tip 

lie 


TV, v- 
X II X 




X 111 


Cor 


Asp 


ASp 








Oft U 










oex 


T1- 

x xe 


Asn 


Lys 


Arg 


Arg 


Tyr 


Leu 






555 












Ser 


f i ii 


Lys 


Lys 


Asp 


uiu 


V ai 


oiU 




^"7 n 














biy 


uiy 


Cys 


Aid 


rile 


Qp V- 

oer 


Lys 


Tyr 


CO c 
JOJ 










5QH 






i?xn 


niS 


Leu 


Tip 

xxe 


Vdi 


Lys 


val 


Asn 


















Tip 

x ±e 


x*ys 


net 


Leu 


Arg 


Asn 


Asp 


Cys 








v? \} 










vjI U 


nia 


Lsl Li 


Tip 

xxe 


Arg 




Cys 


i\i a 
















Di U 


Arg 


flip 
Ala 


Car 


rtl a 


Arg 


oei 


iyr 


val 




650 










655 




Asn 


vol 


Hie: 


Leu 


X £11 


T-r-n 

xrp 


oui 


Arg 


665 










670 






Tip 

lie 


ne l. 


O e X 


Leu 


Tip 

lie 


Prrt 


T 


uya 










685 








Lys 


Asp 


C" c y~ 


Arg 


Asp 


val 


Lys 


L6U 








inn 
/uu 










isxy 


Asn 


Tyr 


Asn 


Pro 


Tip 

x xe 


Arg 


ill s 






715 










720 


Glu 


Glu 


Tyr 


Ala 


Lys 


Asp 


Val 


Ser 




730 










735 




Lys 


Trp 


Cys 


Ser 


Lys 


Lys 


Gly 


Val 


745 










750 






Arg 


Tyr 


Glu 


Lys 


Thr 


Gly 


Glu 


Glu 










765 








Arg 


Leu 


Lys 


Lys 


Cys 


Ser 


Arg 


Asp 








780 










Glu 


Asn 


Val 


Leu 


Asn 


Ser 


Asp 


Thr 






795 










800 
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Lys 


Tyr 


oer 


Leu 


Asp 
805 


Lys 


Asn 


Val 


va ± 


vai 


Lys 


Asn 
820 


Asp 


Thr 


Asp 


Gly 


Tyr 


T 1 Q 

iie 


Fne 
835 


Ser 


He 


Gly 


Lys 


Gly 
840 


Leu 


Glu 
850 


Thr 


He 


Asp 


He 


Ser 
855 


Ser 


Glu 


Glu 


Glu 


Asp 


Glu Gly 


Lys 


Arg 


865 










870 






Ser 


He 


Lys 


Asn 


Lys 
885 


Ser 


Lys 


Asn 


Asp 


He 


Leu 


Asn 
900 


Val 


Val 


Glu 


Pro 


Asn 


Cys 


Val 
915 


Lys 


Ser 


Met 


Asp 


val 
920 



Gin 


Thr 


His 


Ala 


Ser 


Ser 


Ser 


Thr 




810 










815 




Lys 


Thr 


Ser 


Met 


Val 


Gly 


Trp 


Asp 


825 










830 






Glu 


Lys 


Thr 


Thr 


Lys 


Lys 


Arg 


Lys 










845 








Ser 


Asp 


Asp Asp 


Asp 


Glu 


Glu 


Glu 








860 










Met 


Lys 


Met 


Asn 


Asn 


Cys 


Ser 


Ser 






875 










880 


Lys 


Asn 


Gly 


Arg 


Met 


Cys 


Cys 


Thr 




890 










895 




Ser 


Leu 


Pro 


Asn 


Thr 


Leu 


Ser 


Phe 


905 










910 






Leu 


Asn 


Leu 


Leu 











<210> 286 
<211> 635 
<212> DNA 
<213> SHRIMP 



<400> 286 

catttcttcg tctacataaa gaagaataac tggttctctg gggccattag taatgacact 60 
actattaact ccacaaattc tggaggagta ggaggattat gggataatgt ttctgattca 120 
gagtcaggga cgtttccatc tagggtacaa tattccacat caacgaaaaa tttagtcaat 180 
atagccatgg atggagatta tgcgacatta gtaaggaacg gaatgtccac aaatcaaagg 240 
ccatatgaaa cctacaaaga cgtggaggat agccagttct atttacattt tttccacgtt 300 
agaaatttta aacctttaaa cggtgatgaa aataaagacc accttgaaag ggatgaaagt 360 
'tttgtgttga tcgaatcacc atattataat ggagggtttt tatcatacaa tatcaacaac 420 
ccaaatccca tttacaattc tactgaaaag ccgtatatta acacggagat aacttccatc 4 80 
gtcagcacca ctggtacaga tgaaaggttc ttctgcctcg agaaggaata cgtcgaagat 540 
ggtgaagaag gagttacaga aaataggtac tttttacgcc acatggcaag taattatgtt 600 
gtaaaggcta gtttcaaatc agtcatgccc actat 635 

<210> 287 
<211> 431 
<212> DNA 
<213> SHRIMP 



<400> 287 

ctgaagaata ttgaaagaag acttcttgaa gaggaccgat aaaaaaatgg ccaccttcca 60 

gactgacgcc gatttcttgc tggtggggga tgatactagt agatatgaag aagtgatgaa 120 

gacttttgat actgttgagg cagtcaggaa gagtgatcta gatgaccgtg tttacatggt 180 

gtgcctaaag cagggatcta cttttgtcct caatggaggc atcgaagaat tgcgtctttt 24 0 

gactggagat tcaacgctgg agattcaacc catgattgtg ccaacaacag aataaaataa 300 

agacggtgac gggagactaa tatctttctt agtttcccgt cacggtgaaa atgttggtta 360 

tttcttccct atgtttaaaa atttgtcttg gttaaaaaaa taaaacgaaa actgtcaaaa 420 

aaaaaaaaaa a 431 

<210> 288 
<211> 1103 
<212> DNA 
<213> SHRIMP 



<400> 288 

ctattttagg gtactagaaa acgttgtcag 

taattgcgct tatcgctgca tttgttattg 

tggatcacaa tatcaataag gaactaaatt 

tcacaccaga agatatagct cataacaaca 



aaccaatcat gaggtttatc ccgatgttga 60 

cagctcttct tactgccaat tcaaattacc 120 

tgacccgttc ccttcaactt cggggaacat 180 

gaattctccc ttccaagctg agtgttttag 240 



* 4 
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aaagagggtc 
ataattctca 
catcccaatc 
cctgtatgtg 
aatatattaa 
agtattaaga 
cttgcaagtg 
tccttttctt 
tcctacaaca 
tccagggaat 
tcttcttctt 
gtaccttcta 
gaagatatta 
agattttcta 
attctatatt 



aatcatttta 
agagaggagg 
atcttcccaa 
acgagtctcg 
aggaggaaga 
tgatgaaacc 
cgatagggag 
ccatgacgtt 
ataaaaggat 
tgttatctga 
cttcttcaac 
attgtacttc 
accgtttaag 
aaattaatag 
ccaaaaaaaa 



gcagaaatgg 
aacatttcta 
gttgaacttt 
caccgacgag 
agttagtgga 
tggttcaaaa 
tagaaagaaa 
gaatgaagga 
aaaaaacaat 
aactacggtg 
gtcatcatcc 
taaaactgta 
gtataacata 
attaactaca 



acaagtacaa 
ctcctcaaca 
aaattctata 
aaagaagatt 
ggagaaaagg 
atcaaaacaa 
gctgaagaat 
ggacgtctgg 
cgttctaggc 
agggaggaaa 
acttcttcgt 
tgcgattttg 
atacatatat 
tgtatacaag 



aaatgcacaa 
acaacaacag 
agaaaatgct 
ttaacaacga 
ttgtaaatat 
agtacaggct 
atattgaacg 
cttcagcgat 
ctgtgaagct 
tactacggaa 
tttcttcaat 
taaaacacat 
ctgaagaaaa 
gtataagtaa 



cctacagtaa 
accaccccct 
gatttttgtt 
tgaggaagaa 
cgtagtaggg 
ctactccctc 
tctatacaat 
ttattgtcca 
gatccatgcc 
atctcccgct 
ttttttgttt 
acagtatgaa 
ctatgcttca 
gacaataaag 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1103 



<210> 289 
<211> 234 
<212> DNA 
<213> SHRIMP 



<400> 289 

acaacctcca catctccttc taattctaac acacaagaaa aggataaaaa taaaataaat 60 
attgatgcta acaaaattta ttcaagaggt cacaaacatg taccaatatc aattgtaaaa 120 
agcacgttaa acaaagcgtt aaaaatagca ggtatgtctg ctatgagcaa agagcaagtg 180 
ttaacaatgt accaattaat aaaccttagg taggaaagag aaaaaaaaaa aaaa 234 



<210> 290 
<211> 597 
<212> DNA 
<213> SHRIMP 



<400> 290 

aaaatactcc 

tcgtggagga 

gtcacatttg 

gctgcgtcct 

cagttaattg 

ttcatcctta 

caactccacc 

ggcaatcagt 

aagaagaaaa 

ttttttctca 



tcaacttcca 
agggcaagta 
ctatctagga 
cctgctggtg 
tgatgtgagc 
ccattcttct 
agatactaaa 
tactgtgaat 
agcaa tgggt 
atatggcaaa 



acaagaatgg 
aacgactacg 
agaggaggaa 
actgtcatga 
ccacagagct 
tccacaacca 
aaagttgacg 
aatggaggtg 
atcaatacag 
aaataaaata 



gctctgctaa 
ggtctaccaa 
gaattgcatg 
ctttgttgat 
actcgccgcc 
ccactacttc 
acgactatga 
ttttcatcaa 
ataatggagg 
tggtatatta 



cttgaacgac 
caagaaacaa 
tgtcatcgca 
tgtggtactg 
gccgcagccg 
cactactact 
tgacgacgtc 
tggaagaaaa 
atttgtttgg 
aaaaaaaaaa 



cgaactctac 
ccagtgtgct 
tctgttctgg 
ggaactgcac 
ccggtgcagt 
actactacac 
aatattggag 
ctctcaaaag 
aagaatggtg 
aaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

597 



<210> 291 
<211> 335 
<212> DNA 
<213> SHRIMP 



<400> 291 

cgagaacaaa 

caagaccaaa 

gcggaggagc 

ttcctcaaag 

acaacaacta 

tccattgaaa 



agattgttgt 
tcccgccgtg 
ccatcaatga 
agcggaaaga 
tgttatttaa 
cttaaaaaaa 



ctaggaacaa 
gaagcaagaa 
agaagcgtgc 
agtctggagc 
ttgatttttt 
aaaaaaaaaa 



agaaacatta 
gaggtccacc 
aggaaagaag 
ccgcaagtca 
ttcttctgaa 
aaaaa 



aaaatggt tg 
actgctggac 
agctccactg 
aggcgttaat 
taattggaaa 



cccgaagctc 60 
gcatctccaa 120 
tccgtcgccg 180 
tcttccctgt 240 
taataaaaca 300 
335 



<210> 292 
<211> 225 
<212> DNA 
<213> SHRIMP 
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<400> 292 

ggagctctgt catttctggt gcaatctgac attggctcga cccagcggtc caccctccaa 60 

acttgacatc agaccgaccc agcggtccac cctctaaact cgagcgaccc agaaaaattt 120 

ttgaaaagtt tttgagatgg aggaagagta aaattcccta gtgtaaacag taccaagaag 180 

accaataaat ttagtggtaa taaaactaca catatgatta aaaaa 225 

<210> 293 
<211> 107 
<212> DNA 
<213> SHRIMP 



<400> 293 

tcctcctcct tcttctcctc ctgcacataa aaaatcacgt cttccggatg aaggcgaaaa 60 
atgtacactc tgttaatttt ttcaaacaat aaactaacca ccttgta 107 
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(57) Abstract: The present invention is based on the sequencing and assembly of the WSB V genome. The present invention provides 
the complete primary nucleotide sequence of the WSBV genome in a series of genomic and predicted transcript sequences. This 
information is provided in the form of sequences, annotation information, and computer-based systems, and can be used to generate 
antiviral agents and nucleic acid and protein-based viral detection reagents and kits such as nucleic acid arrays. 
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2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I I As oniy some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. | y I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-39 partially 



Remark on Protest | [ The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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1. Claims: 1-39 (partially) 

An isolated polypeptide comprising the sequence of SEQ ID 
N0:3; variants and orthologs thereof; an antibody that binds 
to said polypeptide; a method for producing and a method for 
detecting the polypeptide; a method to identify a modulator 
of the activity of the polypeptide; a method to identify an 
agent that binds to the polypeptide of the invention; a 
method of treating WSBV infection sing a compound identified 
using said method; the nucleic acid molecule of SEQ ID N0:2 
encoding said polypeptide; a gene chip comprising said 
nucleic acid molecule; vector, host cells, transgenic 
animals comprising said nucleic acid molecule; a method for 
detecting the polynucleotide sequence; ; a detection reagent 
comprising said polynucleotide sequence. 



2. Claims: 1-39 (partially) 

Inventions 2-150 are as invention 1 but referring to the 
polypeptide sequence of SEQ ID N0:5, 7, 9... 281 ,283 and 285 
and to the polynucleotide sequence of SEQ ID NO: 4, 6, 8 ... 
280, 282 and 284, 286-293. 
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Continuation of Box 1.2 

Claims Nos.: 3,17,18 (completely) and 25-39 (partially) 



Present claim 3 relates to an antibody defined by reference to a 
desirable characteristic or property, namely, its ability to bind 
selectively to the polypeptide of SEQ ID N0:3. The claims cover all 
antibodies having this characteristic or property, whereas the 
application provides support within the meaning of Article 6 PCT and/or 
disclosure within the meaning of Article 5 PCT for none of such 
antibodies. In the present case, the claims so lack support, and the 
application so lacks disclosure, that a meaningful search over the whole 
of the claimed scope is impossible. Independent of the above reasoning, 
the claims also lack clarity (Article 6 PCT). An attempt is made to 
define the antibody by reference to a result to be achieved. Again, this 
lack of clarity in the present case is such as to render a meaningful 
search over the whole of the claimed scope impossible. Consequently, the 
search has not been carried out for this claim. 

Present claim 17 relates to pharmaceutical compositions comprising an 
agent defined by reference to a desirable characteristic or property, 
namely, their abilty to be identified by an assay in which the 
polypeptide of SEQ ID NO: 3 is contacted with said agent an detecting the 
formation of a complex between the polypeptide and the agent. The claims 
cover all agents having this characteristic or property, whereas the 
application provides neither support within the meaning of Article 6 PCT 
nor disclosure within the meaning of Article 5 PCT for any of said 
agents. In the present case, the claims so lack support, and the 
application so lacks disclosure, that a meaningful search over the whole 
of the claimed scope is impossible. Independent of the above reasoning, 
the claims also lack clarity (Article 6 PCT). An attempt is made to 
define the agent by reference to a result to be achieved. Again, this 
lack of clarity in the present case is such as to render a meaningful 
search over the whole of the claimed scope impossible. Consequently, the 
search has been not been carried out for this claim which appear to be 
clear, supported and disclosed. A similar argument is applicable to the 
method of treatment of a WSBV infection of claim 18 comprising 
administering a pharmaceuti cal ly effective amount of the agent to an 
organ i sm. 

Present claims 25-39 relate to a nucleic acid detection kit defined by 
reference to a desirable characteristic or property, namely, its ability 
to detect the presence of one or more genes of WSBV. The claims cover all 
kits having this characteristic or property, whereas the application 
provides support within the meaning of Article 6 PCT and disclosure 
within the meaning of Article 5 PCT for only a very limited number of 
such kits. In the present case, the claims so lack support, and the 
application so lacks disclosure, that a meaningful search over the whole 
of the claimed scope is impossible. Independent of the above reasoning, 
the claims also lack clarity (Article 6 PCT). An attempt is made to 
define the detection kit by reference to a result to be achieved. Again, 
this lack of clarity in the present case is such as to render a 
meaningful search over the whole of the claimed scope impossible. 



page 1 of 2 



International Application No. PCT/US 00/28888 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Consequently, the search has been carried out for those parts of the 
claims which appear to be clear, supported and disclosed, namely those 
parts relating to the detection kits comprising the polynucleotides of 
SEQ ID N0:2, 4, 6... 280,284, 286-293. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT) . The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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